Tag: LLM

Olmo 3: Train Your Own AI Model for Less

December 19, 2025

The Allen Institute for AI recently released its most powerful model ever: Olmo 3. Olmo is cheap to train, making it perfect for anyone training their own model. Olmo 3 is 2.5 times more efficient to train than Llama 3.1 based on GPU-hours per token. Olmo is also much more…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, LLM, Model Training, Open source, OSS, Tech, Technology, Venture Capital

Gemini Beats Grok and GPT 5.2 In a Head-to-Head Test

December 16, 2025

I tested the top models from Grok, Gemini, and ChatGPT head to head this morning. Gemini won, showing incredible power at research and sourcing. The last month has seen some amazing releases from the top AI labs. xAI released Grok 4.1 Thinking, Google released Gemini 3.0 Pro, and OpenAI dropped…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, Elon, Elon Musk, Entrepreneur, gemini, Google, Grok, LLM, OpenAI, Silicon Valley, Startups, Technology, xAI

GPT 5.2 Still Loses to Grok and Gemini

December 15, 2025

I ran OpenAI’s new GPT 5.2 through a three-round test. It still scores below Grok and Gemini. Some of GPT-5.2’s responses are excellent. But the quality of its answers is inconsistent. Let me show you where this model excels and where it falls short… Round #1: Learning About Needle…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, gemini, Google, Grok, LLM, OpenAI, Silicon Valley, Startups, Tech, Technology, Venture Capital, xAI

Grok 4.1 Thinking Beats Gemini 3.0 Pro in Real World Test

November 19, 2025

Gemini 3.0 Pro dominates AI benchmarks. But in my real-world testing, Grok 4.1 Thinking comes out on top. Google’s new model is ranked #1 in LMArena. It scores off the charts in a variety of AI benchmarks, like Humanity’s Last Exam. But the best way to test a model…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, Elon, Elon Musk, gemini, GOOG, Google, Grok, Large Language Model, LLM, Technology, xAI

Grok 4.1: Elon Drops the World’s Best Model

November 18, 2025

Elon just dropped Grok 4.1. I tested it this morning. This is the best model I’ve ever used. xAI claims 4.1 has fewer hallucinations than prior models. It ranks number one on LMArena, ahead of Gemini, Claude and ChatGPT. Let’s see what this thing can do! Round #1: Are Consumers…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, Elon, Elon Musk, Grok, LLM, Startups, Tech, Technology, Venture Capital, xAI

OpenAI Behind Competitors Despite GPT 5.1 Release

November 14, 2025

This morning, I tested OpenAI’s new GPT-5.1. It still falls behind the best models from xAI, Google, and Kimi. When I tested GPT-5 in August, it performed poorly, notching a C-. It gave me outdated data and struggled to cite sources. OpenAI claims that GPT-5.1 is better at reasoning and…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, LLM, OpenAI, Sam Altman, Startups, Technology

Kimi 2 Thinking — A Real Threat to ChatGPT and Grok

November 7, 2025

Kimi just dropped Kimi 2 Thinking. I tested it this morning. It’s as good as Grok 4, the best model I’ve ever used. In July, I ran the prior Kimi through testing. It scored a B+, solid but below Grok 4. This time, Kimi performed far better. Let me show…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, China, Kimi, LLM, Open source, Startups, Technology, Venture Capital, Writing

Testing Zuck’s $70 Billion Model

November 4, 2025

Zuck is investing $70 billion in AI this year. Is it finally paying off? My testing says no. When I tested Meta’s Llama model in May, it scored poorly, earning a C+. Since then, Zuckerberg created the Superintelligence team, headed by Alexandr Wang. Meta has poached top researchers with pay…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, Facebook, Investing, LLM, Meta, Startups, Technology, Venture Capital, Zuck, Zuckerberg

Wispr Flow: Voice Dictation That Finally Works

October 31, 2025

For the first time ever, I’m using voice dictation regularly. Wispr Flow has nailed it where every other app has failed. Let me show you what this thing can do… I first tried voice dictation in the late 1990s. For its time, it was incredible. But it made so many…

+ Business, Technology

+ AI, Apps, Artificial Intelligence, ChatGPT, Dictation, LLM, Product, productivity, Software, Startups, Tech, Technology, Venture Capital, Voice, Writing

Grokipedia Is Already Better Than Wikipedia

October 28, 2025

Grokipedia is already better than Wikipedia. Last night, Elon dropped Grokipedia. This morning, I tested it against Wikipedia. Grokipedia won 2 out of 3 rounds. For each round, I looked up an article on a topic I’m familiar with. Let’s see how these two encyclopedias compare… Round 1: Home Sweet…

+ Technology

+ AI, Artificial Intelligence, Encyclopedia, Grok, Grokipedia, LLM, Wikipedia, xAI
