Tag: LLM

MiniMax M2.7: China’s $40 Billion AI Flop

March 23, 2026

MiniMax, a $40 billion Chinese AI startup, just dropped its most powerful model yet. M2.7 claims to be one of the best models in the world, but it flopped in my testing… Round #1: How Bad Is the Damage from the Iran War? This morning on Bloomberg, I heard that…

+ Business, Technology

+ AI, Artificial Intelligence, Books, China, Finance, Food, Health, LLM, Open source, Stock Market, Stocks, Travel, Venture Capital, Writing

A SOTA Model On Your Phone? — Testing Qwen 3.5

March 3, 2026

What if you could run a SOTA model on your phone for free? Alibaba claims Qwen 3.5 is it. Qwen 3.5 is a new, small model that can run on device. This means you don’t have to pay for inference. Qwen isn’t the most powerful model out there. But for…

+ Business, Technology

+ AI, Alibaba, Artificial Intelligence, ChatGPT, Developers, Finance, Investing, LLM, Tech, Technology

Gemini 3.1 Pro Disappoints with Slow Outputs, Weak Sourcing

March 2, 2026

Google recently dropped Gemini 3.1 Pro. Google says it’s the most powerful model ever, but it flopped in my testing. This morning I ran 3.1 Pro through a three-round test. I asked it real questions I need the answer to. Let me show you where this model falls short… Round…

+ Business, Technology

+ Technology, Stocks, Tech, Stock Market, Silicon Valley, AI, Artificial Intelligence, Google, ChatGPT, GOOG, LLM, Grok, gemini

Grok 4.20 Beta Trounces Gemini in Head-to-Head Test

February 23, 2026

Gemini ruled the roost until Elon dropped Grok 4.20 Beta. This morning, I tested them head-to-head. Grok clobbered Gemini. Here’s why Grok now reigns supreme… Pushing Grok and Gemini to Their Limits I gave them both the same prompt, asking them to analyze the likelihood of war with Iran. I…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, Elon, Elon Musk, gemini, Google, Grok, Iran, Israel, LLM, Middle East, Silicon Valley, Startups, Technology, Trump

Elon Drops Grok 4.20 Beta, The Best Model Yet

February 20, 2026

Elon just dropped Grok 4.20 Beta. This model blows away anything else I’ve ever used. From speed of search to depth of reasoning, the new Grok shines. Let me show you what this thing can do… Round #1: Energy and the AI Build-Out Yesterday, I met with an interesting startup…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, Elon, Elon Musk, Grok, Iran, LLM, Silicon Valley, Startups, Tech, Technology, Venture Capital

MiniMax’s New M2.5 Model Flops in Testing

February 18, 2026

Chinese startup MiniMax just dropped M2.5, its most powerful model yet. But M2.5 flopped in my testing, scoring a C+. That’s only slightly better than MiniMax’s prior model, which I gave a C last month. Let me show you where this model works and where it fails… Round #1: Automating…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, China, LLM, MiniMax, Open source, Technology, Venture Capital, Writing

Kimi K2.5 Is Slow and Stupid

February 9, 2026

Moonshot AI recently dropped Kimi K2.5, its most powerful model ever. But in my testing, K2.5 failed miserably. When I reviewed Kimi K2 in November, it was a real threat to ChatGPT and Grok. But K2.5 feels like a major downgrade. Across a range of queries, K2.5 delivered useless results.…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, China, DeepSeek, Investing, Kimi, LLM, Open source, Startups, Technology, Venture Capital

Making Impossible Videos With the New Grok Imagine 1.0

February 3, 2026

Elon just dropped Grok Imagine 1.0, xAI’s best video model ever. It’s faster, clearer, and better at making your ideas come alive. This morning, I put it through a three-round test. It did a great job overall, while still struggling to follow my prompts at times. Let’s see what…

+ Business, Technology

+ AI, AI Video, Artificial Intelligence, ChatGPT, Elon, Elon Musk, Grok, LLM, Silicon Valley, Startups, Technology, xAI

MiniMax: China’s $7 Billion AI Model Flops in Testing

January 7, 2026

Chinese AI startup MiniMax is going public this week at a $7 billion valuation. But its model flopped in my testing. I ran MiniMax through three tests with real-world questions. These are harder to game than benchmarks. Let me show you where MiniMax does well and where it struggles…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, China, Finance, Investing, Large Language Model, LLM, Startups, Stock Market, Tech, Technology, Venture Capital

China’s Zhipu AI Is About to IPO, But Its Product Is Weak

January 6, 2026

China is about to launch its first AI model IPO, Zhipu AI. Investors may be excited, but Zhipu’s product is weak. China’s first IPO of an LLM startup, Zhipu AI, will start trading Thursday. The IPO is expected to value Zhipu at nearly US$7 billion. This morning, I ran…

+ Business, Technology

+ AI, Artificial Intelligence, ChatGPT, China, Finance, Investing, IPO, LLM, Tech, Technology, Venture Capital
