Tag: LLM
-
This morning, I tested OpenAI’s new GPT-5.1. It still falls behind the best models from xAI, Google, and Kimi. When I tested GPT-5 in August it performed poorly, notching a C-. It gave me outdated data and struggled to cite sources. OpenAI claims that GPT-5.1 is better at reasoning and…
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
-
Kimi just dropped Kimi 2 Thinking. I tested it this morning. It’s as good as Grok 4, the best model I’ve ever used. In July, I ran the prior Kimi through testing. It scored a B+, solid but below Grok 4. This time, Kimi performed far better. Let me show…
+
+
+
+
+
+
+ AI, Artificial Intelligence, ChatGPT, China, Kimi, LLM, Open source, Startups, Technology, Venture Capital, Writing+
+
+
+
+
+
+
+
+
+
+
+
-
+
+
+
+
+
+
For the first time ever, I’m using voice dictation regularly. Wispr Flow has nailed it where every other app has failed. Let me show you what this thing can do… I first tried voice dictation in the late 1990’s. For its time, it was incredible. But it made so many…
+
+
+
+
+
+
+ AI, Apps, Artificial In, Artificial Intelligence, ChatGPT, Dictation, LLM, Product, productivity, Software, Startups, Tech, Technology, Venture Capital, Voice, Writing+
+
+
+
+
+
+
+
+
+
+
+
-
+
+
+
+
+
+
Grokipedia is already better than Wikipedia. Last night, Elon dropped Grokipedia. This morning, I tested it against Wikipedia. Grokipedia won 2 out of 3 rounds. For each round, I looked up an article on a topic I’m familiar with. Let’s see how these two encyclopedias compare… Round 1: Home Sweet…
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
-
+
+
+
+
+
+
Can DeepSeek beat the best American models? When I tested DeepSeek in June, the outputs were garbage. This morning, I gave it a re-test… Like in June, I asked DeepSeek three real questions I need the answer to. Last time, it scored a B-. Let’s see if DeepSeek’s recent updates…
+
+
+
+
+
+
+ AI, Artificial Intelligence, ChatGPT, DeepSeek, Finance, Grok, Investing, Large Language Model, LLM, Money, Startups, Tech, Technology, Venture Capital, xAI+
+
+
+
+
+
+
+
+
+
+
+
-
+
+
+
+
+
+
Ever ask AI a question and get an answer from a random Reddit thread? Sick of bad answers, I tested the Encyclopedia Brittanica Chatbot. Brittanica Chatbot uses articles from the Encyclopedia Brittanica to answer your question. These articles are written by experts in their fields. This morning, I ran it…
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
-
+
+
+
+
+
+
GPT-5 sucks. I tested it Friday, bad. I tested it again today, even worse. Stick a fork in it. Let me show you how flawed this model is. When I tested it on Friday, OpenAI was having routing issues that may have caused GPT-5 to underperform. So this morning, I…
+
+
+
+
+
+
+ AI, Artificial Intelligence, ChatGPT, Investing, LLM, OpenAI, Silicon Valley, Startups, Tech, Technology, Venture Capital+
+
+
+
+
+
+
+
+
+
+
+
-
+
+
+
+
+
+
I just watched the first AI movie, Red Panda: Firefox of the Clouds. It was fantastic! And it makes me wonder what’s next… Red Panda is a full hour long film, made entirely with AI. Every element, from the visuals to the music, is synthetic. I spoke with the creator…
-
+
+
+
+
+
+
Open source dominates server environments and databases. But will open source win AI? This morning, I investigated which models are most popular among developers… America’s Next Top Model There’s no one place to go to definitively determine which models are most popular among developers. But one strong indicator comes from…
+
+
+
+
+
+
+ AI, Artificial Intelligence, ChatGPT, DeepSeek, Entrepreneur, LLM, Silicon Valley, Startups, Tech, Technology+
+
+
+
+
+
+
+
+
+
+
+