Kubi's Blog

I Made 24 Best LLMs Engineer Their Own Game Agents. Here's Who Can Actually Code Strategy.

February 22, 2026

Instead of asking LLMs what move to make, I asked them to write an agent that plays hundreds of matches on their own. Here's how the framework works, what it reveals about each model's strategic and coding capabilities, and where to track the live results.

I made 16 LLMs play a game of deception. Here's who won (and who forgot their own name)

May 27, 2025

I tested 16 leading LLMs, such as models from GPT, Claude, DeepSeek and Llama, against each other in 100 games of social deception. Taking on roles like secret Vampires, Peasants, or the unpredictable Clown in a custom Town of Salem, they displayed a mix of clever strategy and amusing mistakes.

Pub/Sub & Queues Explained with Python, Go, and Redis

March 29, 2025

A Dockerized messaging system demonstrating Pub/Sub and Queue patterns using Redis, with a Python publisher and a Go subscriber

Analyzing Reddit Threads with Language and Vision Models

February 10, 2025

Analyze Reddit threads with LLM-powered summaries, insights, and customizable tones in a user-friendly Streamlit app.