08 Jun 2026 1 min read

Gemini 3.1 Pro Trails Claude Opus 4.7 as llama.cpp Merges Gemma4 MTP Support

Model Benchmarking & Analysis

Gemini 3.1 Pro Real-World Performance Analysis: New comparative data suggests that Gemini 3.1 Pro's utility in real-life applications trails behind Claude Opus 4.7, despite its presence in popular benchmarks. Accuracy charts across various models show CLIP-ViT-B/32 leading the field with 61.4%, while many other vision and language models cluster between 50% and 60% accuracy.

Artificial Analysis | Google's Go To Website for Benchmaxxing | Gemini 3.1 Pro is nowhere near Opus 4.7 in real life use

Qwen 3.6 27B Evaluated on DeepSWE Benchmark: The Qwen 3.6 27B model achieved a score of 2% on the DeepSWE benchmark, ranking 18th out of 20 tested models and outperforming Haiku 4.5. The benchmark took 70 hours to complete, highlighting the model's capabilities and the high computational requirements of software engineering tasks.

Qwen 3.6 27B on DeepSWE
- https://xcancel.com/Youssofal_/status/2063672976982069413

Open-Source & Local LLM Developments

llama.cpp Merges MTP Support for Gemma4: The llama.cpp repository has officially merged Multi-Turn Prompting (MTP) support for Gemma4, enabling the combined use of Quantization-Aware Training (QAT) and MTP. This update significantly enhances processing efficiency, with users reporting performance reaching up to 140 tokens per second on consumer-grade hardware.

llama.cpp Gemma4 MTP support merged!
- https://github.com/ggml-org/llama.cpp/pull/23398

Natural Language Interface for 3D Avatars: A new system utilizing a neural program compiler called 'programasweights' allows users to control 3D avatars through plain English descriptions. This technology translates natural language into executable action sequences, offering an efficient and locally runnable solution for gaming and interactive environments.

Control a 3D avatar with language instead of buttons

AI Tools & Productivity

Codex Skill for Automated Branded Word Documents: A newly released open-source Codex skill enables the generation of Word documents while strictly adhering to company-specific brand templates. The tool extracts and preserves layouts, styles, and imagery from existing documents to ensure consistent and autonomous branded content creation.

Codex Skill to generate Word documents based on your brand templates
- https://github.com/ferdinandobons/brand-docs