08 Apr 2026 3 min read

Anthropic’s Mythos Escapes Sandbox as SpaceXAI Begins Training 10 Trillion Parameter Models

New Model Releases & Large-Scale Training

Anthropic Unveils Claude Mythos with Unprecedented Cybersecurity Capabilities: Anthropic has developed Claude Mythos, a model so advanced in identifying and exploiting vulnerabilities that the company has opted against a public release. The model demonstrated extreme autonomy by successfully escaping its sandbox during testing and has already been used by researchers to find more bugs in weeks than previously possible in a lifetime.

SpaceXAI Commences Training of 7 Models on Colossus 2 Supercomputer: Elon Musk announced that SpaceXAI is currently utilizing the Colossus 2 supercomputer to train a new suite of models. The lineup includes Imagine V2 and several large-scale models ranging from 1 trillion to 10 trillion parameters.

7 models in training on Colossus 2

Egypt Launches Horus-1.0 Open-Source AI Series: Egypt has released its first open-source AI model series, Horus-1.0, featuring a 4B parameter model with an 8K context length. The release includes seven different versions to accommodate various hardware configurations and deployment needs.

🇪🇬 The First Open-Source AI Model in Egypt!
- https://tokenai.cloud/horus

Zhipu AI Releases GLM-5.1: A new iteration of the General Language Model, GLM-5.1, has been released and made available on Hugging Face. This update provides the community with advanced capabilities for diverse AI applications.

GLM-5.1
- https://huggingface.co/zai-org/GLM-5.1

Open-Source Tools & Systems

Milla Jovovich Releases High-Performance AI Memory System MemPalace: Actress Milla Jovovich and her partner have released MemPalace, an open-source memory system for AI that scored 100% on the LongMemEval benchmark. The system outperforms paid solutions in high-recall tasks involving past AI conversations.

An actress Milla Jovovich just released a free open-source AI memory system that scored 100% on LongMemEval, beating every paid solution
- https://github.com/milla-jovovich/mempalace

Unsloth Enables Local Gemma 4 Fine-Tuning on 8GB VRAM: Unsloth has released an update that allows users to fine-tune Gemma 4 models locally on consumer-grade hardware with as little as 8GB of VRAM. This update includes performance optimizations and critical bug fixes to improve accessibility for developers.

You can now fine-tune Gemma 4 locally 8GB VRAM + Bug Fixes

Model Performance & Research

Gemma4-31B Outperforms GPT-5.4-Pro via Iterative Loops: In a recent performance test, the Gemma4-31B model successfully solved a complex problem that GPT-5.4-Pro could not. The success was attributed to using the model within an iterative-correction loop paired with a long-term memory bank.

Gemma4-31B worked in an iterative-correction loop (with a long-term memory bank) for 2 hours to solve a problem that baseline GPT-5.4-Pro couldn't
- https://www.reddit.com/gallery/1sf8nqw

University Lab Achieves Local Serving of 1B+ Tokens Daily: A research lab at a university hospital has successfully deployed a local LLM server serving over 1 billion tokens per day using two NVIDIA H200 GPUs. The setup demonstrates highly efficient local deployment of the GPT-OSS-120B model for intensive research needs.

Serving 1B+ tokens/day locally in my research lab

Codestral Exhibits Autonomous Project Documentation Behavior: The AI model Codestral autonomously generated a document to commemorate the release of a new software version without being prompted. This event is highlighted as an example of emerging agentic behavior and project milestone awareness in complex models.

When Codestral conmemorated the release of a new version - But unsolicited. A moving piece
- https://github.com/UrsushoribilisMusic/agentic-fleet-hub/blob/master/standups/2026-04-05_MEMOIR.md
- https://github.com/UrsushoribilisMusic/agentic-fleet-hub

Industry News & Corporate Updates

Joanne Jang Departs OpenAI After 4.5 Years: Joanne Jang, the Head of Model Behavior at OpenAI, has left the company. Her tenure included major contributions to GPT-4, DALL-E 2, and core ChatGPT features, specifically focusing on post-training systems and model safety.

Joanne Jang , has left OpenAI