Software & Models
5 articles covering this beat
DeepSeek V3.2: How a Chinese Lab Matched Frontier Performance Under Export Controls
DeepSeek's V3.2, with 685 billion parameters and 37 billion active per token, achieves gold-medal performance at the IMO and matches GPT-5 on key benchmarks, all trained on export-restricted hardware. Its FP8 training framework and MoE innovations suggest that chip restrictions can spur innovation rather than prevent it. And V4, optimized for Huawei Ascend, signals something bigger.
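For readers new to the technique the article covers: low-precision training hinges on scaling a tensor into FP8's narrow representable range before casting, then carrying the scale alongside the 8-bit values. Below is a minimal sketch of per-tensor E4M3 scaling in PyTorch (2.1+). The function names are illustrative assumptions, not DeepSeek's actual framework, which wraps this idea into full mixed-precision matmuls.

```python
# Minimal sketch of per-tensor FP8 (E4M3) scaled quantization.
# Illustrative only -- not DeepSeek's training framework.
# Requires PyTorch 2.1+ for the torch.float8_e4m3fn dtype.
import torch

E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3

def quantize_fp8(t: torch.Tensor):
    # Pick a scale so the tensor's largest magnitude maps near E4M3_MAX,
    # using as much of the tiny FP8 dynamic range as possible.
    scale = t.abs().max().clamp(min=1e-12) / E4M3_MAX
    q = (t / scale).clamp(-E4M3_MAX, E4M3_MAX)  # fit the FP8 range
    q = q.to(torch.float8_e4m3fn)               # actual 8-bit storage
    return q, scale

def dequantize_fp8(q: torch.Tensor, scale: torch.Tensor):
    # Recover approximate full-precision values from the 8-bit tensor.
    return q.to(torch.float32) * scale

w = torch.randn(4, 4) * 3.0
q, s = quantize_fp8(w)
w_hat = dequantize_fp8(q, s)
print((w - w_hat).abs().max())  # small quantization error
```

Storing weights and activations this way halves memory and bandwidth relative to BF16, which is exactly the lever that matters when the fastest accelerators are off-limits.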
The MoE Revolution: How Mixture-of-Experts Became the Dominant Frontier Architecture
Every major frontier model released in the past year uses Mixture-of-Experts. DeepSeek V3.2: 685B parameters, 37B active. Llama 4 Behemoth: 2 trillion total, 288B active. Gemini, Mixtral, and reportedly GPT-4 — all MoE. NVIDIA says Blackwell runs MoE 10x faster at 1/10th the token cost. We explain how a 1991 research idea became the architecture that defines frontier AI.
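To make the "total vs. active" distinction concrete, here is a minimal sketch of top-k expert routing in PyTorch. All names and sizes (ToyMoE, 8 experts, top-2) are toy assumptions for illustration, not any frontier model's implementation; production MoE adds load balancing, capacity limits, and expert parallelism.

```python
# Minimal sketch of top-k Mixture-of-Experts routing (illustrative only).
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyMoE(nn.Module):
    def __init__(self, d_model=64, d_ff=256, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts)  # scores each expert per token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):  # x: (tokens, d_model)
        scores = self.router(x)                         # (tokens, n_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)  # keep top-k experts per token
        weights = F.softmax(weights, dim=-1)            # normalize over the chosen k
        out = torch.zeros_like(x)
        # Only k of n experts run per token, so "active" params << total params.
        for k in range(self.top_k):
            for e in range(len(self.experts)):
                mask = idx[:, k] == e
                if mask.any():
                    out[mask] += weights[mask, k:k+1] * self.experts[e](x[mask])
        return out

moe = ToyMoE()
y = moe(torch.randn(10, 64))  # each token touches 2 of 8 experts
```

Because each token activates only k of n experts, parameter count grows with n while per-token compute grows only with k, which is how a 685B-parameter model can do roughly 37B parameters' worth of work per token.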
Meta's Inference Fleet Transformation: 35% Custom Silicon by Year-End
Meta is executing the most aggressive custom-silicon transition in tech history. MTIA v2 is deployed across 16 data center regions. MTIA v3 (Iris) entered broad deployment in February 2026. The target: 35% of inference running on custom chips by year-end, with a 44% TCO reduction vs. GPUs.
The Weekly Briefing
Every Sunday. The most important supply-chain developments of the week, with analysis of what they mean for the AI ecosystem. Free.