AI

59 articles

Grok 4.3 Beta: $300/Month, No Benchmarks, No Blog Post

xAI quietly drops Grok 4.3 with native video input, document generation, and a 2M token context window — but only for SuperGrok Heavy subscribers at $300/month. No model card, no benchmarks, no announcement.

Apr 18, 2026

AI Software

Claude Opus 4.7: 87.6% SWE-bench and a Hidden Cost Increase

Anthropic's Opus 4.7 jumps to 87.6% on SWE-bench Verified, adds 3.3x vision resolution and a new xhigh effort level — but an updated tokenizer means the same input now costs up to 35% more.

Apr 17, 2026

AI Hardware

Huawei Ascend 950PR: China's 1.56 Petaflop Answer to Nvidia

Huawei's new AI chip delivers 2.8x the FP4 performance of Nvidia's H20 with 112GB of in-house HBM. ByteDance has already committed $5.6 billion in orders.

Apr 10, 2026

Alibaba's HappyHorse Topped the Video Leaderboard Before Anyone Knew Who Built It

An anonymous 15B-parameter video model appeared on Artificial Analysis, took #1 in text-to-video with 1389 Elo, and then turned out to be Alibaba's. HappyHorse-1.0 generates video and audio in a single pass.

Apr 10, 2026

Meta Muse Spark: The Proprietary Pivot

Meta Superintelligence Labs ships its first model — a small, fast, closed-source reasoner that's 10x more compute-efficient than Llama 4 Maverick. Three days after releasing open-weight Llama 4, Meta went proprietary.

Apr 9, 2026

Meta Llama 4: Two MoE Models, 10M Token Context

Llama 4 Scout runs 109B total parameters across 16 experts with a 10 million token context window. Maverick scales to 400B across 128 experts. Both are open-weight, natively multimodal, and Meta's first MoE release.

Apr 6, 2026

AI Software

Microsoft MAI-Transcribe-1 and MAI-Voice-1: Fast, Cheap, In-House

Microsoft's first in-house speech models: MAI-Transcribe-1 beats Whisper on all 25 languages at half the GPU cost, and MAI-Voice-1 generates 60 seconds of expressive audio in under one second.

Apr 3, 2026

Google Gemma 4: Open-Weight AI That Punches Way Up

Four model sizes, Apache 2.0, natively multimodal from 2B to 31B — and the 26B MoE variant scores 88.3% on AIME 2026 with only 3.8B active parameters.

Apr 3, 2026

AI Software

ElevenLabs Gives IBM's AI Agents a Voice — in 70 Languages

IBM watsonx Orchestrate integrates ElevenLabs TTS and STT, bringing 10,000+ voices across 70 languages to enterprise AI agents — with PCI compliance and HIPAA-ready data handling built in.

Mar 29, 2026

AI Software

Shopify Just Put 5.6 Million Stores Inside ChatGPT

Shopify flipped Agentic Storefronts on by default for every eligible merchant. Your products now surface in ChatGPT, Google AI Mode, and Microsoft Copilot — no app install, no setup, no opt-in required.

Mar 29, 2026

AI Software

Cursor Composer 2 Is Fast, Cheap, and Built on Kimi K2.5

Cursor's new coding agent costs one-tenth the price of frontier models and scores 61.7% on Terminal-Bench. Then developers found the model ID — and it pointed straight to Moonshot AI's Kimi K2.5.

Mar 23, 2026

Microsoft MAI-Image-2 Hits #3 on Arena.ai

Microsoft's second-gen image model jumps from top-10 to #3 on the Arena.ai leaderboard. Photorealism, in-image text, and rich scene generation — rolling out to Copilot and Bing now.

Mar 19, 2026

MiniMax M2.7 Helped Build Itself

MiniMax's M2.7 participated in its own training loop — running 100+ autonomous iteration rounds on its own scaffold, achieving a 30% performance boost. SWE-Pro hits 56.22%.

Mar 18, 2026

Xiaomi MiMo-V2: Three Models That See, Code, and Sing

Xiaomi drops three frontier models at once — an omni-modal agent, a 1T-parameter coding model that topped OpenRouter under a fake name, and a TTS model that can sing. All at a fraction of Claude's price.

Mar 18, 2026

AI Software

Google Stitch Is Now a Full AI Design Canvas

Stitch graduates from Google Labs with a complete redesign: infinite canvas, voice design agent, instant prototypes, and DESIGN.md — an agent-friendly file for sharing design systems.

Mar 18, 2026

AI Software

Claude Cowork Can Now Take Tasks from Your Phone

Cowork's new Dispatch feature lets you send Claude a task from your phone and pick up the result on your desktop. One persistent thread, full access to your local files.

Mar 18, 2026

AI Software

Gemini API: Tool Combos, Context Circulation, Maps

Google's Gemini API now lets developers mix function calling with built-in tools like Search and Maps in a single call. Context circulation and spend caps also ship.

Mar 18, 2026

Baidu Is Turning Its Smart Devices Into OpenClaw Agents

Baidu is integrating OpenClaw's agentic AI framework into Xiaodu smart speakers, moving toward multi-step voice-controlled task execution as its core business erodes.

Mar 17, 2026

World's AgentKit Puts a Human Check on AI Shopping

World's AgentKit lets AI agents carry cryptographic proof of the human behind them, using World ID and zero-knowledge proofs. Built for agentic commerce at scale.

Mar 17, 2026

Nvidia's Nemotron Coalition: Cursor, Perplexity, Mistral

Nvidia is forming an 8-lab coalition to build open frontier models on DGX Cloud. Mira Murati's Thinking Machines Lab is a founding member.

Mar 17, 2026

Alibaba Wukong Brings AI Agents to Enterprise Work

Alibaba's Wukong platform coordinates multiple AI agents for complex business workflows. Available in closed beta via DingTalk and a desktop app, powered by Qwen.

Mar 17, 2026

Google Personal Intelligence Is Now Free for All US Users

Google's Gemini feature that connects to your Gmail, Photos, Calendar, and Drive is dropping the paywall for all US users. Five prompts a day, 32K context.

Mar 17, 2026

OpenAI GPT-5.4 Mini and Nano: Built for Agents

GPT-5.4 mini hits 54.4% on SWE-Bench at $0.75/M tokens. Nano goes cheaper still. Both unlock agentic tool calling at the small-model tier.

Mar 17, 2026

AI Hardware

Nvidia Vera Rubin and Groq 3: The GTC 2026 Hardware Blitz

Seven chips in production, a purpose-built agentic CPU, 256-LPU inference racks, and $1 trillion in orders. Nvidia's GTC 2026 keynote rewrites the datacenter playbook.

Mar 16, 2026

Z.ai Ships GLM-5-Turbo for Agentic AI Workloads

Zhipu AI's closed-source GLM-5-Turbo targets the OpenClaw agent ecosystem with 744B parameters, 200K context, and pricing that undercuts Claude by 5x. Shares jumped 16%.

Mar 16, 2026

AI Software

Microsoft Copilot Health Turns Your Body Data Into AI Advice

Microsoft launches Copilot Health, connecting wearables, health records from 50,000+ providers, and lab results into a single AI-powered health companion — with a medical superintelligence roadmap behind it.

Mar 13, 2026

Claude Can Now Build Charts and Diagrams in Conversation

Anthropic ships inline visualizations for Claude — interactive charts, diagrams, and visual explanations rendered directly in chat using HTML and SVG. Available to all users, including free tier.

Mar 12, 2026

AI Software

Google Ask Maps Lets You Talk to Google Maps With Gemini

Google Maps gets its biggest upgrade in a decade: Ask Maps turns location search into a Gemini-powered conversation, while Immersive Navigation rebuilds turn-by-turn directions in 3D.

Mar 12, 2026

AI Software

Canva Magic Layers Turns Any Flat Image Into an Editable Design

Upload a PNG, get back separated layers with live editable text. Canva's Magic Layers decomposes flat images into full design projects — available now on Free, Pro, and Enterprise.

Mar 11, 2026

Perplexity's Personal Computer Is a Mac Mini That Never Sleeps

Perplexity launches Personal Computer — an always-on AI agent running on a dedicated Mac mini. $200/month for Max subscribers, with 10,000 monthly credits and remote control from any device.

Mar 11, 2026

Nvidia Nemotron 3 Super: 120B Parameters, 12B Active, 1M Context

Nvidia's Nemotron 3 Super is a 120B hybrid Mamba-Transformer MoE model that activates only 12B parameters per token. Open weights, 1M context, 5x faster than its predecessor.

Mar 11, 2026

How To AI

How to Feed an Entire Website to Claude Using Cloudflare's /crawl API

Cloudflare's new /crawl endpoint turns any website into Claude-ready markdown with one API call. Here's how to set it up, from API tokens to RAG pipelines.

Mar 11, 2026

AI Software

Paper Desktop Lets AI Agents Design Alongside You

Paper launches its desktop app with MCP server support, turning a web-based design tool into something AI agents can actually manipulate. Free tier includes 100 MCP tool calls per week.

Mar 11, 2026

AI Software

Gemini Can Now Build Your Docs, Sheets, and Slides From Scratch

Google rolls out deep Gemini integration across Workspace — Docs drafts from your Drive history, Sheets that build themselves from a description, Slides that match your deck's theme, and cross-app search in Drive.

Mar 10, 2026

Nvidia NemoClaw: An Open-Source Platform for Enterprise AI Agents

Nvidia unveils NemoClaw ahead of GTC 2026 — an open-source framework for deploying autonomous AI agents that can plan, reason, and execute multi-step tasks inside corporate systems. Built on Nemotron models and NIM.

Mar 10, 2026

Anthropic Code Review Sends Multiple AI Agents After Your Pull Requests

Anthropic launches Code Review for Claude Code — a multi-agent system that analyzes pull requests for bugs, logic errors, and security vulnerabilities. Detects issues in 84% of large PRs with under 1% false positives.

Mar 10, 2026

Gemini Embedding 2: One Model for Text, Images, Video, Audio, and PDFs

Google launches Gemini Embedding 2 — the first multimodal embedding model in the Gemini API, mapping text, images, video, audio, and documents into a single vector space. Available now in public preview.

Mar 10, 2026

Karpathy's Autoresearch: 630 Lines That Let AI Optimize Its Own Training

Andrej Karpathy open-sources autoresearch — a 630-line repo where an AI agent autonomously runs LLM training experiments, modifies its own code, and accumulates improvements overnight on a single GPU.

Mar 9, 2026

Pico-Banana-400K: Apple's Open Dataset for Image Editing

Apple releases Pico-Banana-400K, a 400,000-image open dataset for text-guided image editing — built from real photos using Google's Nano Banana, scored by Gemini, and available on GitHub under CC BY-NC-ND 4.0.

Mar 9, 2026

AI Hardware

Arduino Ventuno Q: Dual-Brain Hardware for Real-Time AI

Qualcomm's first post-Arduino-acquisition board pairs a 40 TOPS Dragonwing IQ8 NPU with an STM32HS microcontroller for real-time actuation — under $300, shipping Q2 2026.

Mar 9, 2026

Luma's Uni-1: Understanding and Generation, Together

Luma AI releases Uni-1, a unified model that merges visual understanding and image generation in one decoder-only transformer — outperforming Nano Banana 2 and GPT Image 1.5 on key benchmarks.

Mar 9, 2026

AI Software

Copilot Cowork: Microsoft's $285 Billion Question

Microsoft brings Anthropic's Claude Cowork technology into M365 Copilot, launches Agent 365 at $15/user, and bundles everything into a $99 E7 Frontier Suite — while SaaS stocks reel from a $285 billion selloff.

Mar 9, 2026

AI Software

Claude Marketplace: Anthropic's Zero-Fee AI Store

Anthropic launches Claude Marketplace, an enterprise store for third-party AI software from Snowflake, Harvey, Replit, and others — with zero commission and the option to pay using committed API spend.

Mar 7, 2026

Helios: 14B Video Model That Runs in Real Time

Helios is a 14B-parameter open-weight video generation model that hits 19.5 FPS on a single H100 GPU, generates minute-long videos, and supports text, image, and video inputs — all under Apache 2.0.

Mar 7, 2026

AI Software

Codex Security: OpenAI's AI Agent Hunts Bugs

OpenAI launches Codex Security in research preview — an autonomous agent that discovers, validates, and patches code vulnerabilities. It already found 14 CVEs in OpenSSH, GnuTLS, and Chromium.

Mar 7, 2026

AI Software

OpenAI GPT-5.4: Computer Use and 1M Tokens

OpenAI releases GPT-5.4 in standard, Thinking, and Pro variants — its most capable model yet, with native computer-use mode, 1 million token context, 33% fewer errors, and direct Excel and Google Sheets integration.

Mar 5, 2026

AI Software

Google Workspace CLI: Built for AI Agents

Google open-sources gws, a Rust-built CLI that gives developers and AI agents unified access to every Workspace API — Drive, Gmail, Calendar, Sheets, Docs, and more — from a single command line.

Mar 5, 2026

AI Software

NotebookLM Can Now Turn Your Notes Into Cinematic Videos

Google's NotebookLM launches Cinematic Video Overviews — a feature that transforms uploaded documents into bespoke, animated video summaries using a combination of Gemini 3, Nano Banana Pro, and Veo 3.

Mar 4, 2026

AI Software

Canvas in AI Mode: Search That Creates

Google rolls out Canvas in AI Mode to all US users — turning Search into a creative workspace where you can write documents, generate code, and build interactive apps directly from a search query.

Mar 4, 2026

AI Software

OpenAI Symphony: Task Boards to Autonomous Code

OpenAI open-sources Symphony, a system that monitors project boards like Linear, spawns coding agents to implement tasks, and submits pull requests with CI verification and video walkthroughs.

Mar 4, 2026

AI Hardware

MacBook Pro M5 Pro and M5 Max: 4x AI Boost

Apple announces MacBook Pro with M5 Pro and M5 Max — built on a new Fusion Architecture that combines two 3nm dies into one SoC, delivering up to 4x AI performance and the world's fastest CPU core.

Mar 3, 2026

AI Software

Gemini 3.1 Flash Lite: Fast, Cheap, Capable

Google releases Gemini 3.1 Flash Lite — a model built for speed and cost efficiency. It's 2.5x faster than Gemini 2.5 Flash, costs a fraction of Pro, and still handles 1M tokens of context.

Mar 3, 2026

AI Software

GPT-5.3 Instant: Less Cringe, Smarter ChatGPT

OpenAI rolls out GPT-5.3 Instant — a major update to ChatGPT's default model that addresses user complaints about overly cautious, preachy responses while improving factual accuracy and web search integration.

Mar 3, 2026

AI Software

Nano Banana 2: 4K AI Images Go Mainstream

Google releases Nano Banana 2 — technically Gemini 3.1 Flash Image — bringing 4K resolution, character consistency, and real-time web grounding to AI image generation across the Gemini app, Google Search, and the developer API.

Feb 26, 2026

AI Software

Gemini 3.1 Pro: A Leap in AI Reasoning

Google DeepMind launches Gemini 3.1 Pro with a new three-tier thinking system, 1M token context window, and benchmark scores that leave the competition behind — including a 77.1% on ARC-AGI-2.

Feb 19, 2026

AI Software

Lyria 3: AI Music Generation in Gemini

Google DeepMind releases Lyria 3, its most advanced AI music model yet. Available inside the Gemini app, it generates 30-second tracks with vocals, lyrics, and instruments from a simple text prompt.

Feb 18, 2026

AI Software

Qwen 3.5: Open-Source 397B That Overdelivers

Alibaba's Qwen team releases Qwen 3.5 — a 397-billion-parameter mixture-of-experts model under Apache 2.0 that activates only 17B parameters per token, supports 201 languages, and claims competitive performance against GPT-5.2 and Claude Opus 4.5.

Feb 16, 2026

AI Software

Seedance 2.0: AI Video With Built-In Audio

ByteDance launches Seedance 2.0, an AI video generator that produces cinematic clips with synchronized audio, multi-shot sequences, and realistic physics — all from a single text prompt.

Feb 10, 2026

AI Software

Kling 3.0: Native 4K Video and Multi-Shot AI

Kuaishou launches Kling 3.0 with native 4K at 60fps, multi-shot storyboarding with up to 6 camera cuts, character consistency across generations, and synchronized audio output.

Feb 5, 2026