Canva Magic Layers Turns Any Flat Image Into an Editable Design
Upload a PNG, get back separated layers with live editable text. Canva's Magic Layers decomposes flat images into full design projects — available now on Free, Pro, and Enterprise.
No results found
31 articles
Upload a PNG, get back separated layers with live editable text. Canva's Magic Layers decomposes flat images into full design projects — available now on Free, Pro, and Enterprise.
Perplexity launches Personal Computer — an always-on AI agent running on a dedicated Mac mini. $200/month for Max subscribers, with 10,000 monthly credits and remote control from any device.
Nvidia's Nemotron 3 Super is a 120B hybrid Mamba-Transformer MoE model that activates only 12B parameters per token. Open weights, 1M context, 5x faster than its predecessor.
Cloudflare's new /crawl endpoint turns any website into Claude-ready markdown with one API call. Here's how to set it up, from API tokens to RAG pipelines.
Paper launches its desktop app with MCP server support, turning a web-based design tool into something AI agents can actually manipulate. Free tier includes 100 MCP tool calls per week.
Google rolls out deep Gemini integration across Workspace — Docs drafts from your Drive history, Sheets that build themselves from a description, Slides that match your deck's theme, and cross-app search in Drive.
Nvidia unveils NemoClaw ahead of GTC 2026 — an open-source framework for deploying autonomous AI agents that can plan, reason, and execute multi-step tasks inside corporate systems. Built on Nemotron models and NIM.
Anthropic launches Code Review for Claude Code — a multi-agent system that analyzes pull requests for bugs, logic errors, and security vulnerabilities. Detects issues in 84% of large PRs with under 1% false positives.
Google launches Gemini Embedding 2 — the first multimodal embedding model in the Gemini API, mapping text, images, video, audio, and documents into a single vector space. Available now in public preview.
Andrej Karpathy open-sources autoresearch — a 630-line repo where an AI agent autonomously runs LLM training experiments, modifies its own code, and accumulates improvements overnight on a single GPU.
Apple releases Pico-Banana-400K, a 400,000-image open dataset for text-guided image editing — built from real photos using Google's Nano Banana, scored by Gemini, and available on GitHub under CC BY-NC-ND 4.0.
Qualcomm's first post-Arduino-acquisition board pairs a 40 TOPS Dragonwing IQ8 NPU with an STM32HS microcontroller for real-time actuation — under $300, shipping Q2 2026.
Luma AI releases Uni-1, a unified model that merges visual understanding and image generation in one decoder-only transformer — outperforming Nano Banana 2 and GPT Image 1.5 on key benchmarks.
Microsoft brings Anthropic's Claude Cowork technology into M365 Copilot, launches Agent 365 at $15/user, and bundles everything into a $99 E7 Frontier Suite — while SaaS stocks reel from a $285 billion selloff.
Anthropic launches Claude Marketplace, an enterprise store for third-party AI software from Snowflake, Harvey, Replit, and others — with zero commission and the option to pay using committed API spend.
Helios is a 14B-parameter open-weight video generation model that hits 19.5 FPS on a single H100 GPU, generates minute-long videos, and supports text, image, and video inputs — all under Apache 2.0.
OpenAI launches Codex Security in research preview — an autonomous agent that discovers, validates, and patches code vulnerabilities. It already found 14 CVEs in OpenSSH, GnuTLS, and Chromium.
OpenAI releases GPT-5.4 in standard, Thinking, and Pro variants — its most capable model yet, with native computer-use mode, 1 million token context, 33% fewer errors, and direct Excel and Google Sheets integration.
Google open-sources gws, a Rust-built CLI that gives developers and AI agents unified access to every Workspace API — Drive, Gmail, Calendar, Sheets, Docs, and more — from a single command line.
Google's NotebookLM launches Cinematic Video Overviews — a feature that transforms uploaded documents into bespoke, animated video summaries using a combination of Gemini 3, Nano Banana Pro, and Veo 3.
Google rolls out Canvas in AI Mode to all US users — turning Search into a creative workspace where you can write documents, generate code, and build interactive apps directly from a search query.
OpenAI open-sources Symphony, a system that monitors project boards like Linear, spawns coding agents to implement tasks, and submits pull requests with CI verification and video walkthroughs.
Apple announces MacBook Pro with M5 Pro and M5 Max — built on a new Fusion Architecture that combines two 3nm dies into one SoC, delivering up to 4x AI performance and the world's fastest CPU core.
Google releases Gemini 3.1 Flash Lite — a model built for speed and cost efficiency. It's 2.5x faster than Gemini 2.5 Flash, costs a fraction of Pro, and still handles 1M tokens of context.
OpenAI rolls out GPT-5.3 Instant — a major update to ChatGPT's default model that addresses user complaints about overly cautious, preachy responses while improving factual accuracy and web search integration.
Google releases Nano Banana 2 — technically Gemini 3.1 Flash Image — bringing 4K resolution, character consistency, and real-time web grounding to AI image generation across the Gemini app, Google Search, and the developer API.
Google DeepMind launches Gemini 3.1 Pro with a new three-tier thinking system, 1M token context window, and benchmark scores that leave the competition behind — including a 77.1% on ARC-AGI-2.
Google DeepMind releases Lyria 3, its most advanced AI music model yet. Available inside the Gemini app, it generates 30-second tracks with vocals, lyrics, and instruments from a simple text prompt.
Alibaba's Qwen team releases Qwen 3.5 — a 397-billion-parameter mixture-of-experts model under Apache 2.0 that activates only 17B parameters per token, supports 201 languages, and claims competitive performance against GPT-5.2 and Claude Opus 4.5.
ByteDance launches Seedance 2.0, an AI video generator that produces cinematic clips with synchronized audio, multi-shot sequences, and realistic physics — all from a single text prompt.
Kuaishou launches Kling 3.0 with native 4K at 60fps, multi-shot storyboarding with up to 6 camera cuts, character consistency across generations, and synchronized audio output.