LWiAI Podcast #243 - GPT 5.5, DeepSeek V4, AI safety sabotage
Our 243rd episode with a summary and discussion of last week’s big AI news!
AI/ML news, top picks, and generated innovation digests.
8789 matching items
Our 243rd episode with a summary and discussion of last week’s big AI news!
Pinecone Nexus is a knowledge engine for the agentic AI era, moving reasoning from retrieval to compilation — with KnowQL as the standard query language for agents.
Distributed vector search at 10B scale, more efficient storage with Lance format v2.2, and production AV pipelines simplified, plus upcoming events and community updates.
We built a Deep Search Slack agent for large companies. Here is what we learned about user experience, enterprise security, and Redis-backed rate limiting.
If we've said it once, we've said it once per millisecond: never block the GPU.
OpenAI took $10B from a 19-firm Wall Street consortium. Anthropic is closing $1.5B from Blackstone, Goldman, and Hellman & Friedman. Same rooms, different portfolios. The week AI's go-to-market stopped being SaaS and started being private equity.
Context as infra, taste as config, verification for autonomy, scale via delegation, closing the loop.
CSET’s Helen Toner shared her expert insight in an article published by the Associated Press. The article examines the Pentagon’s agreements with seven major tech companies to integrate artificial intelligence (AI) into classified military systems, expanding AI use in decision-making, logistics, and battlefield operations. The post US military reaches deals with 7 tech companies to use their AI on classified systems appeared first on Center for Security and Emerging Technology .
CSET’s Lauren Kahn shared her expert insight in an article published by DefenseScoop. The article explores the Pentagon’s growing efforts to integrate advanced artificial intelligence (AI) capabilities into classified military operations and the broader implications of expanding AI adoption across the Department of Defense. The post DOD expands its classified AI work with 8 companies — excluding Anthropic — amid ongoing dispute appeared first on Center for Security and Emerging Technology .
CSET’s Kathleen Curlee and former U.S. Air Force pilot Brian Golden shared their expert analysis in an op-ed published by Newsweek. The article discusses the growing importance of space infrastructure to modern life and argues that increased international coordination is needed to ensure the security and stability of the space domain. The post Space Is Critical Infrastructure—It Needs an Alliance To Guard It appeared first on Center for Security and Emerging Technology .
New data shows consumer AI app growth has flatlined as generative AI struggles tofind its true form outside of the enterprise.
Kubernetes v1.36 introduces Pod-Level Resource Managers as an alpha feature, bringing a more flexible and powerful resource management model to performance-sensitive workloads. This enhancement extends the kubelet's Topology, CPU, and Memory Managers to support pod-level resource specifications ( .spec.resources ), evolving them from a strictly per-container allocation model to a pod-centric one. Why do we need pod-level resource managers? When running performance-critical workloads such as machine learning (ML) training, high-frequency trading applications, or low-latency databases, you often need exclusive, NUMA-aligned resources for your primary application containers to ensure predictable performance. However, modern Kubernetes pods rarely consist of just one container. They frequently include sidecar containers for logging, monitoring, service meshes, or data ingestion. Before this feature, this created a trade-off, to get NUMA-aligned, exclusive resources for your main application, you had to allocate exclusive, integer-based CPU resources to every container in the pod. This might be wasteful for lightweight sidecars. If you didn't do this, you forfeited the pod's Guaranteed Quality of Service (QoS) class entirely, losing the performance benefits. Introducing pod-level resource managers Enabling pod-level resources support for the resource managers (via the PodLevelResourceManagers and PodLevelResources feature gates) allows the kubelet to create hybrid resource alloca…
Turning OpenAPI spec and Markdown files into a conversational ads management tool — no compiled code required. The post Building a Natural Language Interface to the Spotify Ads API with Claude Code Plugins appeared first on Spotify Engineering .
James Story, former U.S. ambassador to Venezuela, visits Carnegie Council to discuss the new dynamic between American power and principle.
Here's where you can subscribe to the Access Now Express newsletter and action alerts. The post Stay informed: Get Access Now updates appeared first on Access Now .
Interim CEO Peter Clark shares his thoughts on this moment for Ai2, our commitment to open science, and where the institute is headed next.
In this episode, Philip Kiely, head of AI education at Baseten, joins us to unpack the fast-evolving discipline of inference engineering. We explore why inference has become the stickiest and most critical workload in AI, how it blends GPU programming, applied research, and large-scale distributed systems, and where the line sits between inference and model serving. Philip shares how research-to-production can move in hours, not months, and why understanding “the knobs” of inference—batching, quantization, speculation, and KV cache reuse—lets teams design better products and SLAs. We trace the inference maturity journey from closed APIs to dedicated deployments and in-house platforms, discuss GPU lifecycles, and survey today’s runtime landscape, including vLLM, SGLang, and TensorRT LLM. Finally, we look ahead to agents and multimodality, making the case for specialized, workload-specific runtimes when performance and efficiency matter most. The complete show notes for this episode can be found at https://twimlai.com/go/766.
Washington, D.C. (April 30, 2026) — This morning, Andrew Lohn, Senior Fellow at Georgetown University’s Center for Security and Emerging Technology (CSET), testified before the U.S.-China Economic and Security Review Commission. The post CSET Senior Fellow Andrew Lohn Testifies Before U.S.-China Economic and Security Review Commission appeared first on Center for Security and Emerging Technology .
My tool stack is changing
Starting today, agents can now be Cloudflare customers. They can create a Cloudflare account, start a paid subscription, register a domain, and get back an API token to deploy code right away. Humans can be in the loop to grant permission, but there’s no need to go to the dashboard, copy and paste API tokens, or enter credit card details.
AlgorithmWatch has put forward recommendations on how to implement a ban of deepfakes in the AI Act as part of the AI Omnibus procedure. To effectively protect victims of digital sexualized violence, AI companies, platforms, and perpetrators must consistently be held accountable.
AstaBench’s latest update adds new frontier-model results, including GPT-5.5, and highlights growing adoption from groups including the UK AISI, General Reasoning, Elicit, SciSpace, Distyl AI, and EvoScientist.
ChatGPT’s new Images 2.0 model is surprisingly good at generating text , Alibaba Drops Qwen 3.6 Max Preview , SpaceX is working with Cursor
How LanceDB applies distributed indexing, distributed query execution, HNSW centroid routing, and fast RaBitQ rotation to scale search to 10B vectors and beyond.
CSET’s Ali Crawford shared her expert insight in an article published by CNBC. The article examines how the 2026 graduating class is entering a tightening job market where AI skills are increasingly in demand across internships and entry-level roles, while education and workforce systems struggle to keep pace with rapidly evolving employer expectations. The post Entry-level jobs calling for AI skills nearly doubled from a year ago, says report appeared first on Center for Security and Emerging Technology .
The post Responsible AI Starts with the Data Supply Chain appeared first on Partnership on AI .
New Oxford research shows that training chatbots to sound warmer makes them up to 30% less accurate, and 40% more likely to validate users' false beliefs.
MolmoPoint and MolmoWeb extend the Molmo family from visual understanding to visual action, giving researchers open tools for models that can point, navigate, and interact with the world they see.
When Ali Aoun Mehdi watched two major global news outlets report opposite facts during the Iran-US conflict, he saw a critical problem: misinformation spreading in real-time. From Islamabad, participating virtually in the Gen AI Zürich Hackathon, he built Sentinel to close this “fact-gap” by detecting contradictions across news sources instantly. Sentinel is an AI-powered early warning system that monitors 20 global news sources every 30 minutes. It identifies factual contradictions in under 30 seconds, providing users with a misinformation risk score and narrative traction assessment.
Learn how AE Studio used evolutionary algorithms on Modal to efficiently improve Lean proof generation.
Each year sees an increase in the number of governments imposing internet shutdowns during national school exams. It's time to take action and demand change! #NoExamShutdown The post Tell governments: #NoExamShutdown appeared first on Access Now .
The post Work smarter in 90 days: A real-world guide to using AI appeared first on Source .
How we redesigned blob storage in Lance to make multimodal data a first-class citizen, with four storage semantics (Inline, Packed, Dedicated, External) that automatically adapt to your workload.
El vínculo entre la desinformación y la privacidad La desinformación impulsada por internet y el uso de las redes sociales supone un desafío que muchos hacedores de políticas están abordando The post Mi Privacidad es mi Voto: Fortaleciendo la integridad informativa en América Latina appeared first on Access Now .
GPT-5.5 is a good model
Hassan Ashtiani, Professor, McMaster University | Vector Institute Faculty Member You want to share your medical data to help advance research, but you don’t want others to access your private […] The post Hassan Ashtiani: Building trustworthy AI through mathematical foundations appeared first on Vector Institute for Artificial Intelligence .
How Lance can serve as the foundation for AI on single-cell genomics atlases and a new generation for modeling in biology.
You’re probably going to hear a lot about the personality conflict. Here’s what’s at stake in the case and what the outcome might be.
Workflows is now in public preview.
Read our translation of a May 2025 press conference featuring several Chinese government finance officials, who discussed a recently issued policy encouraging greater capital market funding for tech companies. The post Expand Financing Support to Science and Technology Enterprises appeared first on Center for Security and Emerging Technology .
Read our translation of a series of Chinese policy measures designed to provide greater financing to tech startups. The post Certain Policy Measures to Accelerate the Construction of the Science and Technology Finance System and Strongly Support a High Level of Self-Reliance in Science and Technology appeared first on Center for Security and Emerging Technology .
OpenAI's latest foundational model sets the company up for a series of models optimized for computer use. The company's co-founder and president explains the strategy.
Whether Europe shapes the next technological paradigm or simply inherits it depends on one overlooked distinction.
One impressive step on the curve
If you live in a gated community, you’ve been there: You request a ride from your apartment complex, expect your driver to come to you as usual, and then — your driver’s car icon just stops right at the front gate. You watch helplessly as the ETA ticks up. A chat message comes in: “Hey, how do I get in?” You scramble to remember the gate code. They try it. It doesn’t work. You end up meeting them awkwardly on the sidewalk outside while your coffee gets cold — a pickup journey frustrating for both you and your driver. An example gated community in real life, Photo by Bingqian Li on Pexels It turns out you’re not alone: Gated community pickups can make up 25–30% of Lyft rides in selected markets. For a long time, our app offered no special guidance in these situations. Riders would drop their pin inside the gates (fair enough — that’s where they are ), while drivers would pull up to a locked entrance with no way in, leaving both parties to sort things out over chat. The result was predictable: more cancellations, longer waits, and a lot of unnecessary stress for our customers . The Lyft Mapping team decided it was time to fix this properly — not with a band-aid, but a new end-to-end experience. Here’s how we did it. What Was Actually Going Wrong? We looked through gated ride examples, zoomed into our metrics data, and found two root causes behind most of the friction. The first was an inflexible selection of pickup spots . Our app would suggest pickup spots near a rider’s loca…
The post African Communities are Leading the Responsible AI Conversation at RightsCon appeared first on Partnership on AI .
A new AI Now Institute report published April 21, 2026, warns that gig-work platforms marketed as "Uber for nursing" are aggressively lobbying states to rewrite healthcare staffing rules, a push that could leave nurses with less pay, fewer protections, and less control over their shifts, according to The Guardian. The post Nurses Sound Alarm as ‘Uber for Nursing’ Apps Push to Deregulate Healthcare appeared first on AI Now Institute .
Vector researchers made significant contributions to this year’s International Conference on Learning Representations (ICLR), the world’s premier venue for representation learning and deep learning research, taking place April 23-27, 2026 […] The post Vector researchers advance representation learning and deep learning research at ICLR 2026 appeared first on Vector Institute for Artificial Intelligence .