AI/ML News & Innovations Hub

AI Weekly 2026-06-22 00:00 UTC Score 19.0 AI-133-20260622-newsletters-653b9580 Full article

AI Weekly Issue #506: Washington Blocked One AI Lab. China Blacklisted 56 Companies.

Ten days after Washington pulled Anthropic's top models from foreign hands, the bill came due. This week Beijing blacklisted 56 American firms, Anthropic's own filing admitted the trigger was a routine coding request rival models can run, and Microsoft's CEO warned that letting "a few models eat everything" won't survive politically. The export war just stopped being one-directional — here's the week that made it mutual.

Anthropic Microsoft

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 10.0 AI-079-20260622-research-pap-e1cffe42 Full article

A Few Bad Apples Spoil the Bunch: Preventing Global Entropy Collapse Driven by a Small Set of Tokens in LLM Reasoning

Jaeeun Jang, Hansle Lee and Sangmin Kim in Findings of the Association for Computational Linguistics: ACL 2026

Large Language Models

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 7.0 AI-079-20260622-research-pap-59f549b3 Full article

A Dynamic Self-Evolving Extraction System

Moin Aminnaseri, Hannah Kim and Estevam Hruschka in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 12.0 AI-079-20260622-research-pap-1a259028 Full article

A Dual-View Analysis of Multiple Languages in Colonial Newspapers

Zhan Su, Xiaoya Chen, Fengran Mo, Ida L. Vos, Prayag Tiwari, Yazhou Zhang, Qian Zheng and Natália da Silva Perez in Findings of the Association for Computational Linguistics: ACL 2026

AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 18.0 AI-079-20260622-research-pap-9b4107c7 Full article

A Dual-Task Paradigm to Investigate Sentence Comprehension Strategies in Language Models

Rei Emura and Saku Sugawara in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 21.0 AI-079-20260622-research-pap-775c8c59 Full article

A Dual-Phase Self-Evolution Framework for Large Language Models

Haoran Sun, Zekun Zhang and Shaoning Zeng in Findings of the Association for Computational Linguistics: ACL 2026

Large Language Models

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 10.0 AI-079-20260622-research-pap-e193a0cc Full article

A Diversity Diet for a Healthier Model: A Case Study of French ModernBERT

Louis Estève, Christophe Servan, Thomas Lavergne and Agata Savary in Findings of the Association for Computational Linguistics: ACL 2026

AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 21.0 AI-079-20260622-research-pap-eb3c4d7e Full article

A Data-Efficient Path to Multilingual LLMs: Language Expansion via Post-training PARAM𝛥 Integration into Upcycled MoE

Hao Zhou, Tianhao Li, Zhijun Wang, Shuaijie She, Linjuan Wu, Hao-Ran Wei, Baosong Yang, Jiajun Chen and Shujian Huang in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Large Language Models AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 12.0 AI-079-20260622-research-pap-632b038e Full article

A Data-Centric Approach to Generalizable Speech Deepfake Detection

Wen Huang, Yuchen Mao and Yanmin Qian in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 21.0 AI-079-20260622-research-pap-6d907212 Full article

A Counterfactual Explanation Framework for Retrieval Models

Bhavik Chandna and Procheta Sen in Findings of the Association for Computational Linguistics: ACL 2026

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 10.0 AI-079-20260622-research-pap-159a3158 Full article

A Computational Method for Measuring Open Codes in Qualitative Analysis

John Chen, Alexandros Nikolaos Lotsos, Sihan Cheng, Lexie Zhao, Yanjia Zhang, Jessica Hullman, Bruce Sherin, Uri Wilensky and Michael Horn in Findings of the Association for Computational Linguistics: ACL 2026

Large Language Models

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 18.0 AI-079-20260622-research-pap-3495d9c6 Full article

A Comprehensive Survey of Process Reward Models: Data Generation, Model Construction, and Usage

Congmin Zheng, Jiachen Zhu, Zhuoying Ou, Yuxiang Chen, Kangning Zhang, Rong Shan, Zeyu Zheng, Mengyue Yang, Jianghao Lin, Yong Yu and Weinan Zhang in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 15.0 AI-079-20260622-research-pap-829cc7db Full article

A BERTology View of LLM Orchestrations: Token- and Layer-Selective Probes for Efficient Single-Pass Classification

Gonzalo Ariel Meyoyan and Luciano Del Corro in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Large Language Models AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 13.0 AI-079-20260622-research-pap-72be9913 Full article

1,729 vs. 1729: The Effect of Scripts and Formats on LLM Numeracy

Varshini Reddy, Craig W Schmidt, Seth Ebner, Adam Wiemerslage, Yuval Pinter and Chris Tanner in Findings of the Association for Computational Linguistics: ACL 2026

Large Language Models

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 7.0 AI-079-20260622-research-pap-4518cb9c Full article

♪ Something Just Like TRuST ♪ *: Toxicity Recognition of Span and Target

Berk Atıl, Namrata Sureddy and Rebecca J. Passonneau in Findings of the Association for Computational Linguistics: ACL 2026

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 13.0 AI-079-20260622-research-pap-15893841 Full article

[b] = [d] - [t] + [p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic

Kwanghee Choi, Eunjung Yeo, Cheol Jun Cho, David Harwath and David R. Mortensen in Findings of the Association for Computational Linguistics: ACL 2026

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 21.0 AI-079-20260622-research-pap-57038641 Full article

“I See What You Did There”: Can Large Vision-Language Models Understand Multimodal Puns?

Naen Xu, Jiayi Sheng, Changjiang Li, Chunyi Zhou, Yuyuan Li, Tianyu Du, Jun Wang, Zhihui Fu, Jinbao Li and Shouling Ji in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Multimodal AI AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 24.0 AI-079-20260622-research-pap-59f8ba1e Full article

"Penny Wise, Pixel Foolish": Bypassing Price Constraints in Multimodal Agents via Visual Adversarial Perturbations

Jiachen Qian and Zhaolu Kang in Findings of the Association for Computational Linguistics: ACL 2026

Multimodal AI

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 20.0 AI-079-20260622-research-pap-41b65d0d Full article

"Newspaper Eat" Means "Not Tasty": A Taxonomy and Benchmark for Coded Language in Real-World Chinese Online Reviews

Ruyuan Wan, Changye Li and Ting-Hao Kenneth Huang in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

AI Research & Papers

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 10.0 AI-079-20260622-research-pap-1ba6a47d Full article

"I Don’t Know What to Say": A Fact-Filling Questionnaire Method to Help Non-Experts Talk to LegalAI Assistant

Yuting Huang, Yiquan Wu, Meitong Guo, Ang Li, Xiaozhong Liu, Keting Yin, Fei Wu and Kun Kuang in Findings of the Association for Computational Linguistics: ACL 2026

Read →

ACL Anthology 2026-06-22 00:00 UTC Score 18.0 AI-079-20260622-research-pap-36d896cf Full article

"Excuse me, may I say something..." CoLabScience, A Proactive AI Assistant for Biomedical Discovery and LLM-Expert Collaborations

Yang Wu, Jinhong Yu, Jingwei Xiong, Zhimin Tao and Xiaozhong Liu in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Large Language Models AI Research & Papers

Read →

OpenAI News 2026-06-22 00:00 UTC Score 34.0 AI-044-20260622-official-ai--5edb7201

Codex-maxxing for long-running work

Learn how Jason Liu uses Codex to preserve context, manage complex projects, and help work continue beyond a single prompt.

Read →

Simon Willison Weblog 2026-06-21 23:35 UTC Score 42.0 USR-0110-20260621-ai-specialis-1ac2c3c3 Full article

sqlite-utils 4.0rc1 adds migrations and nested transactions

sqlite-utils is my combined Python library and CLI tool for working with SQLite databases. It provides an extensive set of higher-level operations on top of Python's default sqlite3 package , including support for complex table transformations , automatic table creation from JSON data and a whole lot more. I released sqlite-utils 4.0rc1 , the first release candidate for sqlite-utils v4. The major version bump indicates some (minor) backwards incompatible changes, so I'm interested in having people try this out before I commit to a stable release. New feature: migrations There are two significant new features in this RC compared to the previous 4.0 alphas. The first is support for database migrations . This isn't a completely new implementation - it's a slightly modified port of the sqlite-migrate package I released a few years ago. I think that package has proved itself over time, so I'm now ready to bundle it with sqlite-utils directly. Here's what a set of migrations in a migrations.py file looks like: from sqlite_utils import Database , Migrations migrations = Migrations ( "creatures" ) @ migrations () def create_table ( db ): db [ "creatures" ]. create ( { "id" : int , "name" : str , "species" : str }, pk = "id" , ) @ migrations () def add_weight ( db ): db [ "creatures" ]. add_column ( "weight" , float ) This defines a set of two migrations, one creating the creatures table and another adding a column to it. You can then run those migrations either using Python: db = Da…

Model Releases

Read →

Simon Willison Weblog 2026-06-21 23:30 UTC Score 30.0 USR-0110-20260621-ai-specialis-75611f2e Full article

sqlite-utils 4.0rc1

Release: sqlite-utils 4.0rc1 See sqlite-utils 4.0rc1 adds migrations and nested transactions . Tags: sqlite-utils

Read →

OpenAI News 2026-06-21 23:00 UTC Score 42.0 AI-044-20260621-official-ai--663d8726

Samsung Electronics brings ChatGPT and Codex to employees

Samsung Electronics deploys ChatGPT Enterprise and Codex to employees worldwide, marking one of OpenAI’s largest enterprise AI rollouts.

OpenAI GPT / ChatGPT

Read →

Simon Willison Weblog 2026-06-21 22:01 UTC Score 46.0 USR-0110-20260621-ai-specialis-93e5f67a Full article

Temporary Cloudflare Accounts for AI agents

Temporary Cloudflare Accounts for AI agents The announcement says this is "for AI agents" but (as is pretty common these days) the AI hook isn't really necessary, this is an interesting feature for everyone else as well. Short version: you can now create a Cloudflare Workers project and run this, without even creating a Cloudflare account: npx wrangler deploy --temporary Cloudflare will deploy the application to a new, ephemeral project which will stay live for 60 minutes. I had GPT-5.5 xhigh in Codex Desktop build this test application providing a tool for following HTTP redirects and returning the final destination. The temporary deployment worked as advertised. Running the deployment spits out the URL to a page for claiming the new project, for if you want it to last for more than 60 minutes. Here's what that claim screen looks like: Via Hacker News Tags: cloudflare

GPT / ChatGPT AI Agents

Read →

The Neuron 2026-06-21 17:30 UTC Score 39.0 AI-127-20260621-newsletters-d1fd3b87 Full article

😼 DeepMind mapped AI agent controls

PLUS: Claude robotics, Dean Ball at OpenAI, DeepSeek's raise, and sovereign models.

OpenAI Google DeepMind Claude AI Agents Robotics

Read →

Latent Space Podcast 2026-06-21 15:01 UTC Score 18.0 AI-142-20260621-podcasts-and-3ed6b512 Full article

[Exclusive] $250 off AI Engineer tix til Monday

special offer for subscribers - $250 off AI Engineer tix til Monday

Read →

Africa AI News 2026-06-21 13:54 UTC Score 16.0 AI-150-20260621-regional-ai--87551f8a Full article

Burkina Faso adopts national AI roadmap under Relance Plan

2026-2030 'AI for all' roadmap targets health and farming, schools

Read →

ChinaTalk AI 2026-06-21 11:35 UTC Score 25.0 USR-0206-20260621-global-ai-ne-85767697 Full article

Transmission Dominance with Chinese Characteristics

Comparing the US and China Transmission Buildout

Read →

Africa AI News 2026-06-21 11:27 UTC Score 16.0 AI-150-20260621-regional-ai--f9959041 Full article

Morocco signs deal for new YouCode Errachidia coding school

YouCode school targets talented school dropouts, aiming for 100 trainees a year

Read →

Politico Europe AI 2026-06-21 11:24 UTC Score 15.0 AI-170-20260621-regional-ai--f1fe81bf

The political education of Mark Carney

On the world stage, Canada's prime minister is a statesman. In Ottawa, he is a ward boss.

Read →

Exponential View 2026-06-21 03:12 UTC Score 28.0 USR-0108-20260621-ai-specialis-6616041c Full article

🔮 Product eats the AI company; the bitter lesson prevails; Fable 5 as CEO, undersea diplomacy & jellyfish sleep++ #579

Hi all,

Read →

Eugene Yan Blog 2026-06-21 00:00 UTC Score 30.0 USR-0114-20260621-ai-specialis-1a96649e Full article

Patterns for Building Cybersecurity Evals

A sandboxed target, inputs that influence task difficulty, tools, and a grader.

AI Chips & Hardware

Read →

AI Alignment Forum 2026-06-20 20:05 UTC Score 38.0 USR-0151-20260620-community-fo-c0bc42f0 Full article

How transparent is DiffusionGemma (and why it matters)

Authors: Joshua Engels*, Callum McDougall*, Bilal Chughtai*, Janos Kramar, Senthoran Rajamanoharan, Cindy Wu, Arthur Conmy, Asic Q Chen, Jean Tarbouriech, Min Ma, Brendan O'Donoghue+, João Gabriel Lopes de Oliveira+, Rohin Shah+, Neel Nanda+ *Primary Contributor +Advising Paper here: https://arxiv.org/abs/2606.20560 Overview In a recent collaboration between the GDM interpretability team and the GDM text diffusion team, we performed a transparency audit of DiffusionGemma, GDM's new text diffusion model. Overall, we find that DiffusionGemma is not significantly less transparent than Gemma. Gemma and DiffusionGemma perform similarly on monitorability evaluations . Although naively DiffusionGemma has a much larger opaque serial depth , we can apply the logit lens to intermediate vectors and ablate non-interpretable information without harming performance. This implies that these intermediate nodes are interpretable, which reduces the opaque serial depth to be similar to that of Gemma. However, even though the variables that the model uses at different steps are interpretable, this does not necessarily mean that we understand the algorithm that the model uses to reach the final answer. We thus distinguish between variable transparency, which we define as whether we can understand snapshots of the model's computation, and algorithmic transparency, which we define as whether we can use these snapshots to reconstruct the process by which the model arrived at its outputs. By default…

AI Ethics AI Chips & Hardware AI Research & Papers

Read →

Politico Europe AI 2026-06-20 13:51 UTC Score 18.0 AI-170-20260620-regional-ai--a5ee1290

Aviation officials in US turn to AI for combating runway issues

The aviation agency has partnered with tech company Palantir. “The long-term goal is to get to predictive,” said one official.

Read →

IEEE Spectrum Machine Learning 2026-06-20 13:00 UTC Score 46.0 AI-020-20260620-global-ai-ne-4ca1963c

War Taught this Ukrainian Entrepreneur the Value of Resilience

Salome Mikadze-Struk is no stranger to adversity. The daughter of refugees, she built a software-development business as an undergraduate at the height of the COVID-19 pandemic and kept it running despite the outbreak of war in her native Ukraine . Now, she’s drawing on her experiences to mentor tech-startup founders and speak publicly about the importance of resilience in entrepreneurship . Mikadze-Struk was studying at Georgetown University, in Washington, D.C., when COVID-19 struck. Classes went online, and she moved back to Ukraine. In the midst of that disruption she saw an opportunity to develop her business idea, called Movadex , by tapping Ukraine’s pool of talented young engineers. Then Russia invaded in early 2022, during her final semester. Taking online classes from bomb shelters and helping employees evacuate to safer parts of the country was surreal, she says, but the team kept the company afloat and she graduated later that year. In 2023, Mikadze-Struk took a hiatus from her business to pursue an MBA at Stanford University, which she completed this year. In her precious spare time she’s been advising startups and giving talks, using her unique perspective to promote the need for resilience in entrepreneurship—something she thinks is increasingly important in the software industry as AI coding tools upend old business models. “You need to be okay with risk, you need to be resilient. You need to be okay with disruption and okay with uncertainty,” she says, “beca…

AI Funding & Startups AI Research & Papers

Read →

Cross Validated 2026-06-20 11:47 UTC Score 15.0 AI-113-20260620-social-media-1d9bdf39

what is the intuitive interpretation when maximizing an arima likelihood

it's pretty well known that there are issues with initialization and dealing with non-stationarity of resulting estimates etc. But, for this question, one can assume that all these things are taken care of in the algorithm. So, my question is : can the resulting parameter estimates be thought of as those that minimize the one step ahead forecast error of the respective arima model ? Mathematically, any algorithm is maximizing a likelihood, but that likelihood is a function of the estimated residuals so it still feels like one is capturing the best one step ahead forecast ? Thanks for any insights or references ? This question came to my mind recently when I realized that I'm not really interested in the one step ahead forecast.

Read →

ChinaTalk AI 2026-06-20 11:26 UTC Score 23.0 USR-0206-20260620-global-ai-ne-df8ecdb5 Full article

Yes, Transformers Are a Problem...

But So Are Converters, Inverters, and MMCs!

Read →

Politico Europe AI 2026-06-20 10:53 UTC Score 18.0 AI-170-20260620-regional-ai--b30e815a

The political fight behind Washington’s export controls on Anthropic

In an administration that rewards political deference, Anthropic’s CEO has repeatedly clashed with the White House.

Anthropic

Read →

Politico Europe AI 2026-06-20 08:55 UTC Score 16.0 AI-170-20260620-regional-ai--a472f7cd

The secret drone operation that’s changing the Ukraine war

Inside Kyiv’s secret drone program that’s hitting targets deep inside Russia.

Read →

Latent Space Podcast 2026-06-20 08:06 UTC Score 16.0 AI-142-20260620-podcasts-and-6b7473bd Full article

[AINews] not much happened today

a quiet day lets us promo AIE one last time

Read →

Netflix Tech Blog 2026-06-19 23:54 UTC Score 30.0 USR-0049-20260619-ai-specialis-7fec1134

The Data Canary: How Netflix Validates Catalog Metadata

By Celina Amados At Netflix, our catalog metadata is crucial to our member experience, and a single corrupted data state can impact millions of viewers immediately. To protect streaming reliability, we built an automated data canary system that validates data transformations using production traffic. This canary detects issues in under 10 minutes, and blocks bad data from reaching our members. Intro Catalog metadata is what makes Netflix functional. It defines what titles exist, where they’re available, whether they can be played, and more. This data gets transformed and distributed across our vast infrastructure near-continuously, powering everything that helps members find what they want to watch. Accurate catalog data delivers moments of joy. Corrupted catalog data breaks streaming. What Went Wrong A production incident revealed a critical gap in our resilience strategy. No code had been deployed. No configuration had changed. But, a manual mitigation action taken during a previous incident had inadvertently corrupted a data feed, rendering it empty for a subset of titles. The impact was immediate: missing metadata prevented manifest generation, causing failures in our catalog service and playback issues. Engineers were alerted immediately, but identifying the root cause took time. After intense triaging, responders pinpointed the corrupted data feed and pinned services back to a known-good state, restoring playback. The problem? Our sophisticated code canary deployments…

Read →

Netflix Tech Blog 2026-06-19 23:54 UTC Score 43.0 USR-0049-20260619-ai-specialis-03ed7b57

Data Projects: Managing Data Assets at Netflix Scale

By Amer Hesson , Marcelo Mayworm , James Mulcahy , and Brittany Truong The Problem: Managing Assets at Netflix Scale Netflix’s Data Platform is vast. We have millions of tables in our data warehouse and tens of thousands of scheduled workloads running across our orchestration systems. Behind each of these assets sits an engineer, a team, or an initiative — and behind each of those sits a set of decisions about who can access what , and how those workloads execute day after day. For years, the tools we used to manage access and identity for these assets operated at the granularity of the individual asset. Every table had its own Access Control List (ACL). Every workflow ran under the identity of the engineer who authored it. In a workforce that is fluid, where people change teams, change roles, and occasionally leave the company, this fine-grained model broke down in two persistent, painful ways. Problem 1: Permissions that can’t keep up with organizational changes Imagine you’re on a team that owns a few hundred tables. Your org restructures, a neighboring team merges into yours, and you inherit another few hundred. Now you have to find every ACL on every table, figure out who should still have access, and update them one by one. Multiply that by every reorg across every team across the company. The result? Two failure modes: The support team gets flooded. A significant and outsized share of support threads were requests to update table permissions en masse in response to or…

Read →

Netflix Tech Blog 2026-06-19 23:53 UTC Score 35.0 USR-0049-20260619-ai-specialis-204afe5e

Predicting Risk in Content Launches: How Data-Driven Insights can Transform Launch Planning

by Emily Gill Each year, we bring the Analytics Engineering community together for an Analytics Summit — a multi-day internal conference to share analytical deliverables across Netflix, discuss analytic practice, and build relationships within the community. This post is one of several topics presented at the Summit highlighting the breadth and impact of Analytics work across different areas of the business. Understanding Risk in Content Launches Every title you see on Netflix goes through several key phases: Development, Pre-Production, Production/Principal Photography, Post-Production, and finally, Launch Preparation, all leading up to the Title Launch. Once Principal Photography wraps, the focus shifts in Post-Production from content creation to quality assurance and visual effects (if needed). At the end of Post Production, Netflix receives the final audio and video files — often delivered as an IMF (Interoperable Master Format) — which triggers a flurry of Launch Preparation activities, focused on tasks such as the development of artwork and trailers, creation of subtitles, maturity ratings & quality control, that happen within a tight window and rely on having the finalized media assets in hand. Some of this work can be kicked off earlier using a non-final version of the media called the Locked Cut, but since it’s not the absolute final deliverable, this presents a tradeoff: should our teams who prepare content for service wait for the more finalized IMF to begin their…

Model Releases

Read →

Netflix Tech Blog 2026-06-19 23:53 UTC Score 30.0 USR-0049-20260619-ai-specialis-299e066f

The Evolution of Cassandra Data Movement at Netflix

By Guil Pires , Jennifer Prince , Jose Camacho , Ken Kurzweil , Phanindra Chunduru Background In a previous post, we introduced Data Bridge , a unified management plane for batch Data Movement at Netflix. Historically, several bespoke Data Movement connectors were developed across different engineering organizations to fulfill their specific requirements. Over the last few years, the Data Movement team has started centralizing these offerings through an abstraction that provides a catalog of connectors, along with simple UI and APIs to initiate Data Movement jobs. One such case is the Cassandra to Iceberg connector. Apache Cassandra powers mission critical applications at Netflix, including Member, Billing, Recommendations, Subscriptions and many more. These use cases heavily leverage Data Movement to Apache Iceberg for many analytics and operational tasks, and central to this movement was a connector for Cassandra to Iceberg built in-house named Casspactor. As many Cassandra based Data Abstractions emerged, such as Key Value , Time Series and Graph — the need for larger and more complex Data Movement with transformations became more critical to the business. Data movements are fundamentally fulfilled by leveraging the existing Cassandra backup infrastructure. Regularly scheduled backups are performed directly on the Apache Cassandra nodes, via a sidecar process managing the upload of all necessary SSTables and associated Metadata files directly into Amazon S3. When a Data M…

AI Funding & Startups RAG

Read →

Netflix Tech Blog 2026-06-19 23:53 UTC Score 62.0 USR-0049-20260619-ai-specialis-6065b693 Full article

Thinking Fast & Slow for a Personalized Notification System

by Matthew Wood , Ishan Gupta , Kevin Mercurio, Devon Bryant , and Claire Dorman In his seminal book “Thinking, Fast and Slow,” Daniel Kahneman describes two systems that drive human cognition: System 1, which operates automatically and quickly with little effort, and System 2, which allocates attention to more challenging mental activities requiring deliberate focus. This dual-process theory has profound implications not just for understanding human behavior, but for designing intelligent systems that must balance immediate responsiveness with strategic foresight. Similar “plan vs. act” decompositions show up in other domains too — for example, robotics and autonomous driving often separate a slower planning layer (setting goals and constraints over longer horizons) from faster control and execution loops, and modern LLM agents frequently pair deliberate planning with rapid, step-by-step tool use and reaction. At Netflix, our messaging platform faces a similar challenge every day. We send hundreds of millions of personalized notifications — push messages, emails, and in-app alerts — to help members discover content they’ll love. This creates a central tension: optimizing each notification for near-term engagement can conflict with what is best for the member over the long term. Higher message frequency can increase fatigue and opt-out risk, while lower frequency can reduce awareness of relevant titles and features the member would value. This blog post introduces our framew…

Large Language Models Robotics

Read →

Netflix Tech Blog 2026-06-19 23:53 UTC Score 47.0 USR-0049-20260619-ai-specialis-4e454dce Full article

A Human-Augmenting Agentic Workflow for Causal Inference

By Winston Chou, Adrien Alexandre, Lars Olds, Yi Zhang, Garrett Hagemann, and Nathan Kallus Introduction Imagine asking a data agent to analyze the causal relationship between two variables, such as the effect of watching a popular Netflix show on long-term member retention. It queries your data, runs a regression, and confidently returns an answer. How much should you trust it? Can you be confident that the agent accounted for subtle biases — or does it treat passionate fans as if they were the average viewer? Without deep understanding and expertise, would you even be able to tell if it got the answer wrong? Data analysis is increasingly being delegated to software agents. While this reduces human effort and toil, oversight is still needed to ensure the validity of results. This is especially true for specialized tasks like Observational Causal Inference (OCI) , which require substantial judgment and domain expertise. In this blog post, we share an agentic workflow for performing OCI under unconfoundedness . Our workflow is designed for software agents to adhere to rigorous, exhaustive templates for causal inference tasks. Yet, it also seeks to be “ human-augmenting ,” and to enable and empower human inspection and evaluation. We designed this workflow with OCI practitioners in mind. Although OCI requires context and care to do well, aspects of it — e.g., checking and rechecking covariate balance, conducting sensitivity analyses, and keeping track of multiple iterations —…

AI Agents AI Ethics RAG

Read →

Netflix Tech Blog 2026-06-19 23:52 UTC Score 35.0 USR-0049-20260619-ai-specialis-4f8bb7cb

From Silos to Service Topology: Why Netflix Built a Real-Time Service Map

By Parth Jain , Rakesh Sukumar , Yingwu Zhao , Renzo Sanchez-Silva & Nathan Fisher How we built a living map of our distributed infrastructure to help engineers understand dependencies, troubleshoot faster, and keep Netflix running smoothly for our members around the world. The Puzzle with a Thousand Pieces Picture this: It’s 3am, and an engineer gets paged. One of our critical services is showing elevated error rates. Members trying to watch their favorite films and series are seeing degraded experiences. The clock is ticking. A single service at the center of a web of dependencies — services, data stores, and call chains branching in every direction. Without a unified map, engineers have to reason about this structure from memory and scattered signals. In a system with thousands of microservices supporting our entertainment experience for members worldwide, answering these questions quickly can mean the difference between a minor blip and a major incident. We kept hearing variations of this story from engineers across Netflix. The tooling gap was clear: we had plenty of signals, but no unified way to understand how everything connected. The Three Questions Every Engineer Asks When troubleshooting distributed systems, engineers fundamentally need to understand relationships: Which services depend on each other? Not just theoretical dependencies from configuration files or architecture diagrams, but actual runtime connections based on real traffic. What’s the blast radius? W…

AI Funding & Startups

Read →

OpenAI YouTube 2026-06-19 22:57 UTC Score 21.0 AI-146-20260619-podcasts-and-56109157 Full article

What Codex Unlocks for NTT Data

After introducing Codex, it grew to more than 10,000 active users and became one of the largest internal community in just a few months." We chatted with Hiroaki Sato, Head of the AI CoE at NTT DATA who gave us insight into all of the new ways his technical and non-tech teams are leveraging Codex. His Sales teams uses Codex to automate tasks like customer list maintenance, which has created net new workflows and report creation that used to take 2 days now takes 30 min 💥

RAG

Read →

Latest AI/ML News