AI/ML News & Innovations Hub

AI/ML news, top picks, and generated innovation digests.

★ Visit ai-karthik.com
422Sources
5100News Items
8Top Picks
43Blogs
runningLast Run

AI Ethics

48 articles tagged with this keyword, sorted by most recent first.

← All Keywords
Transactions on Machine Learning Research 2026-06-30 00:00 UTC Score 49.0 AI-084-20260630-research-pap-960a167b

Optimizing Attention with Mirror Descent: Generalized Max-Margin Token Selection

Attention mechanisms have revolutionized several domains of artificial intelligence, such as natural language processing and computer vision, by enabling models to selectively focus on relevant parts of the input data. While recent work has characterized the optimization dynamics of gradient descent (GD) in attention-based models and the structural properties of its preferred solutions, less is known about more general optimization algorithms such as mirror descent (MD). In this paper, we investigate the convergence properties and implicit biases of a family of MD algorithms tailored for softmax attention mechanisms, with the potential function chosen as the $p$-th power of the $\ell_p$-norm. Specifically, we show that these algorithms converge in direction to a generalized hard-margin SVM with an $\ell_p$-norm objective when applied to a classification problem using a softmax attention model. Notably, our theoretical results reveal that the convergence rate is comparable to that of traditional GD in simpler models, despite the highly nonlinear and nonconvex nature of the present problem. Additionally, we delve into the joint optimization dynamics of the key-query matrix and the decoder, establishing conditions under which this complex joint optimization converges to their respective hard-margin SVM solutions. Lastly, our numerical experiments on real data demonstrate that MD algorithms improve generalization over standard GD and excel in optimal token selection.

OpenAI Community 2026-06-29 23:08 UTC Score 40.0 AI-116-20260629-social-media-2cc9fa11

Feature Request: Make Project Memory Transparent, Searchable, and User-Controlled

Thanks for sharing this thoughtful feature request. I can see how greater transparency and control over Project Memory and Project retrieval would be valuable, especially for users managing long-term projects where continuity and visibility into retrieved context are important. I'll pass this feedback along to the team for consideration. Thanks again for taking the time to share these suggestions. ~ Smith

Medianama AI 2026-06-29 10:00 UTC Score 40.0 USR-0211-20260629-regional-new-e8ef059c

Lowdown: IRDAI proposes strict disclosures for insurance intermediaries earning over 10 crore in commissions

IRDAI has proposed annual public disclosures for insurance intermediaries earning over Rs 10 crore in commissions, as part of draft rules to improve transparency and accountability. The post Lowdown: IRDAI proposes strict disclosures for insurance intermediaries earning over 10 crore in commissions appeared first on MEDIANAMA .

LessWrong AI 2026-06-28 11:09 UTC Score 58.0 USR-0152-20260628-community-fo-165a11bf

Power Laws in NNs: A Possible Mechanism for Inductive Bias towards Sparse Representations

This post was produced as part of the Iliad Fellowship under the mentorship of Dmitry Vaintrob. Tl;dr: Power-law ("heavy-tailed") distributions have universality theorems similar to those which make Gaussians common. We observe many things in ML are power-law distributed, most robustly and interestingly, the spectra of weight matrices. I explain how we can think of power-laws as being a natural generalization of the idea of 'sparsity', interpolating between true sparsity and Gaussianity according to the 'tail-index' of the distribution. I share some hypotheses about how this might relate to the 'sparse'/'discrete'/'factored' representations that neural networks seem to learn. I promise this is not a Santa-Fe-Institute encomium for power laws or "black swans"; different genre. Contents 1. The generalized central limit theorem proves power-law distributions are universality classes 2. Power laws observed in NNs might help us understand representation learning 2.A. HTSR: phase changes in weight-matrix spectra and data-free prediction of generalization 2.B. BBP transition as a quantum of learning 2.C. HTSR as an extended BBP transition 2.D. Training evidence for heavy tails is mixed, and I'm not sure if they're important 3. The tail exponent α is a smooth proxy for sparsity and compressibility 3.A. α captures compressibility across heavy tails 3.B. α-stable noise can make discrete codebooks optimal 3.C. Heavy-tailed noise can convert analog inputs into discrete codebooks 4. Summ…

Kubernetes Documentation 2026-06-26 18:00 UTC Score 43.0 AI-200-20260626-developer-an-f210b1d6

Open source maintainership in the age of AI

AI has really changed the game around software development. More people are leveraging AI than ever to contribute patches to projects they use. To me, this is a good thing as more folks will contribute patches rather than fork or not fix them. The main problem is that AI has made generating code fast but there has been very little improvement in maintaining code bases. In this post, we will highlight the ways the Kubernetes community is adapting to the world of AI assisted coding. The first step of this journey was to develop an AI policy. This seems mundane and bureaucratic but there were many PRs that derailed into discussions around AI usage. The AI policy helps steer the conversation around the project's stance on AI and provides a clear signal to contributors on how to use these tools responsibly. Kubernetes AI policy The Kubernetes project has established clear guidelines for AI-assisted contributions that balance innovation with accountability. These policies are designed to maintain code quality and ensure human oversight while acknowledging that AI tools can be valuable aids in the development process. Transparency first Contributors must disclose when AI tools have been used to assist with a pull request. A simple statement in the PR description such as "This PR was written in part with the assistance of generative AI" is sufficient. This transparency helps reviewers understand the context and apply appropriate scrutiny. Human accountability While AI tools can assi…

Google DeepMind YouTube 2026-06-23 15:48 UTC Score 61.0 AI-145-20260623-podcasts-and-6366ba2d

When millions of AI agents meet

The conversation of the moment is focused on one topic: AI agents. Unlike traditional language models that simply respond to a prompt, autonomous agents can execute multi-step plans and perform complex tasks on your behalf. But what happens when millions of these agents are not just working for us, but transacting, negotiating, and delegating to one another? Nenad Tomašev, Senior Staff Research Scientist at Google DeepMind, joins host Hannah Fry to discuss the theoretical framework of a future"agentic economy." Together, they discuss the operational shift from single systems to a cooperative "society of specialists," the psychological risk of human automation bias, and the complex cybersecurity landscape—from dynamic cloaking to agentic traps—required to keep distributed intelligence secure. Timecodes: 00:00 Intro 1:07 Defining AI agents 4:44 Agentic exploration in science and research 15:46 Delegation between agents 22:46 Agentic security and traps 29:31 Building an agentic economy 33:22 Cognitive monoculture 36:29 Distributed intelligence To read the research, search for: Distributional AGI Safety, May 2026 Intelligent AI Delegation, February 2026 Virtual Agent Economies, September 2025 Learn more about our AGI control roadmap: https://deepmind.google/blog/securing-the-future-of-ai-agents/ ___ Subscribe to our channel https://www.youtube.com/@googledeepmind Find us on X https://x.com/GoogleDeepMind Follow us on Instagram https://instagram.com/googledeepmind Add us on Linke…

GitHub Engineering 2026-06-23 15:48 UTC Score 49.0 USR-0062-20260623-ai-specialis-41e4c400

GitHub joins coalition advocating for fixes to California AI Transparency Act to protect open source

We’re calling for targeted amendments to resolve conflicts with open source licensing and align with international transparency frameworks while preserving regulatory intent. The post GitHub joins coalition advocating for fixes to California AI Transparency Act to protect open source appeared first on The GitHub Blog .

AI Alignment Forum 2026-06-20 20:05 UTC Score 38.0 USR-0151-20260620-community-fo-c0bc42f0

How transparent is DiffusionGemma (and why it matters)

Authors: Joshua Engels*, Callum McDougall*, Bilal Chughtai*, Janos Kramar, Senthoran Rajamanoharan, Cindy Wu, Arthur Conmy, Asic Q Chen, Jean Tarbouriech, Min Ma, Brendan O'Donoghue+, João Gabriel Lopes de Oliveira+, Rohin Shah+, Neel Nanda+ *Primary Contributor +Advising Paper here: https://arxiv.org/abs/2606.20560 Overview In a recent collaboration between the GDM interpretability team and the GDM text diffusion team, we performed a transparency audit of DiffusionGemma, GDM's new text diffusion model. Overall, we find that DiffusionGemma is not significantly less transparent than Gemma. Gemma and DiffusionGemma perform similarly on monitorability evaluations . Although naively DiffusionGemma has a much larger opaque serial depth , we can apply the logit lens to intermediate vectors and ablate non-interpretable information without harming performance. This implies that these intermediate nodes are interpretable, which reduces the opaque serial depth to be similar to that of Gemma. However, even though the variables that the model uses at different steps are interpretable, this does not necessarily mean that we understand the algorithm that the model uses to reach the final answer. We thus distinguish between variable transparency, which we define as whether we can understand snapshots of the model's computation, and algorithmic transparency, which we define as whether we can use these snapshots to reconstruct the process by which the model arrived at its outputs. By default…

Two Minute Papers 2026-06-19 14:06 UTC Score 36.0 AI-139-20260619-podcasts-and-ae508afa

Scientists Found A Better Language For AI Agents

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.me/papers 📝 The paper is available here: https://recursivemas.github.io/ https://github.com/RecursiveMAS/RecursiveMAS Brain reading video: https://www.youtube.com/watch?v=IUg-t609byg 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Adam Bridges, Benji Rabhan, B Shang, Cameron Navor, Charles Ian Norman Venn, Christian Ahlin, Eric T, Fred R, Gordon Child, Juan Benet, Michael Tedder, Owen Skarpness, Richard Sundvall, Ryan Stankye, Shawn Becker, Steef, Taras Bobrovytsky, Tazaur Sagenclaw, Tybie Fitzhugh, Ueli Gallizzi

Allen Institute for AI Blog 2026-06-18 08:00 UTC Score 41.0 USR-0021-20260618-research-aca-541c5e53

How Domyn and AISquared built on Ai2's open releases

Domyn and AISquared show how Ai2’s open releases are helping AI labs build models for regulated industries, where transparency, provenance, licensing, and control are essential for customer trust and compliance.

Oxford Internet Institute AI 2026-06-15 08:00 UTC Score 33.0 USR-0028-20260615-research-aca-dde8bc2b

OII researchers head to FAccT 2026

Researchers and DPhil students from the Oxford Internet Institute are set to attend the Association of Computing Machinery (ACM) Conference on Fairness, Accountability and Transparency (FAccT) in Montréal, from 25-28 June 2026.

Stack Overflow Machine Learning Tag 2026-06-11 19:13 UTC Score 12.0 AI-112-20260611-social-media-24a3832d

Why is the cost of my neural network inconsistent (and sometimes increasing)?

I tried to follow this crash course to create a neural network from scratch. It seems to be working, which is great, but as I kept running the simulation I noticed that the cost of the network sometimes behaves, continuously decreasing until it reaches a minimum. Other times, it will hit a low, then go back up and rest at that higher position. Other times, it always increases! Why is it happening? I wrote it in C# as a Visual Studio Console App. int[] layerLengths = { 2, 30, 30, 30, 1 }; double[][,] weights = new double[layerLengths.Length - 1][,]; double[][,] biases = new double[layerLengths.Length - 1][,]; double[][,] layers = new double[layerLengths.Length][,]; Random rand = new Random(); double[,] input = { { 142, 64, 27 }, { 185, 71, 42 }, { 128, 62, 23 }, { 210, 74, 51 }, { 167, 68, 35 }, { 154, 66, 29 }, { 198, 72, 46 }, { 135, 63, 21 }, { 176, 70, 38 }, { 221, 75, 54 }, { 149, 65, 31 }, { 162, 67, 33 }, { 193, 73, 48 }, { 124, 61, 20 }, { 181, 69, 41 }, { 205, 76, 57 }, { 157, 66, 30 }, { 170, 68, 36 }, { 138, 64, 25 }, { 214, 74, 53 }, { 146, 65, 28 }, { 189, 72, 44 }, { 132, 62, 22 }, { 173, 69, 37 }, { 201, 73, 49 }, { 159, 67, 32 }, { 144, 64, 26 }, { 178, 70, 39 }, { 226, 77, 60 }, { 151, 65, 29 }, { 166, 68, 34 }, { 196, 74, 47 }, { 127, 61, 19 }, { 183, 71, 43 }, { 208, 75, 55 }, { 155, 66, 31 }, { 171, 69, 36 }, { 140, 63, 24 }, { 217, 76, 58 }, { 148, 65, 27 }, { 191, 73, 45 }, { 130, 62, 21 }, { 175, 70, 38 }, { 203, 74, 50 }, { 160, 67, 33 }, { 145, 64, 26…

Stack Overflow Machine Learning Tag 2026-06-11 15:12 UTC Score 29.0 AI-112-20260611-social-media-e0ab0049

Why is it so difficult to train an accuracte GAN model?

I'm trying to train a GAN model, but its results are very bad. The Generator doesn't seem to work. Can someone suggest how this can be improved? What is the best way to remember the code? Can this also be done using a builtin library? import os import torch import torchvision import torch.nn as nn import torch.optim as optim import torch.nn.functional as F import torchvision.datasets as datasets import torchvision.transforms as transforms from torch.utils.data import DataLoader import matplotlib.pyplot as plt import numpy as np random_seed = 42 torch.manual_seed(random_seed) BATCH_SIZE = 128 AVAIL_GPUS = min(1, torch.cuda.device_count()) DEVICE = torch.device("cuda" if AVAIL_GPUS else "cpu") LATENT_DIM = 100 transform = transforms.Compose([ transforms.ToTensor(), transforms.Normalize((0.5,), (0.5,)) # scale to [-1, 1] for tanh output ]) dataset = datasets.MNIST(root="./data", train=True, download=True, transform=transform) dataloader = DataLoader(dataset, batch_size=BATCH_SIZE, shuffle=True, num_workers=2, drop_last=True) class Generator(nn.Module): def __init__(self, latent_dim=100, img_channels=1, feature_maps=64): super().__init__() self.net = nn.Sequential( nn.ConvTranspose2d(latent_dim, feature_maps * 4, kernel_size=7, stride=1, padding=0, bias=False), nn.BatchNorm2d(feature_maps * 4), nn.ReLU(True), nn.ConvTranspose2d(feature_maps * 4, feature_maps * 2, kernel_size=4, stride=2, padding=1, bias=False), nn.BatchNorm2d(feature_maps * 2), nn.ReLU(True), nn.ConvTranspose2d(…

Two Minute Papers 2026-06-05 15:50 UTC Score 36.0 AI-139-20260605-podcasts-and-7daff15c

DeepMind’s New AI Found A Strange New Way To Think

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.me/papers 📝 The paper is available here: https://github.com/google-deepmind/alphaproof-nexus-results https://arxiv.org/html/2605.22763v1 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Adam Bridges, Benji Rabhan, B Shang, Cameron Navor, Charles Ian Norman Venn, Christian Ahlin, Eric T, Fred R, Gordon Child, Juan Benet, Michael Tedder, Owen Skarpness, Richard Sundvall, Ryan Stankye, Shawn Becker, Steef, Taras Bobrovytsky, Tazaur Sagenclaw, Tybie Fitzhugh, Ueli Gallizzi My research: https://cg.tuwien.ac.at/~zsolnai/ Thumbnail design: https://felicia.hu

Stack Overflow Machine Learning Tag 2026-05-31 18:34 UTC Score 12.0 AI-112-20260531-social-media-8041f0ff

Writing a Neural Network which can have an arbitrary number of hidden layers, could someone tell me if this is the best way to do it?

#Multiple Layers import numpy as np class NeuralNetwork: def __init__(self, input_size, hiddenLayerSizes, output_size): self.input_size = input_size self.hiddenLayerSizes = hiddenLayerSizes self.output_size = output_size self.hiddenLayerWeights = [] self.hiddenLayerBiases =[] self.weights_input_hidden1 = np.random.randn(self.input_size, self.hiddenLayerSizes[0]) for i in range(0, len(hiddenLayerSizes)-1): self.hiddenLayerWeights.append(np.random.randn(self.hiddenLayerSizes[i], self.hiddenLayerSizes[i+1])) self.hiddenLayerBiases.append(np.zeros((1, self.hiddenLayerSizes[i]))) self.weights_hidden_output = np.random.randn(self.hiddenLayerSizes[len(self.hiddenLayerSizes)-1], self.output_size) self.bias_output = np.zeros((1, self.output_size)) def sigmoid(self, x): return 1 / (1 + np.exp(-x)) def sigmoid_derivative(self, x): return x * (1 - x) def feedforward(self, x): self.hidden_activations = [] self.hidden_outputs = [] self.hidden_activations.append(np.dot(X, self.weights_input_hidden1) + self.hiddenLayerBiases[0]) self.hidden_outputs.append(self.sigmoid(self.hidden_activations[0])) for i in range(0, len(self.hiddenLayerSizes)-1): self.hidden_activations.append(np.dot(self.hidden_outputs[i], self.hiddenLayerWeights[i]) + self.hiddenLayerBiases[i]) self.hidden_outputs.append(self.sigmoid(self.hidden_activations[i])) self.output_activation = np.dot(self.hidden_outputs[len(self.hidden_outputs)-1], self.weights_hidden_output) + self.bias_output self.predicted_output = self.sigmoid…

Machine Learning Street Talk 2026-05-31 00:14 UTC Score 50.0 AI-141-20260531-podcasts-and-62f308dd

The Ex-Pentagon Chief Sounding the Alarm on AI Weapons — Brad Carson

Brad Carson was the Army's General Counsel, served two terms in Congress and was Acting Under Secretary of Defense for Personnel and Readiness. He now heads Americans for Responsible Innovation, the AI-policy advocacy group he co-founded. Keith Duggar spends roughly eighty minutes pushing back. SPONSOR: --- Cyber Fund built the Monastery to help founders ship products that were impossible a year ago. Applications for Batch 1 are now open. Apply now: https://cyber.fund --- Carson's whole case rests on one line: the genie is not out of the bottle. We have pulled dangerous tech back before. Asilomar halted recombinant DNA in 1975, and the West still controls the chips AI runs on. Calling it unstoppable, he says, is the most dangerous idea in the room. Then Keith drags him somewhere darker. A Palantir heat map scores you 0.73 on whether you are a combatant, and a strike follows. The model is wrong some accepted share of the time, and when it is, nobody answers for it. You cannot court-martial a model, and not even the interpretability researchers can say why it picked you. — Note: after recording, we learned that Americans for Responsible Innovation is backed by EA-aligned philanthropy (not sponsored) --- TIMESTAMPS: 00:00:00 From the Pentagon to AI governance 00:04:52 Regulatory capture vs Silicon Valley networks 00:07:56 Transparency and the Claude tier changes 00:09:40 Tort liability when AI tools cause harm 00:13:40 AI is a product, not a person 00:16:01 Children, suicide, a…

Kubernetes Documentation 2026-05-26 17:30 UTC Score 27.0 AI-200-20260526-developer-an-a473d36c

Reconciling the Past: Correcting Records for Unfixed Kubernetes CVEs

The Kubernetes project relies on transparency to empower cluster administrators and security researchers. One important way we do that is by publishing CVE records into the Common Vulnerabilities and Exposures database. As part of our ongoing effort to mature the official Kubernetes CVE Feed , we have identified some discrepancies. CVE records for a few older, unfixed issues incorrectly include a fixed version field. The Kubernetes Security Response Committee (SRC) will correct the affected CVE records on June 1, 2026. This may result in vulnerability scanners identifying these vulnerabilities in places where they were previously not detected. To help reduce confusion, this post provides a technical update on three vulnerabilities that were disclosed in previous years but remain unfixed: CVE-2020-8561 , CVE-2020-8562 , and CVE-2021-25740 . Why we are updating these records now While these vulnerabilities have been public for several years, the recent work to generate official Open Source Vulnerabilities (OSV) files revealed that their corresponding CVE records did not accurately reflect their status. Specifically, some records suggested a fixed version existed, when in reality, these issues are architectural design trade-offs that cannot be fully remediated through code without breaking fundamental Kubernetes functionality. Correcting these records is vital for the community for: Automation Fidelity : Modern vulnerability scanners depend on precise version ranges. Inaccurate…

TWIML AI Podcast 2026-05-21 19:38 UTC Score 56.0 AI-148-20260521-podcasts-and-830461d3

Relational Foundation Models for Enterprise Data with Jure Leskovec - #768

In this episode, Jure Leskovec, co-founder and chief scientist at Kumo and professor of computer science at Stanford, joins us to explore two fronts of his work: AI for science and relational deep learning. We begin with AI Virtual Cell, a multiscale effort to learn data-driven representations from proteins to cells to patients using single-cell RNA-seq data, protein language models like ESM, and structure models like AlphaFold—without hand-encoding biology. Jure then dives into relational deep learning, reframing enterprise databases as graphs and training neural networks directly on raw multi-table data. He explains Kumo’s Relational Foundation Model (RFM2), which performs in-context learning over subgraphs to make accurate predictions on new databases and tasks with no training, and how this approach benchmarks against RelBench and other multi-table datasets. We also discuss real-world deployments at companies like Reddit, DoorDash, and Coinbase, explainability via attention over tables and columns, integration with agentic systems, deployment options, and practical limitations. The complete show notes for this episode can be found at https://twimlai.com/go/768.

Vector Institute News 2026-02-24 20:55 UTC Score 38.0 USR-0017-20260224-research-aca-826484c8

CRISPNAM-FG: An interpretable Fine-Gray deep survival model for competing risks in health care

Vector researchers developed CRISPNAM-FG, a trustworthy AI model that predicts the risk of developing diabetes-related foot complications for patients discharged from hospitals while providing complete transparency in how each decision […] The post CRISPNAM-FG: An interpretable Fine-Gray deep survival model for competing risks in health care appeared first on Vector Institute for Artificial Intelligence .

METR 2026-02-24 08:00 UTC Score 48.0 USR-0147-20260224-research-aca-aae32dc8

We are Changing our Developer Productivity Experiment Design

METR previously published a paper which found the use of AI tools caused a 20% slowdown in completing tasks among experienced open-source developers, using data from February to June 2025. To understand how AI is impacting developer productivity over time, we started a new experiment in August 2025 with a larger pool of developers using the latest AI tools. Unfortunately, given participant feedback and surveys, we believe that the data from our new experiment gives us an unreliable signal of the current productivity effect of AI tools. The primary reason is that we have observed a significant increase in developers choosing not to participate in the study because they do not wish to work without AI, which likely biases downwards our estimate of AI-assisted speedup. We additionally believe there have been selection effects due to a lower pay rate (we reduced the pay from $150/hr to $50/hr), and that our measurements of time-spent on each task are unreliable for the fraction of developers who use multiple AI agents concurrently. Based on conversations with study participants, we believe it is likely that developers are more sped up from AI tools now — in early 2026 — compared to our estimates from early 2025. However, because of the selection effects in our experiment, our data is only very weak evidence for the size of this increase. Our raw results show some evidence for speedup. Our early 2025 study found the use of AI causes tasks to take 19% longer, with a confidence inte…

Consultancy.lat AI & GenAI 2026-02-18 13:30 UTC Score 12.0 AI-177-20260218-regional-ai--84f181ca

The authenticity gap: What Gen Z really think about brands

A global study from SKIM has found that young consumers from around the world value transparency above all else, while nearly a third reject brands for “trying too hard” with forced messaging. Mariana Abelha and Patricia Fujisawa, senior members in the firm’s LATAM business, explore why authenticity isn’t optional for Gen Z and what this means for brands.

OpenMined Blog 2026-01-16 21:10 UTC Score 38.0 USR-0156-20260116-ai-specialis-a134880a

OpenMined Joins Open Forum for AI to Advance Responsible Data Governance

We’re excited to announce that OpenMined has joined the Open Forum for AI (OFAI), an international initiative led by Carnegie Mellon University that’s bringing together academic institutions and nonprofit organizations to advance human-centered and ethical approaches to artificial intelligence. Launched at Carnegie Mellon University in 2024, OFAI was created to foster collaboration, transparency, and inclusion […] The post OpenMined Joins Open Forum for AI to Advance Responsible Data Governance appeared first on OpenMined .

Amazon Science AI 2025-08-25 17:10 UTC Score 75.0 AI-058-20250825-official-ai--8d6afffe

Beyond detection: A multi-agent framework for root cause analysis of financial discrepancies in distributed environments

The increasing complexity and fragmentation of financial systems in large organizations have created significant challenges for financial teams, particularly in performing real-time, end-to-end validation, as existing validation methods relying on static rules or batch processing are often inadequate for today's dynamic financial environments. This paper introduces a novel approach using Large Language Model (LLM)-based browser agents within a multi-agent framework to enhance financial validation processes. The framework leverages domain-specific agents that autonomously navigate web-based financial platforms to validate data, interpret discrepancies, and perform root cause analysis, ensuring higher accuracy, transparency, and auditability compared to traditional systems. A synthetic dataset and controlled simulation environment were used to evaluate the framework's performance across 20 distinct financial scenarios, revealing significant improvements in validation accuracy (from 40% with a Vanilla agent to 65% with the proposed approach). The results indicate that the proposed multi-agent approach, by isolating validation tasks into specialized agents and orchestrating a coordinated investigation, provides a more reliable, scalable, and interpretable solution for high-stakes financial environments.

EleutherAI Blog 2025-06-12 00:00 UTC Score 27.0 USR-0184-20250612-research-aca-0a6d62bc

Studying inductive biases of random networks via local volumes

In this post, we will study inductive biases of the parameter-function map of random neural networks using star domain volume estimates. This builds on the ideas introduced in Estimating the Probability of Sampling a Trained Neural Network at Random and Neural Redshift: Random Networks are not Random Functions (henceforth NRS). Inductive biases To understand generalization in deep neural networks, we must understand inductive biases. Given a fixed architecture, some tasks will be easily learnable, while others can take an exponentially long time to learn (see here and here).

Lilian Weng Blog 2024-11-28 00:00 UTC Score 47.0 USR-0112-20241128-ai-specialis-1b600ac6

Reward Hacking in Reinforcement Learning

Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing the intended task. Reward hacking exists because RL environments are often imperfect, and it is fundamentally challenging to accurately specify a reward function. With the rise of language models generalizing to a broad spectrum of tasks and RLHF becomes a de facto method for alignment training, reward hacking in RL training of language models has become a critical practical challenge. Instances where the model learns to modify unit tests to pass coding tasks, or where responses contain biases that mimic a user’s preference, are pretty concerning and are likely one of the major blockers for real-world deployment of more autonomous use cases of AI models.

Qdrant Blog 2024-04-11 00:01 UTC Score 35.0 USR-0074-20240411-ai-specialis-5514d12d

Enhance AI Data Sovereignty with Aleph Alpha and Qdrant Hybrid Cloud

Aleph Alpha and Qdrant are on a joint mission to empower the world’s best companies in their AI journey. The launch of Qdrant Hybrid Cloud furthers this effort by ensuring complete data sovereignty and hosting security. This latest collaboration is all about giving enterprise customers complete transparency and sovereignty to make use of AI in their own environment. By using a hybrid cloud vector database, those looking to leverage vector search for the AI applications can now ensure their proprietary and customer data is completely secure.

Cross Validated 2023-10-08 04:39 UTC Score 9.0 AI-113-20231008-social-media-de1e9c72

How to deal with a Stationary DV and a Trend-Stationary IV in using OLS?

I have a dependent variable that is stationary in levels. However, one of the IVs is only trend-stationary (stationary around a deterministic trend that I can extract from the series). In other words, I have a regression with variables that have different transformations. My question is, does this bias the significance of the results, including the significance of any other IVs (that are stationary in levels in the regression)? If not, it is probably reasonable to assume that it biases the estimated coefficient of the DP on the trend-stationary IV but not the significance of the effect since it shouldn't impact the standard errors (due to the trend being deterministic). Is that an accurate assessment? Also, what if I detrend (i.e. extract a deterministic trend) from my stationary DV? Is this equivalent to over differencing and thus losing important information from the series? Does extracting a deterministic trend from a stationary series even make sense?

Cross Validated 2021-11-29 04:38 UTC Score 12.0 AI-113-20211129-social-media-03b20a2b

How to obtain conditional use accuracy equality with communities with different real positive rates?

The short version is that I would like to know what the confusion matrices (numbers of true positives, false positives, true negatives, and false negatives) should be to achieve conditional use accuracy equality among two communities, one with 40% real positives and one with 60% real positives. Here is the long version... I am trying to understand the difference fairness metrics described in Understanding Fairness . It contains an interactive pair of pie charts representing two communities' confusion matrices. Blue portions represent real positives (RP); striped portions represent predicted positives. To the right of the charts are different fairness metrics and how well the proportions in the pie charts satisfy them. Here is the original configuration: The small red and blue circles are handles for adjusting the sizes of regions. As shown in green, these fairness criteria are achieved: Group fairness Equalized Odds Overall Accuracy Equality I have been unable to come up with adjustments that satisfy Conditional Use Accuracy Equality. As described in the document, the two communities should have the same: positive predictive value (PPV) or precision, i.e. TP / Predicted Positive, and negative predictive value (NPV), i.e. TN / Predicted Negatives Can Conditional Use Accuracy Equality be obtained in this scenario?

Stanford AI Lab Blog 2021-11-05 07:00 UTC Score 52.0 USR-0006-20211105-research-aca-02e23852

Stanford AI Lab Papers at EMNLP/CoNLL 2021

The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021) will take place next week, colocated with CoNLL 2021. We’re excited to share all the work from SAIL that will be presented, and you’ll find links to papers, videos and blogs below. Feel free to reach out to the contact authors directly to learn more about the work that’s happening at Stanford! List of Accepted Papers Calibrate your listeners! Robust communication-based training for pragmatic speakers Authors : Rose E. Wang, Julia White, Jesse Mu, Noah D. Goodman Contact : rewang@stanford.edu Links: Paper | Video Keywords : language generation, pragmatics, communication-based training, calibration, uncertainty Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text Authors : Maya Varma, Laurel Orr, Sen Wu, Megan Leszczynski, Xiao Ling, Christopher Ré Contact : mvarma2@stanford.edu Links: Paper | Video Keywords : named entity disambiguation, biomedical text, rare entities, data integration ContractNLI: A Dataset for Document-level Natural Language Inference for Contracts Authors : Yuta Koreeda, Christopher D. Manning Contact : koreeda@stanford.edu Links: Paper | Website Keywords : natural language inference, contract, law, legal, dataset Venue : The Findings of EMNLP 2021 The Emergence of the Shape Bias Results from Communicative Efficiency Authors : Eva Portelance, Michael C. Frank, Dan Jurafsky, Alessandro Sordoni, Romain Laroche Contact : portelan@stanford.edu Links…

Data Science Stack Exchange 2021-03-31 01:58 UTC Score 12.0 AI-111-20210331-social-media-a642624f

How to manage survivorship bias in conversion metrics?

I am trying to understand diminishing returns on additional advertising. Basically, does the 2nd ad convert worse than the 1st, etc. I'm struggling with thinking about how to deal with the survivorship bias, because if a user converts, then they will stop seeing additional ads. Are there ways to manage these biases? Ideally, I'm interested in modeling the likelihood of conversion given some demo factors and how many ads they've received so far.

Lilian Weng Blog 2021-03-21 00:00 UTC Score 39.0 USR-0112-20210321-ai-specialis-3e60cc8a

Reducing Toxicity in Language Models

Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior and biases from the Internet. Pretrained language models are very powerful and have shown great success in many NLP tasks. However, to safely deploy them for practical real-world applications demands a strong safety control over the model generation process.