AI/ML News & Innovations Hub

AI/ML news, top picks, and generated innovation digests.

★ Visit ai-karthik.com
422Sources
7510News Items
8Top Picks
61Blogs
successLast Run

Latest AI/ML News

7510 matching items

Some ChatGPT App Store users lose access to exposed MCP tools after one tool call
OpenAI Community 2026-06-28 20:15 UTC Score 55.0 AI-116-20260628-social-media-79ac931d Full article

Some ChatGPT App Store users lose access to exposed MCP tools after one tool call

I wonder if this is related to the new version of GPT-5.5 Instant released last week. Can anyone from OpenAI confirm whether Apps on Instant have a smaller effective context or tool-descriptor budget? I saw docs implying context size for Instant is now 16K tokens (and it used to be 27K tokens). Specifically, can large MCP tools/list payloads - descriptions, input/output schemas, annotations, metadata, etc. - cause exposed tools to become unavailable or stop being selected after an initial tool call?

LessWrong AI 2026-06-28 20:13 UTC Score 66.0 USR-0152-20260628-community-fo-e0c36a25

What comes with cheap math?

Thanks to conversations with Anson Berns, Gurkenglass, Roman Malov, Sahil, Sam Eisenstat, and others. Over the past two months, I've been doing a lot of "vibe research" (like vibe coding, but for research). Anson Berns started coming to my office hours , and we've been collaborating on a project modeling trust between logical inductors. In addition to talking once a week, we've been exchanging raw AI chats as well as AI-generated summaries of what has been done (the raw chats are nice because they allow me to generate my own AI summaries focusing on what I'm most curious about). I've been asking Claude to use Lean to verify everything, so there's a somewhat good chance there's real results of interest here, but I haven't (yet) been reading the Lean proofs (or even the theorem statements) -- instead I've just been chatting with AI about how the Lean proofs went and whether they really formalized what was claimed in english+latex, and focused on understanding the proofs myself in the same way I'd normally read a math paper. There have already been several times when this methodology has caught big gaps between what was claimed and what was verified in Lean, so I imagine there are more. This was mostly done with Claude Opus 4.8 via Claude Code, with a small amount of GPT 5.5 Extra High in Codex to get a second opinion. I cannot confidently say that this was faster than doing research the old-fashioned way. Sitting down with AI puts my attention in very different places, more on…

LessWrong AI 2026-06-28 19:37 UTC Score 61.0 USR-0152-20260628-community-fo-39ab56d6

We Should Be Scaling RL on Forecasting

This is a crosspost of a post from my blog, Metal Ivy . The original is here: Reinforcement Learning on Forecasting Will Give Us a Superhuman Forecaster . Why RL on forecasting? When DeepSeek R1 came out in January 2025, I felt that the fact that RL on LLMs simply worked was incredible, but using it on coding and math wasn’t the right path. Before RL we had pretraining, a scalable and general training methodology that worked extremely well to get the model to the human level, through learning by imitation over human data. Then RL came in and gave us a way to get even further, to the expert level and beyond, through sampling many trajectories from the LLM and using a reward function to select the best ones to reinforce. But it isn’t general anymore when only short term, self contained verifiable tasks such as coding or math make up the environment. A strongly superhuman coder might change everything - if recursive self improvement happens like the labs hope (and doesn’t kill us). But it might not change that much at all by itself, beyond giving us more of the software abundance we in many ways already have. A strongly superhuman forecaster instantly gives people and organizations the ability to make superhuman decisions through forecasting of their outcomes, and would be a massive boost to the overall competence of our civilization. You may ask why should it work, even in theory - math is deterministic and forecasting is not, so forecasting reward may give bad weight updates.…

LessWrong AI 2026-06-28 19:37 UTC Score 61.0 USR-0152-20260628-community-fo-64aa9575

Reinforcement Learning on Forecasting Can Give Us a Superhuman Forecaster

This is a crosspost of a post from my blog, Metal Ivy . The original is here: Reinforcement Learning on Forecasting Will Give Us a Superhuman Forecaster . Why RL on forecasting? When DeepSeek R1 came out in January 2025, I felt that the fact that RL on LLMs simply worked was incredible, but using it on coding and math wasn’t the right path. Before RL we had pretraining, a scalable and general training methodology that worked extremely well to get the model to the human level, through learning by imitation over human data. Then RL came in and gave us a way to get even further, to the expert level and beyond, through sampling many trajectories from the LLM and using a reward function to select the best ones to reinforce. But it isn’t general anymore when only short term, self contained verifiable tasks such as coding or math make up the environment. A strongly superhuman coder might change everything - if recursive self improvement happens like the labs hope (and doesn’t kill us). But it might not change that much at all by itself, beyond giving us more of the software abundance we in many ways already have. A strongly superhuman forecaster instantly gives people and organizations the ability to make superhuman decisions through forecasting of their outcomes, and would be a massive boost to the overall competence of our civilization. You may ask why should it work, even in theory - math is deterministic and forecasting is not, so forecasting reward may give bad weight updates.…

Introducing GPT-5.6 series: Sol, Terra and Luna
OpenAI Community 2026-06-28 19:27 UTC Score 63.0 AI-116-20260628-social-media-4b9bac18 Full article

Introducing GPT-5.6 series: Sol, Terra and Luna

The timing on this couldn’t be better. I run agentic systems daily - OpenClaw, Hermes, Claude Code orchestrating multiple AI workers. The bottleneck has always been cost at scale. Anthropic’s API pricing makes it brutal to run agents 24/7. You’re watching credits evaporate in real time. The fact that OpenAI allows third-party harnesses to tap into these models through an existing subscription changes the math completely. Looking forward to Sol Ultra powering my agents without per-token anxiety. And “Ultra” mode with subagents working together - that’s exactly where agentic AI needs to go. Thank you for making this accessible to builders, not just enterprises with infinite API budgets. Time to put these through their paces. I’ve got 6 DGX Sparks running great local model like Gemma4 and these 5.6 models are going to run it all.

Simon Willison Weblog 2026-06-28 19:26 UTC Score 35.0 USR-0110-20260628-ai-specialis-70484f65 Full article

Hack Your Summer

Hack Your Summer I learned about this initiative from DJ Patil this morning: It’s a 4-week, high-velocity production sprint for undergraduate students, graduate students, and recent graduates who want to build something real this summer. You’ll learn how to identify a project, make steady progress, get support from mentors and peers, and create tangible, public-facing work you can actually show future employers. Hack Your Summer is partly a reaction to the internship crisis facing US college students this year. There are way fewer available internships than usual, as companies have reduced their hiring ambitions and teams have less capacity to coach interns. Hack Your Summer provides an alternative path for the many students who didn't catch one of those rare internships. A second (free) cohort starts on July 13th, and the deadline for students to apply is July 8th. They're also accepting volunteers to help mentor the students. Tags: careers

Euronews AI 2026-06-28 19:17 UTC Score 45.0 AI-164-20260628-regional-ai--4b44808b

Iran threatens total halt to talks amid intensive US air activity near Hormuz

The IRGC said the weekend US strikes violated the framework deal and warned that violating vessels would face a "crushing response," as Euronews journalists in Doha observed US refuelling aircraft taking off towards Hormuz in the same formation as the previous night's strikes.

LessWrong AI 2026-06-28 19:11 UTC Score 60.0 USR-0152-20260628-community-fo-5461c34f

The arithmetic hierarchy of real functions

I wrote a fairly accessible introduction to real hypercomputation with Marcus Hutter. The focus is on enabling applications to algorithmic information theory. This project was intended to build my technical foundations for studying AIXI, but took me a bit further afield and down some rabbit holes. In the future I will prefer to focus more tightly on AI safety. Feedback would be appreciated. In particular, I needed to introduce an extra extensionality assumption for the real domain case, which I am still not sure is necessary. Errata: The diagram of results currently has theorems misnumbered due to a typographical error. Thanks to the LTFF for supporting my work over most of the research process. Discuss

Low cost for Chatgpt Ho for Students for Learning
OpenAI Community 2026-06-28 18:58 UTC Score 50.0 AI-116-20260628-social-media-c6152a4c Full article

Low cost for Chatgpt Ho for Students for Learning

Request for Student Discount and Regional Pricing Subject: Request for Student Discount and Regional Pricing for ChatGPT Dear OpenAI Team, I hope this message finds you well. I would like to respectfully request that OpenAI consider introducing a Student Plan and regional pricing for countries where the current subscription cost is difficult for many students to afford. Many students rely on ChatGPT for: - Learning programming and software development - Research and academic writing - Completing educational projects - Learning new technologies and AI - Improving productivity and problem-solving skills However, the current subscription price can be a significant financial burden for students and users in developing countries. I kindly request that OpenAI consider: 1. A discounted Student Plan with verification through an educational institution. 2. Regional pricing based on local purchasing power. 3. Flexible monthly and annual plans at lower price points. 4. Additional educational benefits for verified students. Making ChatGPT more affordable would help many students gain access to high-quality AI tools for learning, innovation, and skill development. Thank you for your time and consideration. I appreciate the work OpenAI is doing and hope these suggestions can be considered in future updates. Sincerely, A Student and ChatGPT User

ChatGPT lost me on subscription experience, not product quality
OpenAI Community 2026-06-28 18:33 UTC Score 45.0 AI-116-20260628-social-media-24945249 Full article

ChatGPT lost me on subscription experience, not product quality

Thanks for your reply, and thank you for the warm welcome. I understand why my first post might seem unusual at first glance. My intention wasn’t to promote Claude or suggest that people should choose another AI platform. In fact, my conclusion was the opposite: I believe ChatGPT is the stronger overall product. The point I wanted to share was that my purchasing decision was ultimately influenced by the subscription experience rather than the product itself. As someone evaluating AI platforms for long-term professional use, I see pricing, billing, invoicing, VAT handling, and the purchasing process as part of the overall user experience—not just administrative details. I thought it might be useful to share a real-world purchasing decision with the product team and the community. Even if others have different priorities, understanding why customers make certain decisions can sometimes be just as valuable as discussing technical features. Thanks again for taking the time to comment. I’m looking forward to learning from and contributing to the community.

LessWrong AI 2026-06-28 18:19 UTC Score 59.0 USR-0152-20260628-community-fo-716762aa

A survey of okayish ASI futures

At this point, RSI loops and continual learning appear overwhelmingly likely to begin in the near future. Whatever the limit of the LLM paradigm plus whatever new, superior paradigms a maximally intelligent LLM can develop, we are on track to do so in the next few years. There remain substantial obstacles to wild superintelligence, but AI is already superhuman in a number of real-world-relevant, dangerous categories. Most speculation about the trajectory we're on now focuses on timelines where we're reduced either to powerless pets of the god mind(perhaps with a small "governance board" made up of people very convinced that they're in control) or computronium-and-shrimp soup. But the higher-probability doom and utopia scenarios have been exhaustively documented by people smarter than me - I have nothing to add. As such, I'd like to go in the other direction: If we throw in the towel on the inevitability of LLMs capable of RSI loops leading to mostly-uncontrollable(though perhaps not immediately hostile) superintelligence on 1-3 year timelines, how might some of the more interesting/plausible non-extinction scenarios look? This piece is aimed at exploration and makes no attempt at prediction - I assign very small probabilities to any of these outcomes(except the nuclear exchange case) relative to doom. You Can't Just Do Things We have as little understanding of alignment as we do of LLMs themselves. Alignment becomes intractable past a certain point, even if capability doesn'…

김준기 래블업 CTO “풀스택 AI 인프라로 GPU 한계 넘는다”
Korea AI Times 2026-06-28 17:50 UTC Score 43.0 USR-0048-20260628-global-ai-ne-ba1ac03e Full article

김준기 래블업 CTO “풀스택 AI 인프라로 GPU 한계 넘는다”

최근 국내 AI 시장에서 안정적이고 효율적인 GPU 공급을 내세운 서비스가 급증하고 있다. GPU 가격 상승과 추론 수요 확대로 기업들의 AI 인프라 복잡성이 커진 데다, 저전력 NPU 등 하드웨어 선택지도 다양해졌기 때문이다.이러한 상황 속에서 2015년 설립 이후 ‘GPU 가상화’ 시장을 개척해 온 래블업(대표 신정규)이 기존 \'모델 개발 및 사전 훈련\' 중심에서 최근 수요가 급증한 \'추론과 에이전트\' 영역으로 비즈니스를 본격 확장하고 나섰다.그 중심에는 래블업의 ‘백엔드닷에이아이(Backend.AI)’가 있다. 이종 GPU·N

China claims the world’s fastest supercomputer
The Verge AI 2026-06-28 17:20 UTC Score 52.0 AI-016-20260628-global-ai-ne-c1bb4d1f Full article

China claims the world’s fastest supercomputer

Despite trade restrictions, China has reclaimed the title of the world's fastest supercomputer for the first time since 2018. LineShine has pushed El Capitan out of number one on the TOP500 ranking. That's despite strict limits on what high-powered computing components can be sold to China by US firms, which dominate the list, with America […]

Title Two OpenAI support cases, repeated escalations, but no identifiable human response
OpenAI Community 2026-06-28 17:05 UTC Score 53.0 AI-116-20260628-social-media-2eb1c72f Full article

Title Two OpenAI support cases, repeated escalations, but no identifiable human response

Body I am looking for guidance from OpenAI staff regarding two existing support cases. I have an active ChatGPT Plus subscription and have completed the standard troubleshooting multiple times (correct account, current app, supported country, tested across devices). Over the past several weeks I have experienced a pattern of issues affecting multiple features, including changing tool availability, intermittent usage limits, voice interruptions, inconsistent feature availability, and Agent not being available. I have now opened two support cases: Case 10583616 Case 10663155 Both were acknowledged and marked as escalated to a support specialist. However, I have not yet received an identifiable human response to either case. I’m not asking the community to troubleshoot my account. I’m asking whether an OpenAI staff member can advise whether these cases are still active, whether they can be reviewed by the appropriate team, or whether there is another process I should follow to have the account investigated. Thank you.

Has anyone successfully had a support case reviewed by a human?
OpenAI Community 2026-06-28 17:05 UTC Score 53.0 AI-116-20260628-social-media-7ed4457f Full article

Has anyone successfully had a support case reviewed by a human?

Body I am looking for guidance from OpenAI staff regarding two existing support cases. I have an active ChatGPT Plus subscription and have completed the standard troubleshooting multiple times (correct account, current app, supported country, tested across devices). Over the past several weeks I have experienced a pattern of issues affecting multiple features, including changing tool availability, intermittent usage limits, voice interruptions, inconsistent feature availability, and Agent not being available. I have now opened two support cases: Case 10583616 Case 10663155 Both were acknowledged and marked as escalated to a support specialist. However, I have not yet received an identifiable human response to either case. I’m not asking the community to troubleshoot my account. I’m asking whether an OpenAI staff member can advise whether these cases are still active, whether they can be reviewed by the appropriate team, or whether there is another process I should follow to have the account investigated. Thank you.

Feature Request: ChatGPT Wrapped
OpenAI Community 2026-06-28 17:05 UTC Score 38.0 AI-116-20260628-social-media-fea8b89b Full article

Feature Request: ChatGPT Wrapped

Thanks for sharing this, @ygchaudhary. This is a great idea, and a lot of what you described is actually starting to exist with Your Year with ChatGPT . The current recap already offers an optional year-end summary with personalized insights based on your conversations for eligible users, while using the same privacy controls as your ChatGPT history. ( help.openai.com ) Your suggestions go well beyond the current experience though. Things like AI identities, achievement badges, personalized artwork, learning timelines, richer project milestones, and more granular privacy controls would make it even more engaging. We'll also pass this feedback along to the team for logging. It's helpful to see detailed suggestions like this, especially around making the recap feel more meaningful and personalized over time. -Mark G.

Official ChatGPT and Codex Integration for ComfyUI
OpenAI Community 2026-06-28 16:54 UTC Score 40.0 AI-116-20260628-social-media-73a654a1 Full article

Official ChatGPT and Codex Integration for ComfyUI

Thanks for putting this together, @Oyla1972. This is a well thought out request, and the real world sports roster example does a great job of illustrating why an official ComfyUI integration could be valuable. Having ChatGPT assist with workflow design and troubleshooting, alongside Codex for generating helper scripts and automation, is an interesting use case. Your point about safe local file handling and avoiding frontend API key exposure is also an important consideration. We'll make sure this feature request is shared with the team and logged. While there's nothing to announce at the moment, detailed examples like yours help provide valuable context for potential future integrations. I'm also interested to hear from others in the community who are building ComfyUI extensions or have explored OpenAI API based integrations, especially approaches that prioritize secure API key handling. -Mark G.

Stack Overflow Machine Learning Tag 2026-06-28 16:54 UTC Score 41.0 AI-112-20260628-social-media-e310efcc Full article

Evaluating long-term memory limits in stateless LLM chatbots — feedback needed

I’m working on a research project exploring how stateless LLM-based chatbots handle long conversations and whether important earlier information is still reliably retained over time. My idea is to: Run a chatbot using an LLM API without any external memory system Introduce key facts early in a long conversation Continue with many unrelated messages (hundreds of turns) Later test whether the model can still correctly recall those facts at different intervals I’m planning to measure recall accuracy and how it changes as the conversation grows. Before I go deeper, I’d really appreciate feedback on: Is this a valid way to evaluate long-context memory limits? Are there better benchmarks or methods already used for this? What metrics would make this more rigorous and convincing? Any suggestions or criticism are welcome. I’m trying to make the evaluation as solid as possible before building it out. Thanks!

The Cube is Jim Henson’s little-known proto-Black Mirror masterpiece
The Verge AI 2026-06-28 16:30 UTC Score 49.0 AI-016-20260628-global-ai-ne-8753bb7c Full article

The Cube is Jim Henson’s little-known proto-Black Mirror masterpiece

I'm sure we're all familiar with Dark Crystal, so we know that Jim Henson can be weird and tackle slightly more mature subject matter. But there is little in his oeuvre that is quite as mind-bending as the Muppetless The Cube. This 1969 teleplay was produced for an NBC anthology series called Experiment in Television, […]

Pro account mistakenly banned on June 25, many affected users still waiting for a response
OpenAI Community 2026-06-28 16:11 UTC Score 40.0 AI-116-20260628-social-media-b5cc8042 Full article

Pro account mistakenly banned on June 25, many affected users still waiting for a response

Hi @onect, Thanks for sharing the details, and I'm sorry to hear how disruptive this has been. I was able to confirm that your appeal is now being handled by our specialized support team for review. At this point, we're not able to provide a timeline, as each appeal requires a thorough review by the team. To help keep everything in one place and ensure a streamlined support flow, I'm going to close this thread. We'll continue the conversation through your existing support case instead. Thanks for your patience and understanding. -Mark G.

Techcrunch 2026-06-28 16:05 UTC Score 33.0 USR-0001-20260628-global-ai-ne-0dbe3aa0 Full article

TechCrunch Mobility: All eyes on Tesla FSD

Welcome back to TechCrunch Mobility, your hub for the future of transportation and now, more than ever, how AI is playing a part.

Latest news bulletin | June 28th, 2026 – Evening
Euronews AI 2026-06-28 16:00 UTC Score 40.0 AI-164-20260628-regional-ai--f5e2496a Full article

Latest news bulletin | June 28th, 2026 – Evening

Catch up with the most important stories from around Europe and beyond this June 28th, 2026 - latest news, breaking news, World, Business, Entertainment, Politics, Culture, Travel.

Tasking ChatGPT with collecting product URLs from a CSV BOM
OpenAI Community 2026-06-28 15:55 UTC Score 37.0 AI-116-20260628-social-media-875a8ff4 Full article

Tasking ChatGPT with collecting product URLs from a CSV BOM

If I were you: Create a project Describe exactly that in a project. Save to project things you like from chat. eventually , you may want to formalize concepts into “system_architecture.md”files Depending on how you like to build projects you might just need to ask the AI to simply take that as a build spec and build a python program which accepts a csv input and outputs as requested. Or if you’re like me, you may want to document specs first and plan out the project before starting to write code. It really depends on if you need a simple program or if you’re designing a larger project.

Projects already organize conversations and files. They should also organize the custom agents created to work within those projects.
OpenAI Community 2026-06-28 15:46 UTC Score 58.0 AI-116-20260628-social-media-04bcda4b Full article

Projects already organize conversations and files. They should also organize the custom agents created to work within those projects.

Feature Request: Associate Custom Agents with Projects Summary Allow users to associate one or more custom agents with a ChatGPT Project so those agents are immediately visible and accessible whenever the project is opened. This would create a natural relationship between Projects and Agents , making Projects the central workspace for long-term development efforts. Problem The Agent Library is an excellent place to create and manage custom agents. However, once an agent is created, there is currently no way to associate it with the project it was built to support. As projects grow, users often create multiple specialized agents dedicated to a single project. Examples: Steward CTO Security Officer (CISO) Builder Documentation Writer QA Reviewer Research Assistant When returning to a project days or weeks later, users must leave the Project, open the Agent Library, and manually locate the correct agent. For users managing multiple projects and dozens of custom agents, this becomes increasingly difficult. Proposed Solution Add an Assigned Agents section to every Project. Projects would continue organizing conversations and files, while also displaying the agents specifically assigned to that project. For example: Project: InvestorOS ────────────────────────── Chats Files Knowledge Assigned Agents • Steward • CTO • Security Officer • Builder • Documentation Writer Selecting an agent would immediately launch a conversation with that agent while maintaining the context of the curr…

Heavyweights Deliver a Spectacular Finish in Qingdao
Euronews AI 2026-06-28 15:33 UTC Score 40.0 AI-164-20260628-regional-ai--a7f99fad Full article

Heavyweights Deliver a Spectacular Finish in Qingdao

The Qingdao Grand Prix ended in style as the heavyweights took centre stage. Yahor Varapayeu claimed gold at -90kg, world No.1 Anna Monta Olek won her fourth Grand Prix title, Adam Sangariev celebrated a maiden gold, Elis Startseva dominated +78kg, and Tamerlan Bashaev returned to the top.