AI Info

Curated AI news, research, and engineering updates

AI Info brings together AI news, research posts, engineering write-ups, and product announcements from major labs, companies, and communities in one crawlable hub.

RCCLX: Innovating GPU Communications on AMD Platforms

We are open-sourcing the initial version of RCCLX – an enhanced version of RCCL that we developed and tested on Meta’s internal workloads. RCCLX is fully integrated with Torchcomms and aims to empower researchers and developers to accelerate innovation, regardless of their chosen backend. Communication patterns for AI models are constantly evolving, as are hardware [...] Source: Engineering at Meta.

Tue, 24 Feb 2026 21:30:54 +0000

Scaling LLM Inference: Innovations in Tensor Parallelism, Context Parallelism, and Expert Parallelism

At Meta, we are constantly pushing the boundaries of LLM inference systems to power applications such as the Meta AI App. We’re sharing how we developed and implemented advanced parallelism techniques to optimize key performance metrics related to resource efficiency, throughput, and latency. The rapid evolution of large language models (LLMs) has ushered in a [...] Source: Engineering at Meta.

Fri, 17 Oct 2025 16:00:50 +0000

LLMs Are the Key to Mutation Testing and Better Compliance

Following our keynote presentations at FSE 2025 and Eurostar 2025, we’re delving further into the development of Meta’s Automated Compliance Hardening (ACH) tool, an LLM-based tool for software testing that is automating aspects of compliance adherence at Meta, while accelerating developer and product velocity. By leveraging LLMs we’ve been able to overcome the barriers that [...] Source: Engineering at Meta.

Tue, 30 Sep 2025 16:00:08 +0000

Meta 3D AssetGen: Generating 3D Worlds With AI

Imagine being able to use AI to create 3D virtual worlds using prompts as easily as you can generate images. The intersection of AI and VR was one of the biggest topics at Meta Connect this year. In his keynote, Mark Zuckerberg shared his vision of a future where anyone can create virtual worlds using [...] Source: Engineering at Meta.

Mon, 29 Sep 2025 14:00:42 +0000

Meta’s Infrastructure Evolution and the Advent of AI

Over the past 21 years, Meta has grown exponentially from a small social network connecting a few thousand people in a handful of universities in the U.S. into several apps and novel hardware products that serve over 3.4 billion people throughout the world. Our infrastructure has evolved significantly over the years, growing from a [...] Source: Engineering at Meta.

Mon, 29 Sep 2025 13:00:15 +0000

Building Supercharger: How Rocket Close optimized title operations with agentic AI

In this post, we explore how Rocket Close built a solution using Strands Agents, large language models (LLMs), Amazon Bedrock, Amazon Bedrock Knowledge Bases, and Model Context Protocol (MCP) tools. We cover solution features, the rationale for the technology stack, lessons learned, and the business impact at Rocket Close.

Fri, 12 Jun 2026 20:43:56 +0000

Build a meeting prep and follow-up assistant with Amazon Quick and Cisco Webex MCP servers

This post shows how to build a custom meeting prep and follow-up assistant using Amazon Quick and Cisco Webex MCP servers. From a single prompt, the agent finds an upcoming Webex meeting, reviews prior meeting summaries and transcripts, and pulls related Vidcast highlights and transcript context. It then searches Webex message threads for unresolved follow-ups and creates a concise prep brief. After the meeting, the same assistant can summarize the discussion and identify action items. It can also find related Vidcast updates and draft a follow-up message for the right Webex space.

Fri, 12 Jun 2026 14:49:40 +0000

From PDFs to insights: Architecting an intelligent document processing pipeline with AWS generative AI services

This post outlines the development of a cost-effective and scalable intelligent document processing pipeline on AWS, powered by Amazon Bedrock and its features. Amazon Bedrock Data Automation (BDA) is a managed service within Amazon Bedrock that automates the extraction of insights from documents. We demonstrate how BDA extracts and analyzes document content, while Strands Agents hosted on Amazon Bedrock AgentCore Runtime coordinate specialized processing tasks and Amazon Bedrock Knowledge Bases enable contextual understanding across multiple documents. By combining these capabilities within a unified architecture, organizations can transform their document processing workflows with minimal development effort.

Fri, 12 Jun 2026 14:43:11 +0000

Built from the inside out: How AWS Professional Services became a frontier team first

AWS Professional Services (AWS ProServe) compressed engagement timelines from months to days, not by adding artificial intelligence (AI) tools to an existing process, but by fundamentally rebuilding how we deliver from the inside out. In this post, we share how AWS ProServe became a frontier team, the practices that enabled it, and what your engineering organization can take from our experience.

Fri, 12 Jun 2026 13:00:10 +0000

Extract Data with On-demand and Batch Pipelines Dynamically

This post demonstrates an intelligent document processing pipeline that offers both on-demand and batch inference options on Amazon Bedrock, giving flexibility in document processing time and cost.

Thu, 11 Jun 2026 19:40:33 +0000

Lights Out, Systems On: Validating Instant Power Loss Readiness

We’re introducing Instantaneous PowerLoss Storm, a new testing paradigm within Meta’s infrastructure for handling and mitigating instant or zero-notice power loss in our data centers. We’re sharing how we built readiness to tolerate instant failures into our existing systems with defense-in-depth strategies, the tradeoffs made in implementing it, and how we validated our readiness. Disaster preparedness [...] Source: Engineering at Meta.

Wed, 03 Jun 2026 17:00:44 +0000

SilverTorch: Index as Model — A New Retrieval Paradigm for Recommendation Systems

We’re introducing SilverTorch, a reimagining of recommendation systems that brings all retrieval components for user-generated content under a unified architecture. SilverTorch shows up to 23.7x higher throughput than state-of-the-art approaches. It also shows 20.9x better compute cost efficiency than a CPU-based solution while improving accuracy. Our research paper, “SilverTorch: A [...] Source: Engineering at Meta.

Tue, 26 May 2026 16:00:01 +0000

Reel Friends: Building Social Discovery that Scales to Billions

On its face, the new Friend Bubbles feature looks simple enough. It highlights Reels your friends have watched and reacted to. But sometimes the features that seem the most straightforward require the deepest engineering work. On this episode of the Meta Tech Podcast, Pascal Hartig chats with Subasree and Joseph, two software engineers from the Facebook [...] Source: Engineering at Meta.

Wed, 13 May 2026 13:00:44 +0000

Migrating Data Ingestion Systems at Meta Scale

Meta’s data ingestion system, which our engineering teams leverage for up-to-date snapshots of the social graph, has recently undergone a significant revamp to enhance its reliability at scale. Moving from our legacy system to our new architecture required a large-scale migration of our entire data ingestion system. We’re sharing the solutions and strategies that enabled [...] Source: Engineering at Meta.

Tue, 12 May 2026 16:00:57 +0000

Labyrinth 1.1: Making End-to-End Encrypted Backups Even More Reliable

We’re rolling out version 1.1 of Labyrinth, the encrypted storage system and protocol that secures messages and history on Messenger. Labyrinth 1.1 enhances the reliability of end-to-end encrypted backups with a new sub-protocol that helps messages survive the loss of a device, a switched device, and long gaps between sign-ins. Read our updated white paper, [...] Source: Engineering at Meta.

Mon, 11 May 2026 16:00:55 +0000

#497 – Biggest Mysteries in Physics: Antimatter, Dark Energy & ToE – Don Lincoln

Don Lincoln is a particle physicist at Fermilab who has spent decades working at the frontiers of high energy physics.
CONTACT LEX: Feedback – give feedback to Lex: https://lexfridman.com/survey AMA – submit questions, videos or call-in: https://lexfridman.com/ama Hiring – join our team: https://lexfridman.com/hiring Other – other ways to get in touch: https://lexfridman.com/contact
EPISODE LINKS: https://facebook.com/Dr.Don.Lincoln/ https://drdonlincoln.com/ https://bit.ly/4nHeNiF https://bit.ly/3PCIW67 https://x.com/DrDonLincoln https://amzn.to/4uYbkOZ https://shop.thegreatcourses.com/don-lincoln https://adbl.co/4wGioRV https://www.youtube.com/fermilab https://www.fnal.gov/ https://x.com/fermilab
PODCAST LINKS: https://lexfridman.com/podcast https://apple.co/2lwqZIr https://spoti.fi/2nEwCF8 https://lexfridman.com/feed/podcast/ https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 https://www.youtube.com/lexclips

Fri, 29 May 2026 16:22:02 +0000

#496 – FFmpeg: The Incredible Technology Behind Video on the Internet

Jean-Baptiste Kempf is lead developer of VLC and president of VideoLAN. Kieran Kunhya is a longtime FFmpeg contributor, codec engineer, and the person behind the now-infamous FFmpeg account on X.
Transcript: https://lexfridman.com/ffmpeg-transcript
EPISODE LINKS: https://x.com/FFmpeg https://ffmpeg.org/ https://www.videolan.org/ https://x.com/videolan https://jbkempf.com/ https://www.linkedin.com/in/jbkempf/ https://github.com/jbkempf https://x.com/kierank_ https://bit.ly/3OORhmC https://github.com/kierank
OUTLINE:
(00:00) – Introduction
(03:00) – Sponsors, Comments, and Reflections
(10:48) – Weirdest things VLC opens
(15:12) – How video playback works
(24:33) – Video codecs and containers
(35:20) – FFmpeg explained
(56:20) – Linus Torvalds
(1:00:59) – Turning down millions to keep VLC ad-free
(1:15:17) – FFmpeg & Google drama
(1:34:31) – FFmpeg developers
(1:41:08) – VLC and FFmpeg
(1:45:42) – History of FFmpeg
(1:48:59) – Reverse engineering codecs
(2:02:14) – FFmpeg testing
(2:06:21) – Assembly code (handwritten)
(2:30:39) – Rust programming language
(2:39:55) – FFmpeg and Libav fork
(2:48:17) – Open source burnout
(2:56:04) – x264 and internet video
(3:09:20) – Video compression basics
(3:16:17) – CIA and fake VLC
(3:26:52) – Ultra low latency streaming
(3:44:20) – AV2 codec and video patents
(3:54:12) – VLC backdoors
(4:04:27) – Video archiving
(4:11:04) – Future of FFmpeg and VLC

Wed, 06 May 2026 22:06:47 +0000

#495 – Vikings, Ragnar, Berserkers, Valhalla & the Warriors of the Viking Age

Lars Brownworth is a historian, teacher, podcaster, and author specializing in Viking history, medieval Europe, and the Byzantine Empire.
Transcript: https://lexfridman.com/lars-brownworth-transcript
EPISODE LINKS: https://larsbrownworth.com/ https://www.amazon.com/Sea-Wolves-History-Vikings/dp/1909979120 https://amzn.to/4sHY0xw https://12byzantinerulers.com/ https://apple.co/4sgSxNi

Thu, 09 Apr 2026 17:43:17 +0000

#494 – Jensen Huang: NVIDIA – The $4 Trillion Company & the AI Revolution

Jensen Huang is the co-founder and CEO of NVIDIA, the world’s most valuable company and the engine powering the AI computing revolution.
Transcript: https://lexfridman.com/jensen-huang-transcript
EPISODE LINKS: https://nvidia.com https://x.com/nvidia https://x.com/NVIDIAAI https://youtube.com/@nvidia https://www.instagram.com/nvidia/ https://www.linkedin.com/company/nvidia/ https://www.facebook.com/NVIDIA/ https://github.com/NVIDIA https://developer.nvidia.com/nemotron

Mon, 23 Mar 2026 16:28:42 +0000

#493 – Jeff Kaplan: World of Warcraft, Overwatch, Blizzard, and Future of Gaming

Jeff Kaplan is a legendary Blizzard game designer of World of Warcraft and Overwatch, now preparing to launch a new game, The Legend of California, from his new studio Kintsugiyama – available to wishlist on Steam today, with alpha later in March.
EPISODE LINKS: https://store.steampowered.com/app/2550530/The_Legend_of_California https://www.kintsugiyama.com/

Wed, 11 Mar 2026 20:37:33 +0000