AI-Research

15 posts

How open source AI can improve population estimates, sustainable energy, and the delivery of climate change interventions
Data for Good at Meta is open-sourcing the data used to train our AI-powered population maps. We’re hoping that researchers and other organizations around the world will be able to leverage these tools to assist with a wide range of projects, including those on climate adaptation, public health, and disaster response. The dataset and code [...]

10/3/2024

How PyTorch powers AI training and inference
Learn about new PyTorch advancements for LLMs and how PyTorch is enhancing every aspect of the LLM lifecycle. In this talk from AI Infra @ Scale 2024, software engineers Wanchao Liang and Evan Smothers are joined by Meta research scientist Kimish Patel to discuss our newest features and tools that enable large-scale training, memory-efficient [...]

8/23/2024
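
The talk above covers PyTorch features for large-scale, memory-efficient LLM training. As a generic illustration of that idea, rather than the specific tooling discussed in the talk, the sketch below wraps a placeholder transformer model in PyTorch’s FullyShardedDataParallel (FSDP), which shards parameters, gradients, and optimizer state across GPUs; the model dimensions and launch setup are assumptions for the example.

```python
# Minimal FSDP sketch; assumes launch via `torchrun`, which sets RANK/WORLD_SIZE/LOCAL_RANK.
import os
import torch
import torch.nn as nn
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group(backend="nccl")
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

# Placeholder model standing in for an LLM; the dimensions are arbitrary.
model = nn.Sequential(
    nn.Embedding(32_000, 1024),
    nn.TransformerEncoder(
        nn.TransformerEncoderLayer(d_model=1024, nhead=16, batch_first=True),
        num_layers=4,
    ),
    nn.Linear(1024, 32_000),
).cuda()

# FSDP shards parameters, gradients, and optimizer state across ranks,
# so no single GPU has to hold the full model and optimizer state.
model = FSDP(model)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
```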

Inside the hardware and co-design of MTIA
In this talk from AI Infra @ Scale 2024, Joel Colburn, a software engineer at Meta, technical lead Junqiang Lan, and software engineer Jack Montgomery discuss the second generation of MTIA, Meta’s in-house training and inference accelerator. They cover the co-design process behind building the second generation of Meta’s first-ever custom silicon for AI workloads, [...]

8/22/2024

Bringing Llama 3 to life
Llama 3 is Meta’s most capable openly available LLM to date, and the recently released Llama 3.1 will enable new workflows, such as synthetic data generation and model distillation, with unmatched flexibility, control, and state-of-the-art capabilities that rival the best closed-source models. At AI Infra @ Scale 2024, Meta engineers discussed every step of how we [...]

8/21/2024
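
The excerpt above names synthetic data generation and model distillation as workflows that Llama 3.1 opens up. Purely as a generic sketch of what knowledge distillation means, and not Meta’s actual pipeline, here is a minimal PyTorch distillation loss that trains a student model to match a teacher’s temperature-softened token distribution; the temperature and tensor shapes are placeholder assumptions.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Soft-label distillation: KL divergence between the teacher's and
    the student's temperature-scaled output distributions."""
    teacher_probs = F.softmax(teacher_logits / temperature, dim=-1)
    student_log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    # batchmean reduction with a T^2 factor is the standard formulation.
    return F.kl_div(student_log_probs, teacher_probs, reduction="batchmean") * temperature**2

# Toy usage with random logits standing in for model outputs
# (vocabulary of 32,000, batch of 4 sequences of length 16).
student_logits = torch.randn(4, 16, 32_000, requires_grad=True)
teacher_logits = torch.randn(4, 16, 32_000)
loss = distillation_loss(student_logits, teacher_logits)
loss.backward()
```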

Aparna Ramani discusses the future of AI infrastructure
Delivering new AI technologies at scale also means rethinking every layer of our infrastructure, from silicon and software systems to our data center designs. For the second year in a row, Meta’s engineering and infrastructure teams returned for the AI Infra @ Scale conference, where they discussed the challenges of scaling up an [...]

8/20/2024

How Meta animates AI-generated images at scale
We launched Meta AI with the goal of giving people new ways to be more productive and unlock their creativity with generative AI (GenAI). But GenAI also comes with challenges of scale. As we deploy new GenAI technologies at Meta, we also focus on delivering these services to people as quickly and efficiently as possible. [...]

8/14/2024

RoCE networks for distributed AI training at scale
AI networks play an important role in interconnecting tens of thousands of GPUs, forming the foundational infrastructure for training and enabling large models with hundreds of billions of parameters, such as Llama 3.1 405B. This week at ACM SIGCOMM 2024 in Sydney, Australia, we are sharing details on the network we have built at Meta [...]

8/5/2024
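
The post above is about the RoCE (RDMA over Converged Ethernet) fabric that carries collective-communication traffic between GPUs during distributed training. As a rough, generic sketch of the software side, and not Meta’s configuration, the example below initializes an NCCL process group and runs the kind of all-reduce whose traffic a training network like this has to sustain; the launch setup and tensor size are assumptions.

```python
import os
import torch
import torch.distributed as dist

# Assumes launch with `torchrun --nproc_per_node=8 ...`, which sets
# RANK, WORLD_SIZE, and LOCAL_RANK for each process.
dist.init_process_group(backend="nccl")  # NCCL carries its collectives over the RDMA/RoCE fabric when available
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

# Gradient all-reduce is the bandwidth-heavy collective that the
# training network must sustain across tens of thousands of GPUs.
grad_shard = torch.randn(1024, 1024, device="cuda")
dist.all_reduce(grad_shard, op=dist.ReduceOp.SUM)
grad_shard /= dist.get_world_size()

dist.destroy_process_group()
```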

Building new custom silicon for Meta’s AI workloads

4/11/2024

Introducing the next-gen Meta Training and Inference Accelerator

4/10/2024

Building Meta’s GenAI Infrastructure
Marking a major investment in Meta’s AI future, we are announcing two 24k-GPU clusters. We are sharing details on the hardware, network, storage, design, performance, and software that help us extract high throughput and reliability for various AI workloads. We use this cluster design for Llama 3 training. We are strongly committed to open [...]

3/12/2024

How Meta is advancing GenAI
What’s going on with generative AI (GenAI) at Meta? And what does the future have in store? In this episode of the Meta Tech Podcast, Meta engineer Pascal Hartig (@passy) speaks with Devi Parikh, an AI research director at Meta. They cover a wide range of topics, including the history and future of GenAI and the most [...]

1/11/2024

How Meta is creating custom silicon for AI
Olivia Wu, Meta’s Technical Lead for Infra Silicon, discusses the design and development of Meta’s first-generation AI inference accelerator. [...]

10/18/2023

Code Llama: Meta’s state-of-the-art LLM for coding

8/24/2023

Meta Connect 2023: September 27 – 28

8/14/2023

Meta introduces its first-generation AI inference accelerator

5/18/2023