BAIR Blog

Scaling Up Reinforcement Learning for Traffic Smoothing...

Training Diffusion Models with Reinforcement Learning ...

The Shift from Models to Compound AI Systems

AI caught everyone’s attention in 2023 with Large Language Mode...

Ghostbuster: Detecting Text Ghostwritten by Large Langu...

The structure of Ghostbuster, our new state-of-the-art metho...

Asymmetric Certified Robustness via Feature-Convex Neur...

Asymmetric Certified Robustness via Feature-Convex Neural Networks ...

TinyAgent: Function Calling at the Edge

The ability of LLMs to execute commands through plain langu...

Modeling Extremely Large Images with xT

As computer vision researchers, we believe that every pixel can...

2024 BAIR Graduate Directory

Every year, the Berkeley Artificial Intelligence Research (BAIR) La...

Linguistic Bias in ChatGPT: Language Models Reinforce D...

Sample language model responses to different varieties of En...

How to Evaluate Jailbreak Methods: A Case Study with th...

When we began studying jailbreak evaluations, we found a fascin...

Are We Ready for Multi-Image Reasoning? Launching VHs: ...

Humans excel at processing vast arrays of visual information, a...

Virtual Personas for Language Models via an Anthology o...

We introduce Anthology, a method for conditioning LLMs to r...

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.