AI

VernamVeil: A Fresh Take on Function-Based Encryption

Cryptography often feels like an ancient dark art, full of math-heavy concepts, ...

Big Context Windows Are a Big Deal

Last week, I got my hands on Google’s newest generative model: Gemini 1.5, a mul...

Machine Learning's Most Useful Multitool: Embeddings

Embeddings are one of the most versatile techniques in machine learning, and a c...

You, Me, and My AI-Generated Alternate Identity

@azusagakuyuki is a young Japanese motorcyclist with long hair, a delicate chin,...

An Impractical Guide to AI on Google Cloud

Your manager holds a gun to the side of your head and says, “Sell me a Google Cl...

Build an Animal/Object Tracking Camera App with TensorF...

Introducing PetCam: a non-invasive machine-learning-powered pet tracker that run...

Transformers, Explained: Understand the Model Behind GP...

You know that expression When you have a hammer, everything looks like a nail? W...

How I, One Humble Engineer, Deal With Imposter Syndrome

Let me start this post off by saying that imposter syndrome has already been cov...

AI Dubs Over Subs? Translating and Dubbing Videos with AI

Alongside cooking for myself and walking laps around the house, Japanese cartoon...

DALL·E Explained in Under 5 Minutes

It seems like every few months, someone publishes a machine learning paper or de...

AlphaFold 2 Explained: A Semi-Deep Dive

At the end of last month, DeepMind, Google’s machine learning research branch kn...

Modeling Extremely Large Images with xT

As computer vision researchers, we believe that every pixel can...

2024 BAIR Graduate Directory

Every year, the Berkeley Artificial Intelligence Research (BAIR) La...

How to Evaluate Jailbreak Methods: A Case Study with th...

When we began studying jailbreak evaluations, we found a fascin...

Are We Ready for Multi-Image Reasoning? Launching VHs: ...

Humans excel at processing vast arrays of visual information, a...

TinyAgent: Function Calling at the Edge

The ability of LLMs to execute commands through plain langu...

Linguistic Bias in ChatGPT: Language Models Reinforce D...

Sample language model responses to different varieties of En...

Defending against Prompt Injection with Structured Quer...

Recent advances in Large Language Models (LLMs) enable exciting LLM...

Scaling Up Reinforcement Learning for Traffic Smoothing...

Training Diffusion Models with Reinforcement Learning ...

Virtual Personas for Language Models via an Anthology o...

We introduce Anthology, a method for conditioning LLMs to r...

From motor control to embodied intelligence

Using human and animal motions to teach robots to dribble a ball, and simulated ...

Start building with Gemini 2.0 Flash and Flash-Lite

Gemini 2.0 Flash-Lite is now generally available in the Gemini API for productio...

Gemini 2.0 is now available to everyone

We’re announcing new updates to Gemini 2.0 Flash, plus introducing Gemini 2.0 Fl...

Updating the Frontier Safety Framework

Our next iteration of the FSF sets out stronger security protocols on the path t...

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.