Posts

Implementation and usage mindsets

A one-dimensional take on the enduring static-versus-dynamic debat...

Geographic hulls

Seven lines of Python code. Can you tell what th...

Modelling data relationships with C# types

A C# example implementation of Ghosts of Departed Proofs. ...

Dependency inversion without inversion of control

Here, have a sandwich. For years I've been think...

Modelling data relationships with F# types

An F# example implementation of Ghosts of Departed Proofs. ...

Recawr Sandwich

A pattern variation. After writing the articles ...

Adaptive Inference Budget Management in Large Language ...

Large Language Models (LLMs) have demonstrated remarkable capabilities in comple...

Google DeepMind Introduces AlphaGeometry2: A Significan...

The International Mathematical Olympiad (IMO) is a globally recognized competiti...

Efficient Alignment of Large Language Models Using Toke...

Large language models (LLMs) must align with human preferences like helpfulness ...

Tutorial to Fine-Tuning Mistral 7B with QLoRA Using Axo...

In this tutorial, we demonstrate the workflow for fine-tuning Mistral 7B using Q...

Shanghai AI Lab Releases OREAL-7B and OREAL-32B: Advanc...

Mathematical reasoning remains a difficult area for artificial intelligence (AI)...

Advancing Scalable Text-to-Speech Synthesis: Llasa’s Tr...

Recent advancements in LLMs, such as the GPT series and emerging “o1” models, hi...

LLMDet: How Large Language Models Enhance Open-Vocabula...

Open-vocabulary object detection (OVD) aims to detect arbitrary objects with use...

Vintix: Scaling In-Context Reinforcement Learning for G...

Developing AI systems that learn from their surroundings during execution involv...

The Shift from Models to Compound AI Systems

AI caught everyone’s attention in 2023 with Large Language Mode...

Ghostbuster: Detecting Text Ghostwritten by Large Langu...

The structure of Ghostbuster, our new state-of-the-art metho...

Asymmetric Certified Robustness via Feature-Convex Neur...

Asymmetric Certified Robustness via Feature-Convex Neural Networks ...

TinyAgent: Function Calling at the Edge

The ability of LLMs to execute commands through plain langu...

Modeling Extremely Large Images with xT

As computer vision researchers, we believe that every pixel can...

2024 BAIR Graduate Directory

Every year, the Berkeley Artificial Intelligence Research (BAIR) La...

Linguistic Bias in ChatGPT: Language Models Reinforce D...

Sample language model responses to different varieties of En...

How to Evaluate Jailbreak Methods: A Case Study with th...

When we began studying jailbreak evaluations, we found a fascin...

Are We Ready for Multi-Image Reasoning? Launching VHs: ...

Humans excel at processing vast arrays of visual information, a...

Virtual Personas for Language Models via an Anthology o...

We introduce Anthology, a method for conditioning LLMs to r...

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.