Posts

Implementation and usage mindsets

Implementation and usage mindsets

Feb 11, 2025 0

A one-dimensional take on the enduring static-versus-dynamic debat...

Geographic hulls

Feb 11, 2025 0

Seven lines of Python code. Can you tell what th...

Modelling data relationships with C# types

Feb 11, 2025 0

A C# example implementation of Ghosts of Departed Proofs. ...

Dependency inversion without inversion of control

Dependency inversion without inversion of control

Feb 11, 2025 0

Here, have a sandwich. For years I've been think...

Modelling data relationships with F# types

Modelling data relationships with F# types

Feb 11, 2025 0

An F# example implementation of Ghosts of Departed Proofs. ...

Recawr Sandwich

Recawr Sandwich

Feb 11, 2025 0

A pattern variation. After writing the articles ...

Adaptive Inference Budget Management in Large Language Models through Constrained Policy Optimization

Adaptive Inference Budget Management in Large Language ...

Feb 11, 2025 0

Large Language Models (LLMs) have demonstrated remarkable capabilities in comple...

Google DeepMind Introduces AlphaGeometry2: A Significant Upgrade to AlphaGeometry Surpassing the Average Gold Medalist in Solving Olympiad Geometry

Google DeepMind Introduces AlphaGeometry2: A Significan...

Feb 11, 2025 0

The International Mathematical Olympiad (IMO) is a globally recognized competiti...

Efficient Alignment of Large Language Models Using Token-Level Reward Guidance with GenARM

Efficient Alignment of Large Language Models Using Toke...

Feb 11, 2025 0

Large language models (LLMs) must align with human preferences like helpfulness ...

Tutorial to Fine-Tuning Mistral 7B with QLoRA Using Axolotl for Efficient LLM Training

Tutorial to Fine-Tuning Mistral 7B with QLoRA Using Axo...

Feb 11, 2025 0

In this tutorial, we demonstrate the workflow for fine-tuning Mistral 7B using Q...

Shanghai AI Lab Releases OREAL-7B and OREAL-32B: Advancing Mathematical Reasoning with Outcome Reward-Based Reinforcement Learning

Shanghai AI Lab Releases OREAL-7B and OREAL-32B: Advanc...

Feb 11, 2025 0

Mathematical reasoning remains a difficult area for artificial intelligence (AI)...

Advancing Scalable Text-to-Speech Synthesis: Llasa’s Transformer-Based Framework for Improved Speech Quality and Emotional Expressiveness

Advancing Scalable Text-to-Speech Synthesis: Llasa’s Tr...

Feb 11, 2025 0

Recent advancements in LLMs, such as the GPT series and emerging “o1” models, hi...

LLMDet: How Large Language Models Enhance Open-Vocabulary Object Detection

LLMDet: How Large Language Models Enhance Open-Vocabula...

Feb 11, 2025 0

Open-vocabulary object detection (OVD) aims to detect arbitrary objects with use...

Vintix: Scaling In-Context Reinforcement Learning for Generalist AI Agents

Vintix: Scaling In-Context Reinforcement Learning for G...

Feb 11, 2025 0

Developing AI systems that learn from their surroundings during execution involv...

The Shift from Models to Compound AI Systems

The Shift from Models to Compound AI Systems

Feb 11, 2025 0

AI caught everyone’s attention in 2023 with Large Language Mode...

Ghostbuster: Detecting Text Ghostwritten by Large Language Models

Ghostbuster: Detecting Text Ghostwritten by Large Langu...

Feb 11, 2025 0

The structure of Ghostbuster, our new state-of-the-art metho...

Asymmetric Certified Robustness via Feature-Convex Neural Networks

Asymmetric Certified Robustness via Feature-Convex Neur...

Feb 11, 2025 0

Asymmetric Certified Robustness via Feature-Convex Neural Networks ...

TinyAgent: Function Calling at the Edge

TinyAgent: Function Calling at the Edge

Feb 11, 2025 0

The ability of LLMs to execute commands through plain langu...

Modeling Extremely Large Images with xT

Modeling Extremely Large Images with xT

Feb 11, 2025 0

As computer vision researchers, we believe that every pixel can...

2024 BAIR Graduate Directory

2024 BAIR Graduate Directory

Feb 11, 2025 0

Every year, the Berkeley Artificial Intelligence Research (BAIR) La...

Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

Linguistic Bias in ChatGPT: Language Models Reinforce D...

Feb 11, 2025 0

Sample language model responses to different varieties of En...

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

How to Evaluate Jailbreak Methods: A Case Study with th...

Feb 11, 2025 0

When we began studying jailbreak evaluations, we found a fascin...

Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!

Are We Ready for Multi-Image Reasoning? Launching VHs: ...

Feb 11, 2025 0

Humans excel at processing vast arrays of visual information, a...

Virtual Personas for Language Models via an Anthology of Backstories

Virtual Personas for Language Models via an Anthology o...

Feb 11, 2025 0

We introduce Anthology, a method for conditioning LLMs to r...

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.