MarkTechPost

Tencent AI Lab Introduces Unsupervised Prefix Fine-Tuning (UPFT): An Efficient Method that Trains Models on only the First 8-32 Tokens of Single Self-Generated Solutions

Mar 2, 2025

Unleashing a more efficient approach to fine-tuning reasoning in large language ...

Meet AI Co-Scientist: A Multi-Agent System Powered by Gemini 2.0 for Accelerating Scientific Discovery

Mar 1, 2025

Biomedical researchers face a significant dilemma in their quest for scientific ...

This AI Paper Introduces UniTok: A Unified Visual Tokenizer for Enhancing Multimodal Generation and Understanding

Mar 1, 2025

With researchers aiming to unify visual generation and understanding into a sing...

IBM AI Releases Granite 3.2 8B Instruct and Granite 3.2 2B Instruct Models: Offering Experimental Chain-of-Thought Reasoning Capabilities

Mar 1, 2025

Large language models (LLMs) leverage deep learning techniques to understand and...

This AI Paper Introduces Agentic Reward Modeling (ARM) and REWARDAGENT: A Hybrid AI Approach Combining Human Preferences and Verifiable Correctness for Reliable LLM Training

Mar 1, 2025

Large Language Models (LLMs) rely on reinforcement learning techniques to enhanc...

Google AI Introduces PlanGEN: A Multi-Agent AI Framework Designed to Enhance Planning and Reasoning in LLMs through Constraint-Guided Iterative Verification and Adaptive Algorithm Selection

Mar 1, 2025

Large language models have made remarkable strides in natural language processin...

This AI Paper from USC Introduces FFTNet: An Adaptive Spectral Filtering Framework for Efficient and Scalable Sequence Modeling

Mar 1, 2025

Deep learning models have significantly advanced natural language processing and...

Thinking Harder, Not Longer: Evaluating Reasoning Efficiency in Advanced Language Models

Mar 1, 2025

Large language models (LLMs) have progressed beyond basic natural language proce...

Revolutionizing Robot Learning: How Meta’s Aria Gen 2 Enables 400% Faster Training with Egocentric AI

Feb 28, 2025

The evolution of robotics has long been constrained by slow and costly training ...

DeepSeek AI Releases Fire-Flyer File System (3FS): A High-Performance Distributed File System Designed to Address the Challenges of AI Training and Inference Workload

Feb 28, 2025

The advancement of artificial intelligence has ushered in an era where data volu...

LEAPS: A Neural Sampling Algorithm for Discrete Distributions via Continuous-Time Markov Chains (‘Discrete Diffusion’)

Feb 28, 2025

Sampling from probability distributions with known density functions (up to norm...

Beyond a Single LLM: Advancing AI Through Multi-Model Collaboration

Feb 28, 2025

The rapid advancement of LLMs has been driven by the belief that scaling model s...

Convergence AI Releases WebGames: A Comprehensive Benchmark Suite Designed to Evaluate General-Purpose Web-Browsing AI Agents

Feb 28, 2025

AI agents are becoming more advanced and capable of handling complex tasks acros...

Transforming Speech Generation: How the Emilia Dataset Revolutionizes Multilingual Natural Voice Synthesis

Feb 28, 2025

Speech generation technology has advanced considerably in recent years, yet ther...

Elevating AI Reasoning: The Art of Sampling for Learnability in LLM Training

Feb 28, 2025

Reinforcement learning (RL) has been a core component in training large language...

Cohere AI Releases Command R7B Arabic: A Compact Open-Weights AI Model Optimized to Deliver State-of-the-Art Arabic Language Capabilities to Enterprises in the MENA Region

Feb 28, 2025

For many years, organizations in the MENA region have encountered difficulties w...

Microsoft AI Releases Phi-4-multimodal and Phi-4-mini: The Newest Models in Microsoft’s Phi Family of Small Language Models (SLMs)

Feb 27, 2025

In today’s rapidly evolving technological landscape, developers and organization...

Simplifying Self-Supervised Vision: How Coding Rate Regularization Transforms DINO & DINOv2

Feb 27, 2025

Learning useful features from large amounts of unlabeled images is important, an...

DeepSeek AI Releases DualPipe: A Bidirectional Pipeline Parallelism Algorithm for Computation-Communication Overlap in V3/R1 Training

Feb 27, 2025

The task of training deep neural networks, especially those with billions of par...

Meta AI Introduces SWE-RL: An AI Approach to Scale Reinforcement Learning based LLM Reasoning for Real-World Software Engineering

Feb 27, 2025

Modern software development faces a multitude of challenges that extend beyond s...

Monte Carlo Tree Diffusion: A Scalable AI Framework for Long-Horizon Planning

Feb 27, 2025

Diffusion models are promising in long-horizon planning by generating complex tr...

SongGen: A Fully Open-Source Single-Stage Auto-Regressive Transformer Designed for Controllable Song Generation

Feb 27, 2025

Creating songs from text is difficult because it involves generating vocals and ...

Hume Introduces Octave TTS: A New Text-to-Speech Model that Creates Custom AI Voices with Tailored Emotions

Feb 27, 2025

In the rapidly evolving field of digital communication, traditional text-to-spee...

LongPO: Enhancing Long-Context Alignment in LLMs Through Self-Optimized Short-to-Long Preference Learning

Feb 26, 2025

LLMs have exhibited impressive capabilities through extensive pretraining and al...
