Best AI papers explained

En podcast af Enoch H. Kang

Prøv Podimo gratis! i 30 dage

Et univers fyldt med hundredvis af eksklusive podcasts & lydbøger, klik her for at prøve

Annoncering

512 Episoder

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Udgivet: 11.10.2025
Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs
Udgivet: 9.10.2025
Learning dynamics of LLM finetuning
Udgivet: 9.10.2025
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Udgivet: 9.10.2025
OpenAI Agent Builder and n8n: Orchestrating Reasoning Versus Automating Process
Udgivet: 8.10.2025
Training Agents Inside of Scalable World Models
Udgivet: 8.10.2025
Small Language Models are the Future of Agentic AI
Udgivet: 7.10.2025
Activation Steering in Generative Settings via Contrastive Causal Mediation Analysis
Udgivet: 6.10.2025
Eliciting Secret Knowledge from Language Models
Udgivet: 6.10.2025
Temporal difference flow
Udgivet: 6.10.2025
Personalized reasoning: just-in-time personalization and why LLMs fail at it
Udgivet: 5.10.2025
Prompt Curriculum Learning for Efficient LLM Post-Training
Udgivet: 5.10.2025
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
Udgivet: 4.10.2025
Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward
Udgivet: 4.10.2025
Learning to summarize user information for personalized reinforcement learning from human feedback
Udgivet: 4.10.2025
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
Udgivet: 3.10.2025
LIMI: Less is More for Agency
Udgivet: 1.10.2025
LoRA Without Regret
Udgivet: 1.10.2025
Actor-Critic without Actor: Critic-Guided Denoising for RL
Udgivet: 29.9.2025
DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?
Udgivet: 29.9.2025

3 / 26

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

512 Episoder

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Learning dynamics of LLM finetuning

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

OpenAI Agent Builder and n8n: Orchestrating Reasoning Versus Automating Process

Training Agents Inside of Scalable World Models

Small Language Models are the Future of Agentic AI

Activation Steering in Generative Settings via Contrastive Causal Mediation Analysis

Eliciting Secret Knowledge from Language Models

Temporal difference flow

Personalized reasoning: just-in-time personalization and why LLMs fail at it

Prompt Curriculum Learning for Efficient LLM Post-Training

Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning

Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward

Learning to summarize user information for personalized reinforcement learning from human feedback

Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF

LIMI: Less is More for Agency

LoRA Without Regret

Actor-Critic without Actor: Critic-Guided Denoising for RL

DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?