Good Afternoon, scholar.
Your curated library for AI research, lectures, and deep learning.
🧠 Daily Concepts
Latest Research
Fresh from ArXiv, OpenAI, and more
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
IntermediateJun Zhang, Teng Wang et al.Dec 16arXiv
TimeLens studies how to teach AI not just what happens in a video, but exactly when it happens, which is called video temporal grounding (VTG).
#video temporal grounding#multimodal large language models#benchmark re-annotation
DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation
IntermediateJiawei Liu, Junqiao Li et al.Dec 24arXiv
DreaMontage is a new AI method that makes long, single-shot videos that feel smooth and connected, even when you give it scattered images or short clips in the middle.
#arbitrary frame conditioning#one-shot video generation#Diffusion Transformer
Towards Interactive Intelligence for Digital Humans
IntermediateYiyi Cai, Xuangeng Chu et al.Dec 15arXiv
Digital humans used to just copy motions; this paper makes them think, speak, and move in sync like real people.
#interactive intelligence#digital human#multimodal avatar
MOA: Multi-Objective Alignment for Role-Playing Agents
IntermediateChonghua Liao, Ke Wang et al.Dec 10arXiv
Role-playing agents need to juggle several goals at once, like staying in character, following instructions, and using the right tone.
#multi-objective alignment#role-playing agents#reinforcement learning
University Lectures
Deep dives from top institutions

Stanford CS329H: Machine Learning from Human Preferences | Autumn 2024 | Mechanism Design
Stanfordbeginner

Stanford CS329H: Machine Learning from Human Preferences I Guest Lecture: Joseph Jay Williams
Stanfordbeginner

Stanford CS329H: ML from Human Preferences | Autumn 2024 | Model-based Preference Optimization
Stanfordbeginner