▶ ai·
items 50 latest
▶ ai·
Learning, Fast and Slow: Towards LLMs That Adapt Continually
▶ ai·
Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training
▶ ai·
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
▶ ai·
OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation
▶ ai·
Reward Hacking in Rubric-Based Reinforcement Learning
▶ ai·
KV-Fold: One-Step KV-Cache Recurrence for Long-Context Inference
▶ ai·
Solve the Loop: Attractor Models for Language and Reasoning
▶ ai·
Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs
▶ ai·
Enabling AI-Native Mobility in 6G: A Real-World Dataset for Handover, Beam Management, and Timing Advance
▶ ai·
The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events
▶ ai·
A Causal Language Modeling Detour Improves Encoder Continued Pretraining
▶ ai·
CAAFC: Chronological Actionable Automated Fact-Checker for misinformation / non-factual hallucination detection and correction
▶ ai·
Formalize, Don't Optimize: The Heuristic Trap in LLM-Generated Combinatorial Solvers
▶ ai·
Stories in Space: In-Context Learning Trajectories in Conceptual Belief Space
▶ ai·
Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling
▶ ai·
Semantic Reward Collapse and the Preservation of Epistemic Integrity in Adaptive AI Systems
▶ ai·
OGLS-SD: On-Policy Self-Distillation with Outcome-Guided Logit Steering for LLM Reasoning
▶ ai·
Detecting overfitting in Neural Networks during long-horizon grokking using Random Matrix Theory
▶ ai·
SEMIR: Semantic Minor-Induced Representation Learning on Graphs for Visual Segmentation
▶ ai·
Scalable Token-Level Hallucination Detection in Large Language Models
▶ ai·
Trust the Batch, On- or Off-Policy: Adaptive Policy Optimization for RL Post-Training
▶ ai·
Discrete Flow Matching for Offline-to-Online Reinforcement Learning
▶ ai·
ProfiliTable: Profiling-Driven Tabular Data Processing via Agentic Workflows
▶ ai·
Agent-Based Post-Hoc Correction of Agricultural Yield Forecasts
▶ ai·
Fill the GAP: A Granular Alignment Paradigm for Visual Reasoning in Multimodal Large Language Models
▶ ai·
Classifier Context Rot: Monitor Performance Degrades with Context Length
▶ ai·
QAP-Router: Tackling Qubit Routing as Dynamic Quadratic Assignment with Reinforcement Learning
▶ ai·
A Family of Quaternion-Valued Differential Evolution Algorithms for Numerical Function Optimization
▶ ai·
MedHopQA: A Disease-Centered Multi-Hop Reasoning Benchmark and Evaluation Framework for LLM-Based Biomedical Question Answering
▶ ai·
$δ$-mem: Efficient Online Memory for Large Language Models
▶ ai·
A New Technique for AI Explainability using Feature Association Map
▶ ai·
BSO: Safety Alignment Is Density Ratio Matching
▶ ai·
Manifold Sampling via Entropy Maximization
▶ ai·
EHR-RAGp: Retrieval-Augmented Prototype-Guided Foundation Model for Electronic Health Records
▶ ai·
Reinforcing VLAs in Task-Agnostic World Models
▶ ai·
Towards Automated Air Traffic Safety Assessment Around Non-Towered Airports Using Large Language Models
▶ ai·
LISA: Cognitive Arbitration for Signal-Free Autonomous Intersection Management
▶ ai·
Transferable Delay-Aware Reinforcement Learning via Implicit Causal Graph Modeling
▶ ai·
KAN-CL: Per-Knot Importance Regularization for Continual Learning with Kolmogorov-Arnold Networks
▶ ai·
Executable Agentic Memory for GUI Agent
▶ ai·
PriorZero: Bridging Language Priors and World Models for Decision Making
▶ ai·
TokenRatio: Principled Token-Level Preference Optimization via Ratio Matching
▶ ai·
Set-Aggregated Genome Embeddings for Microbiome Abundance Prediction
▶ ai·
Iterative Audit Convergence in LLM-Managed Multi-Agent Systems: A Case Study in Prompt Engineering Quality Assurance
▶ ai·
NARA: Anchor-Conditioned Relation-Aware Contextualization of Heterogeneous Geoentities
▶ ai·
How Useful Is Cross-Domain Generalization for Training LLM Monitors?
▶ ai·
Reconnecting Fragmented Citation Networks with Semantic Augmentation
▶ ai·
Missingness-MDPs: Bridging the Theory of Missing Data and POMDPs
▶ ai·