RESEARCH PAPERS
Academic papers, algorithm breakdowns, and advanced research.
research-aiMar 9, 2026
Scale Space Diffusion
By arXiv
0research-aiMar 9, 2026
FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models
By arXiv
0research-aiMar 9, 2026
Agentic Critical Training
By arXiv
0research-aiMar 9, 2026
Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines
By arXiv
0research-aiMar 9, 2026
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising
By arXiv
0research-aiMar 9, 2026
A Multi-Objective Optimization Approach for Sustainable AI-Driven Entrepreneurship in Resilient Economies
By arXiv
0research-aiMar 9, 2026
Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio
By arXiv
0research-aiMar 9, 2026
ER-Pose: Rethinking Keypoint-Driven Representation Learning for Real-Time Human Pose Estimation
By arXiv
0research-aiMar 9, 2026
Talking Together: Synthesizing Co-Located 3D Conversations from Audio
By arXiv
0research-aiMar 9, 2026
ImprovedGS+: A High-Performance C++/CUDA Re-Implementation Strategy for 3D Gaussian Splatting
By arXiv
0research-aiMar 9, 2026
CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning
By arXiv
0research-aiMar 9, 2026
OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning
By arXiv
0research-aiMar 9, 2026
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
By arXiv
0research-aiMar 9, 2026
CAST: Modeling Visual State Transitions for Consistent Video Retrieval
By arXiv
0research-aiMar 9, 2026
Retrieval-Augmented Gaussian Avatars: Improving Expression Generalization
By arXiv
0research-aiMar 9, 2026
PostTrainBench: Can LLM Agents Automate LLM Post-Training?
By arXiv
0research-aiMar 9, 2026
UNBOX: Unveiling Black-box visual models with Natural-language
By arXiv
0research-aiMar 9, 2026
StreamReady: Learning What to Answer and When in Long Streaming Videos
By arXiv
0research-aiMar 9, 2026
mmGAT: Pose Estimation by Graph Attention with Mutual Features from mmWave Radar Point Cloud
By arXiv
0research-aiMar 9, 2026
PCFEx: Point Cloud Feature Extraction for Graph Neural Networks
By arXiv
0research-aiMar 9, 2026
One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States
By arXiv
0research-aiMar 9, 2026
SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation
By arXiv
0research-aiMar 6, 2026
Multimodal Large Language Models as Image Classifiers
By arXiv
0research-aiMar 6, 2026
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
By arXiv
0research-aiMar 6, 2026
BEVLM: Distilling Semantic Knowledge from LLMs into Bird's-Eye View Representations
By arXiv
0research-aiMar 6, 2026
Fly360: Omnidirectional Obstacle Avoidance within Drone View
By arXiv
0research-aiMar 6, 2026
SCOPE: Scene-Contextualized Incremental Few-Shot 3D Segmentation
By arXiv
0research-aiMar 6, 2026
SUREON: A Benchmark and Vision-Language-Model for Surgical Reasoning
By arXiv
0research-aiMar 6, 2026
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
By arXiv
0research-aiMar 6, 2026
Boosting deep Reinforcement Learning using pretraining with Logical Options
By arXiv
0