arxiv-daily

Automated deployment @ 2024-06-13 09:04:46 Asia/Shanghai

Contributions are welcome! Add your topics and keywords to topic.yml. You can also browse historical data in the storage.
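As a rough illustration of how topics and keywords could drive the listings below, here is a minimal sketch that turns a keyword list into an arXiv API query URL. The `TOPICS` mapping is a hypothetical stand-in for topic.yml (the real schema is not shown here and may differ), and `build_query_url` is an illustrative helper, not this project's actual code. Only the arXiv API endpoint and its `search_query`, `sortBy`, `sortOrder`, and `max_results` parameters are taken as given.

```python
from urllib.parse import urlencode

# Hypothetical stand-in for topic.yml: section -> topic -> keywords.
# The real file layout is an assumption and may look different.
TOPICS = {
    "Computer Vision": {
        "Object Tracking": ["object tracking"],
        "Keypoint Detection": ["keypoint detection"],
    },
}

ARXIV_API = "http://export.arxiv.org/api/query"


def build_query_url(keywords, max_results=30):
    """Build an arXiv API URL that returns the newest papers for the keywords."""
    # Quote each keyword so a multi-word keyword is matched as a phrase.
    search_query = " OR ".join(f'all:"{kw}"' for kw in keywords)
    params = {
        "search_query": search_query,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urlencode(params)}"


if __name__ == "__main__":
    # Print one query URL per topic; no network request is made here.
    for section, topics in TOPICS.items():
        for topic, keywords in topics.items():
            print(section, "/", topic, "->", build_query_url(keywords))
```

Quoting the keywords is a design choice in this sketch: an unquoted multi-word keyword may match its words independently, which tends to pull in unrelated papers.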

Computer Vision

Object Tracking

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	A pilot protocol and cohort for the investigation of non-pathological variability in speech	Nicholas Cummins et.al.	2406.07497v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior	Anming Gu et.al.	2406.07475v1	null
2024-06-11	4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models	Heng Yu et.al.	2406.07472v1	null
2024-06-11	Exploring non-radial oscillation modes in dark matter admixed neutron stars	Pratik Thakur et.al.	2406.07470v1	null
2024-06-11	Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control	Jacob Thrän et.al.	2406.07454v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Single and merger soliton dynamics in scalar field dark matter with and without self-interactions	Matthias Stallovits et.al.	2406.07419v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	Operad of posets 101: The Wixarika posets	José Antonio Arciniega-Nevárez et.al.	2406.07370v1	null
2024-06-11	Fast and accurate evaluation of Biot-Savart integrals over spatial curves	Juan Ignacio Polanco et.al.	2406.07366v1	null
2024-06-11	Deep Implicit Optimization for Robust and Flexible Image Registration	Rohit Jena et.al.	2406.07361v1	null
2024-06-11	Machine Learning approaches to classical density functional theory	Alessandro Simon et.al.	2406.07345v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	The representation and computational efficiency of the Tolman-Oppenheimer-Volkoff equations in isotropic coordinates	Dániel Barta et.al.	2406.07319v1	null
2024-06-11	A directional total variation minimization algorithm for isotropic resolution in digital breast tomosynthesis	Emil Y. Sidky et.al.	2406.07306v1	null
2024-06-11	Shadows, rings and optical appearance of a magnetically charged regular black hole illuminated by various accretion disks	Soroush Zare et.al.	2406.07300v1	null
2024-06-11	Multi-objective Reinforcement learning from AI Feedback	Marcus Williams et.al.	2406.07295v2	null
2024-06-11	Joint Learning of Context and Feedback Embeddings in Spoken Dialogue	Livia Qian et.al.	2406.07291v1	null
2024-06-11	Unsupervised Object Detection with Theoretical Guarantees	Marian Longa et.al.	2406.07284v1	null

Image Classification

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing	Bhaskar Gaur et.al.	2406.07486v1	null
2024-06-11	Image Neural Field Diffusion Models	Yinbo Chen et.al.	2406.07480v1	null
2024-06-11	fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions	Alireza Afzal Aghaei et.al.	2406.07456v1	link
2024-06-11	An Optimism-based Approach to Online Evaluation of Generative Models	Xiaoyan Hu et.al.	2406.07451v1	null
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450v1	link
2024-06-11	Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration	Shashank Agnihotri et.al.	2406.07435v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Learning Domain-Invariant Features for Out-of-Context News Detection	Yimeng Gu et.al.	2406.07430v1	null
2024-06-11	DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses	Abdurrahim Yilmaz et.al.	2406.07426v1	null
2024-06-11	MINERS: Multilingual Language Models as Semantic Retrievers	Genta Indra Winata et.al.	2406.07424v1	null
2024-06-11	Holistic Memory Diversification for Incremental Learning in Growing Graphs	Ziyue Qiao et.al.	2406.07413v1	null

Multi-Object Tracking

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	A pilot protocol and cohort for the investigation of non-pathological variability in speech	Nicholas Cummins et.al.	2406.07497v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior	Anming Gu et.al.	2406.07475v1	null
2024-06-11	4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models	Heng Yu et.al.	2406.07472v1	null
2024-06-11	Exploring non-radial oscillation modes in dark matter admixed neutron stars	Pratik Thakur et.al.	2406.07470v1	null
2024-06-11	Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control	Jacob Thrän et.al.	2406.07454v1	null
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450v1	link
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Single and merger soliton dynamics in scalar field dark matter with and without self-interactions	Matthias Stallovits et.al.	2406.07419v1	null
2024-06-11	Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization	Weiliang Zhang et.al.	2406.07418v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling	Sixian Wang et.al.	2406.07390v1	null
2024-06-11	Operad of posets 101: The Wixarika posets	José Antonio Arciniega-Nevárez et.al.	2406.07370v1	null

Object Detection

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System	SBND Collaboration et.al.	2406.07514v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum	N. -O. Stutzer et.al.	2406.07511v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	A pilot protocol and cohort for the investigation of non-pathological variability in speech	Nicholas Cummins et.al.	2406.07497v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	Neutrino magnetic dipole portal with low energy neutrino nucleus scattering data	Ying-Ying Li et.al.	2406.07477v1	null
2024-06-11	4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models	Heng Yu et.al.	2406.07472v1	null
2024-06-11	Exploring non-radial oscillation modes in dark matter admixed neutron stars	Pratik Thakur et.al.	2406.07470v1	null
2024-06-11	Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control	Jacob Thrän et.al.	2406.07454v1	null
2024-06-11	Search for photons above 10$^{18}$ eV by simultaneously measuring the atmospheric depth and the muon content of air showers at the Pierre Auger Observatory	The Pierre Auger Collaboration et.al.	2406.07439v1	null
2024-06-11	Single and merger soliton dynamics in scalar field dark matter with and without self-interactions	Matthias Stallovits et.al.	2406.07419v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	Operad of posets 101: The Wixarika posets	José Antonio Arciniega-Nevárez et.al.	2406.07370v1	null
2024-06-11	Fast and accurate evaluation of Biot-Savart integrals over spatial curves	Juan Ignacio Polanco et.al.	2406.07366v1	null
2024-06-11	Deep Implicit Optimization for Robust and Flexible Image Registration	Rohit Jena et.al.	2406.07361v1	null
2024-06-11	Machine Learning approaches to classical density functional theory	Alessandro Simon et.al.	2406.07345v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	The representation and computational efficiency of the Tolman-Oppenheimer-Volkoff equations in isotropic coordinates	Dániel Barta et.al.	2406.07319v1	null
2024-06-11	A directional total variation minimization algorithm for isotropic resolution in digital breast tomosynthesis	Emil Y. Sidky et.al.	2406.07306v1	null
2024-06-11	Shadows, rings and optical appearance of a magnetically charged regular black hole illuminated by various accretion disks	Soroush Zare et.al.	2406.07300v1	null
2024-06-11	Multi-objective Reinforcement learning from AI Feedback	Marcus Williams et.al.	2406.07295v2	null
2024-06-11	Joint Learning of Context and Feedback Embeddings in Spoken Dialogue	Livia Qian et.al.	2406.07291v1	null
2024-06-11	Unsupervised Object Detection with Theoretical Guarantees	Marian Longa et.al.	2406.07284v1	null

Image Matching

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing	Bhaskar Gaur et.al.	2406.07486v1	null
2024-06-11	Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing	Mao Li et.al.	2406.07483v1	null
2024-06-11	Image Neural Field Diffusion Models	Yinbo Chen et.al.	2406.07480v1	null
2024-06-11	Resummation of Multi-Stress Tensors in Higher Dimensions	Kuo-Wei Huang et.al.	2406.07458v1	null
2024-06-11	fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions	Alireza Afzal Aghaei et.al.	2406.07456v1	link
2024-06-11	An Optimism-based Approach to Online Evaluation of Generative Models	Xiaoyan Hu et.al.	2406.07451v1	null
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450v1	link
2024-06-11	Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration	Shashank Agnihotri et.al.	2406.07435v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Learning Domain-Invariant Features for Out-of-Context News Detection	Yimeng Gu et.al.	2406.07430v1	null
2024-06-11	DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses	Abdurrahim Yilmaz et.al.	2406.07426v1	null
2024-06-11	Optimal Marital Strategies: How Couples Develop Successful Interaction Styles	Micah Henson et.al.	2406.07403v1	null

Semantic Segmentation

Publish Date	Title	Authors	PDF	Code
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Textual Similarity as a Key Metric in Machine Translation Quality Estimation	Kun Sun et.al.	2406.07440v1	null
2024-06-11	MINERS: Multilingual Language Models as Semantic Retrievers	Genta Indra Winata et.al.	2406.07424v1	null
2024-06-11	Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech	Yin-Long Liu et.al.	2406.07410v1	null
2024-06-11	Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance	Ruxin Zheng et.al.	2406.07399v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	A Banach space whose set of norm-attaining functionals is algebraically trivial	Miguel Martin et.al.	2406.07273v1	null
2024-06-11	Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation	Jinyuan Li et.al.	2406.07268v1	null
2024-06-11	Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning	Zhiyu Shao et.al.	2406.07213v1	link
2024-06-11	Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation	Diwei Sheng et.al.	2406.07202v1	null
2024-06-11	Target Speech Diarization with Multimodal Prompts	Yidi Jiang et.al.	2406.07198v1	null
2024-06-11	RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker	Yunfeng Li et.al.	2406.07189v1	link
2024-06-11	TernaryLLM: Ternarized Large Language Model	Tianqi Chen et.al.	2406.07177v1	null
2024-06-11	ULog: Unsupervised Log Parsing with Large Language Models through Log Contrastive Units	Junjie Huang et.al.	2406.07174v1	null
2024-06-11	FaceGPT: Self-supervised Learning to Chat about 3D Human Faces	Haoran Wang et.al.	2406.07163v1	null
2024-06-11	EEG classification for visual brain decoding with spatio-temporal and transformer based paradigms	Akanksha Sharma et.al.	2406.07153v1	null
2024-06-11	Translating speech with just images	Dan Oneata et.al.	2406.07133v1	null
2024-06-11	Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph	Sergey Linok et.al.	2406.07113v1	null
2024-06-11	AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding	Xing Zhang et.al.	2406.07091v1	null
2024-06-11	CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation	Zhongzhen Huang et.al.	2406.07085v1	null
2024-06-11	1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation	Mingqi Gao et.al.	2406.07043v1	link
2024-06-11	EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network	Yining Shi et.al.	2406.07042v1	link
2024-06-11	PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Yining Shi et.al.	2406.07037v1	null
2024-06-11	RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks	Zhechao Wang et.al.	2406.07032v1	null
2024-06-11	LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection	Jiahua Xu et.al.	2406.07023v2	null
2024-06-11	Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models	Sooyeon Go et.al.	2406.07008v1	null

Instance Segmentation

Publish Date	Title	Authors	PDF	Code
2024-06-11	Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis	Qining Zhang et.al.	2406.07455v1	null
2024-06-11	On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations	Shiao Meng et.al.	2406.07444v1	null
2024-06-11	Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech	Yin-Long Liu et.al.	2406.07410v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering	Longlong Lin et.al.	2406.07357v1	null
2024-06-11	The Theory of Intrinsic Time: A Primer	James B. Glattfelder et.al.	2406.07354v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	A Banach space whose set of norm-attaining functionals is algebraically trivial	Miguel Martin et.al.	2406.07273v1	null
2024-06-11	Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation	Jinyuan Li et.al.	2406.07268v1	null
2024-06-11	Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation	Diwei Sheng et.al.	2406.07202v1	null
2024-06-11	Quantum repeaters based on stationary Gottesman-Kitaev-Preskill qubits	Stefan Häussler et.al.	2406.07158v1	null
2024-06-11	Scaling Large-Language-Model-based Multi-Agent Collaboration	Chen Qian et.al.	2406.07155v1	link
2024-06-11	CHARME: A chain-based reinforcement learning approach for the minor embedding problem	Hoang M. Ngo et.al.	2406.07124v1	null
2024-06-11	The Treatment of Ties in Rank-Biased Overlap	Matteo Corsi et.al.	2406.07121v1	null
2024-06-11	Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph	Sergey Linok et.al.	2406.07113v1	null
2024-06-11	Large amplitude quasi-periodic traveling waves in two dimensional forced rotating fluids	Roberta Bianchini et.al.	2406.07099v1	null
2024-06-11	Edge Rendering Architecture for multiuser XR Experiences and E2E Performance Assessment	Inhar Yeregui et.al.	2406.07087v1	null
2024-06-11	CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation	Zhongzhen Huang et.al.	2406.07085v1	null
2024-06-11	Triage of 3D pathology data via 2.5D multiple-instance learning to guide pathologist assessments	Gan Gao et.al.	2406.07061v1	link
2024-06-11	Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study	Yichi Zhang et.al.	2406.07057v1	null
2024-06-11	1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation	Mingqi Gao et.al.	2406.07043v1	link
2024-06-11	PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Yining Shi et.al.	2406.07037v1	null
2024-06-11	RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks	Zhechao Wang et.al.	2406.07032v1	null
2024-06-11	LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection	Jiahua Xu et.al.	2406.07023v2	null
2024-06-11	Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples	Kailas Dayanandan et.al.	2406.06967v1	link
2024-06-11	Distributional MIPLIB: a Multi-Domain Library for Advancing ML-Guided MILP Methods	Weimin Huang et.al.	2406.06954v1	null
2024-06-11	Sparse Bayesian Networks: Efficient Uncertainty Quantification in Medical Image Analysis	Zeinab Abboud et.al.	2406.06946v1	null
2024-06-11	UVIS: Unsupervised Video Instance Segmentation	Shuaiyi Huang et.al.	2406.06908v1	null
2024-06-11	Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots	Xiang Zhi Tan et.al.	2406.06904v1	null
2024-06-11	Universal spatial inflation of human mobility	Lu Zhong et.al.	2406.06889v1	null

Keypoint Detection

Publish Date	Title	Authors	PDF	Code
2024-06-11	Differentiability and Optimization of Multiparameter Persistent Homology	Luis Scoccola et.al.	2406.07224v1	null
2024-06-10	Relative descriptors for quantum agents	David Möckli et.al.	2406.06719v1	null
2024-06-08	Unsupervised learning of Data-driven Facial Expression Coding System (DFECS) using keypoint tracking	Shivansh Chandra Tripathi et.al.	2406.05434v1	null
2024-06-07	Expected Lipschitz-Killing curvatures for spin random fields and other non-isotropic fields	Francesca Pistolato et.al.	2406.04850v1	null
2024-06-07	LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model	Dongkai Wang et.al.	2406.04659v1	link
2024-06-06	Monocular Localization with Semantics Map for Autonomous Vehicles	Jixiang Wan et.al.	2406.03835v1	null
2024-06-05	Image Copy-Move Forgery Detection and Localization Scheme: How to Avoid Missed Detection and False Alarm	Li Jiang et.al.	2406.03271v1	null
2024-06-05	Topological Neural Networks go Persistent, Equivariant, and Continuous	Yogesh Verma et.al.	2406.03164v1	null
2024-06-05	How precisely are solute clusters in RPV steels characterized by atom probe experiments?	N. Castin et.al.	2406.02973v1	null
2024-06-05	Homotopic Path Set Planning for Robot Manipulation and Navigation	Jing Huang et.al.	2406.02885v1	link
2024-06-05	Controllable Talking Face Generation by Implicit Facial Keypoints Editing	Dong Zhao et.al.	2406.02880v1	null
2024-06-04	Machine learning Hubbard parameters with equivariant neural networks	Martin Uhrin et.al.	2406.02457v1	link
2024-06-03	Scale-Free Image Keypoints Using Differentiable Persistent Homology	Giovanni Barbarani et.al.	2406.01315v1	link
2024-06-03	MoFormer: Multi-objective Antimicrobial Peptide Generation Based on Conditional Transformer Joint Multi-modal Fusion Descriptor	Li Wang et.al.	2406.02610v1	null
2024-06-02	W-Net: A Facial Feature-Guided Face Super-Resolution Network	Hao Liu et.al.	2406.00676v1	null
2024-06-02	SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection	Yun Peng et.al.	2406.00625v2	null
2024-06-01	CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation	Matan Rusanovsky et.al.	2406.00384v1	link
2024-05-31	Learning from metastable grain boundaries	Avanish Mishra et.al.	2406.00204v1	null
2024-05-30	Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach	Muhammad Saif Ullah Khan et.al.	2405.20084v1	null
2024-05-30	KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation	Fengyuan Yang et.al.	2405.19833v1	link
2024-05-30	Automatic Dance Video Segmentation for Understanding Choreography	Koki Endo et.al.	2405.19727v1	null
2024-05-30	SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations	Yujiao Jiang et.al.	2405.19609v1	null
2024-05-29	SDPRLayers: Certifiable Backpropagation Through Polynomial Optimization Problems in Robotics	Connor Holmes et.al.	2405.19309v1	null
2024-05-29	Greedy Kernel Methods for Approximating Breakthrough Curves for Reactive Flow from 3D Porous Geometry Data	Robin Herkert et.al.	2405.19170v1	null
2024-05-29	PointNetPGAP-SLC: A 3D LiDAR-based Place Recognition Approach with Segment-level Consistency Training for Mobile Robots in Horticulture	T. Barros et.al.	2405.19038v1	link
2024-05-29	Classification analysis of transition-metal chalcogenides and oxides using quantum machine learning	Kurudi V Vedavyasa et.al.	2405.18989v1	null
2024-05-29	Diffeomorphic interpolation for efficient persistence-based topological optimization	Mathieu Carriere et.al.	2405.18820v1	null
2024-05-28	Temperature-Dependent Chirality in Halide Perovskites	Mike Pols et.al.	2405.18643v1	null
2024-05-28	What can machine learning help with microstructure-informed materials modeling and design?	Xiang-Long Peng et.al.	2405.18396v1	null
2024-05-28	Relational Self-supervised Distillation with Compact Descriptors for Image Copy Detection	Juntae Kim et.al.	2405.17928v3	null

3D Vision

Point Cloud Matching

Publish Date	Title	Authors	PDF	Code
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing	Mao Li et.al.	2406.07483v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	Resummation of Multi-Stress Tensors in Higher Dimensions	Kuo-Wei Huang et.al.	2406.07458v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup	Takahiro Ueda et.al.	2406.07427v1	null
2024-06-11	Adic curves: stable reduction, skeletons and metric structure	Katharina Hübner et.al.	2406.07414v1	null
2024-06-11	Private Geometric Median	Mahdi Haghifam et.al.	2406.07407v1	null
2024-06-11	Optimal Marital Strategies: How Couples Develop Successful Interaction Styles	Micah Henson et.al.	2406.07403v1	null
2024-06-11	Disrupting Bipartite Trading Networks: Matching for Revenue Maximization	Luca D'Amico-Wong et.al.	2406.07385v1	null
2024-06-11	Closing the Computational-Query Depth Gap in Parallel Stochastic Convex Optimization	Arun Jambulapati et.al.	2406.07373v1	null
2024-06-11	Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories	An-Yi Huang et.al.	2406.07341v1	null
2024-06-11	Searching for gravitational waves from stellar-mass binary black holes early inspiral	Xue-Ting Zhang et.al.	2406.07336v1	null
2024-06-11	Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold	Mrinmoy Datta et.al.	2406.07326v1	null
2024-06-11	Lyapunov equations: a (fixed) point of view	Richard Pates et.al.	2406.07324v1	null
2024-06-11	Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs	Kamil Jeziorek et.al.	2406.07318v1	null
2024-06-11	Morse Index Stability for the Ginzburg-Landau Approximation	Francesca Da Lio et.al.	2406.07317v1	null
2024-06-11	Sum the Probabilities to $m$ and Stop	Zakaria Derbazi et.al.	2406.07283v1	null
2024-06-11	Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport	Ross Irwin et.al.	2406.07266v1	null
2024-06-11	Coupled-channel $J^{--}$ meson resonances from lattice QCD	Jozef J. Dudek et.al.	2406.07261v1	null
2024-06-11	Hybrid Reinforcement Learning from Offline Observation Alone	Yuda Song et.al.	2406.07253v1	null

3D Object Tracking

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System	SBND Collaboration et.al.	2406.07514v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum	N. -O. Stutzer et.al.	2406.07511v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	A pilot protocol and cohort for the investigation of non-pathological variability in speech	Nicholas Cummins et.al.	2406.07497v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior	Anming Gu et.al.	2406.07475v1	null
2024-06-11	4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models	Heng Yu et.al.	2406.07472v1	null
2024-06-11	Exploring non-radial oscillation modes in dark matter admixed neutron stars	Pratik Thakur et.al.	2406.07470v1	null
2024-06-11	Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains	Kush Kinra et.al.	2406.07460v1	null
2024-06-11	Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control	Jacob Thrän et.al.	2406.07454v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Single and merger soliton dynamics in scalar field dark matter with and without self-interactions	Matthias Stallovits et.al.	2406.07419v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	Operad of posets 101: The Wixarika posets	José Antonio Arciniega-Nevárez et.al.	2406.07370v1	null

3D Object Detection

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System	SBND Collaboration et.al.	2406.07514v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum	N. -O. Stutzer et.al.	2406.07511v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	A pilot protocol and cohort for the investigation of non-pathological variability in speech	Nicholas Cummins et.al.	2406.07497v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	Prospects for the detection of Dark Matter with Long-lived Mediators in the Sun using the Southern Wide-field Gamma-ray Observatory	Micael Andrade et.al.	2406.07489v1	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing	Mao Li et.al.	2406.07483v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	Neutrino magnetic dipole portal with low energy neutrino nucleus scattering data	Ying-Ying Li et.al.	2406.07477v1	null
2024-06-11	4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models	Heng Yu et.al.	2406.07472v1	null
2024-06-11	Exploring non-radial oscillation modes in dark matter admixed neutron stars	Pratik Thakur et.al.	2406.07470v1	null
2024-06-11	Anomaly Detection on Unstable Logs with GPT Models	Fatemeh Hadadi et.al.	2406.07467v1	null
2024-06-11	Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains	Kush Kinra et.al.	2406.07460v1	null
2024-06-11	Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control	Jacob Thrän et.al.	2406.07454v1	null

Point Cloud Segmentation

Publish Date	Title	Authors	PDF	Code
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	Resummation of Multi-Stress Tensors in Higher Dimensions	Kuo-Wei Huang et.al.	2406.07458v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup	Takahiro Ueda et.al.	2406.07427v1	null
2024-06-11	Adic curves: stable reduction, skeletons and metric structure	Katharina Hübner et.al.	2406.07414v1	null
2024-06-11	Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech	Yin-Long Liu et.al.	2406.07410v1	null
2024-06-11	Private Geometric Median	Mahdi Haghifam et.al.	2406.07407v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories	An-Yi Huang et.al.	2406.07341v1	null
2024-06-11	Searching for gravitational waves from stellar-mass binary black holes early inspiral	Xue-Ting Zhang et.al.	2406.07336v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold	Mrinmoy Datta et.al.	2406.07326v1	null
2024-06-11	Lyapunov equations: a (fixed) point of view	Richard Pates et.al.	2406.07324v1	null
2024-06-11	Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs	Kamil Jeziorek et.al.	2406.07318v1	null
2024-06-11	Morse Index Stability for the Ginzburg-Landau Approximation	Francesca Da Lio et.al.	2406.07317v1	null
2024-06-11	Sum the Probabilities to $m$ and Stop	Zakaria Derbazi et.al.	2406.07283v1	null
2024-06-11	A Banach space whose set of norm-attaining functionals is algebraically trivial	Miguel Martin et.al.	2406.07273v1	null
2024-06-11	Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation	Jinyuan Li et.al.	2406.07268v1	null
2024-06-11	Coupled-channel $J^{--}$ meson resonances from lattice QCD	Jozef J. Dudek et.al.	2406.07261v1	null
2024-06-11	Even dimensional Fermat cubics are rational over any field	Alex Massarenti et.al.	2406.07223v1	null
2024-06-11	Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation	Diwei Sheng et.al.	2406.07202v1	null
2024-06-11	A Multi-step Approach for Minimizing Risk in Decentralized Exchanges	Daniele Maria Di Nosse et.al.	2406.07200v2	null
2024-06-11	TernaryLLM: Ternarized Large Language Model	Tianqi Chen et.al.	2406.07177v1	null

Point Cloud Registration

Publish Date	Title	Authors	PDF	Code
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	Resummation of Multi-Stress Tensors in Higher Dimensions	Kuo-Wei Huang et.al.	2406.07458v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup	Takahiro Ueda et.al.	2406.07427v1	null
2024-06-11	Adic curves: stable reduction, skeletons and metric structure	Katharina Hübner et.al.	2406.07414v1	null
2024-06-11	Private Geometric Median	Mahdi Haghifam et.al.	2406.07407v1	null
2024-06-11	Deep Implicit Optimization for Robust and Flexible Image Registration	Rohit Jena et.al.	2406.07361v1	null
2024-06-11	Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories	An-Yi Huang et.al.	2406.07341v1	null
2024-06-11	Searching for gravitational waves from stellar-mass binary black holes early inspiral	Xue-Ting Zhang et.al.	2406.07336v1	null
2024-06-11	Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold	Mrinmoy Datta et.al.	2406.07326v1	null
2024-06-11	Lyapunov equations: a (fixed) point of view	Richard Pates et.al.	2406.07324v1	null
2024-06-11	Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs	Kamil Jeziorek et.al.	2406.07318v1	null
2024-06-11	Morse Index Stability for the Ginzburg-Landau Approximation	Francesca Da Lio et.al.	2406.07317v1	null
2024-06-11	Sum the Probabilities to $m$ and Stop	Zakaria Derbazi et.al.	2406.07283v1	null
2024-06-11	Coupled-channel $J^{--}$ meson resonances from lattice QCD	Jozef J. Dudek et.al.	2406.07261v1	null
2024-06-11	Even dimensional Fermat cubics are rational over any field	Alex Massarenti et.al.	2406.07223v1	null
2024-06-11	A Multi-step Approach for Minimizing Risk in Decentralized Exchanges	Daniele Maria Di Nosse et.al.	2406.07200v2	null
2024-06-11	TernaryLLM: Ternarized Large Language Model	Tianqi Chen et.al.	2406.07177v1	null
2024-06-11	Ultrametric-preserving functions as monoid endomorphisms	Oleksiy Dovgoshey et.al.	2406.07166v2	null
2024-06-11	ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators	Jun Yin et.al.	2406.07161v1	null
2024-06-11	Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO	Ali Elkeshawy et.al.	2406.07160v1	null
2024-06-11	A portrait of the rotation of Ultra-Cool Dwarfs revealed by TESS	D. O. Fontinele et.al.	2406.07154v1	null
2024-06-11	High-performance in-vacuum optical system for quantum optics experiments in a Penning-trap	Joaquín Berrocal et.al.	2406.07152v1	null

Point Cloud Completion

Publish Date	Title	Authors	PDF	Code
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	Uniqueness on average of large isoperimetric sets in noncompact manifolds with nonnegative Ricci curvature	Gioacchino Antonelli et.al.	2406.07509v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	Resummation of Multi-Stress Tensors in Higher Dimensions	Kuo-Wei Huang et.al.	2406.07458v1	null
2024-06-11	McEval: Massively Multilingual Code Evaluation	Linzheng Chai et.al.	2406.07436v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup	Takahiro Ueda et.al.	2406.07427v1	null
2024-06-11	Adic curves: stable reduction, skeletons and metric structure	Katharina Hübner et.al.	2406.07414v1	null
2024-06-11	VersiCode: Towards Version-controllable Code Generation	Tongtong Wu et.al.	2406.07411v1	null
2024-06-11	Private Geometric Median	Mahdi Haghifam et.al.	2406.07407v1	null
2024-06-11	Limited Out-of-Context Knowledge Reasoning in Large Language Models	Peng Hu et.al.	2406.07393v1	null
2024-06-11	A mechanical qubit	Yu Yang et.al.	2406.07360v1	null
2024-06-11	Finite $W$-algebra invariants via Lax type operators	Jonathan S. Brown et.al.	2406.07350v1	null
2024-06-11	Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories	An-Yi Huang et.al.	2406.07341v1	null
2024-06-11	Searching for gravitational waves from stellar-mass binary black holes early inspiral	Xue-Ting Zhang et.al.	2406.07336v1	null
2024-06-11	Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold	Mrinmoy Datta et.al.	2406.07326v1	null
2024-06-11	Lyapunov equations: a (fixed) point of view	Richard Pates et.al.	2406.07324v1	null
2024-06-11	Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs	Kamil Jeziorek et.al.	2406.07318v1	null
2024-06-11	Morse Index Stability for the Ginzburg-Landau Approximation	Francesca Da Lio et.al.	2406.07317v1	null
2024-06-11	Sum the Probabilities to $m$ and Stop	Zakaria Derbazi et.al.	2406.07283v1	null
2024-06-11	Coupled-channel $J^{--}$ meson resonances from lattice QCD	Jozef J. Dudek et.al.	2406.07261v1	null
2024-06-11	Hybrid Reinforcement Learning from Offline Observation Alone	Yuda Song et.al.	2406.07253v1	null
2024-06-11	Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring	Tomoya Nishida et.al.	2406.07250v1	null
2024-06-11	Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces	Salvatore Federico et.al.	2406.07242v1	null

Point Cloud

Publish Date	Title	Authors	PDF	Code
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds	Gabriella Tarantello et.al.	2406.07518v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	Resummation of Multi-Stress Tensors in Higher Dimensions	Kuo-Wei Huang et.al.	2406.07458v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup	Takahiro Ueda et.al.	2406.07427v1	null
2024-06-11	Adic curves: stable reduction, skeletons and metric structure	Katharina Hübner et.al.	2406.07414v1	null
2024-06-11	Private Geometric Median	Mahdi Haghifam et.al.	2406.07407v1	null
2024-06-11	Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories	An-Yi Huang et.al.	2406.07341v1	null
2024-06-11	Searching for gravitational waves from stellar-mass binary black holes early inspiral	Xue-Ting Zhang et.al.	2406.07336v1	null
2024-06-11	Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold	Mrinmoy Datta et.al.	2406.07326v1	null
2024-06-11	Lyapunov equations: a (fixed) point of view	Richard Pates et.al.	2406.07324v1	null
2024-06-11	Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs	Kamil Jeziorek et.al.	2406.07318v1	null
2024-06-11	Morse Index Stability for the Ginzburg-Landau Approximation	Francesca Da Lio et.al.	2406.07317v1	null
2024-06-11	Sum the Probabilities to $m$ and Stop	Zakaria Derbazi et.al.	2406.07283v1	null
2024-06-11	Coupled-channel $J^{--}$ meson resonances from lattice QCD	Jozef J. Dudek et.al.	2406.07261v1	null
2024-06-11	Even dimensional Fermat cubics are rational over any field	Alex Massarenti et.al.	2406.07223v1	null
2024-06-11	A Multi-step Approach for Minimizing Risk in Decentralized Exchanges	Daniele Maria Di Nosse et.al.	2406.07200v2	null
2024-06-11	TernaryLLM: Ternarized Large Language Model	Tianqi Chen et.al.	2406.07177v1	null
2024-06-11	Ultrametric-preserving functions as monoid endomorphisms	Oleksiy Dovgoshey et.al.	2406.07166v2	null
2024-06-11	ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators	Jun Yin et.al.	2406.07161v1	null
2024-06-11	Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO	Ali Elkeshawy et.al.	2406.07160v1	null
2024-06-11	A portrait of the rotation of Ultra-Cool Dwarfs revealed by TESS	D. O. Fontinele et.al.	2406.07154v1	null
2024-06-11	High-performance in-vacuum optical system for quantum optics experiments in a Penning-trap	Joaquín Berrocal et.al.	2406.07152v1	null
2024-06-11	Partial yet definite emergence of the Kardar-Parisi-Zhang class in isotropic spin chains	Kazumasa A. Takeuchi et.al.	2406.07150v1	null

Federated Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale	William Giarè et.al.	2406.07493v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	The end of multiple choice tests: using AI to enhance assessment	Michael Klymkowsky et.al.	2406.07481v1	null

Framework

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Communication

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Personalized

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Should XAI Nudge Human Decisions with Explanation Biasing?	Yosuke Fukuchi et.al.	2406.07323v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-11	A Synthetic Dataset for Personal Attribute Inference	Hanna Yukhymenko et.al.	2406.07217v1	null
2024-06-11	MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance	X. Wang et.al.	2406.07209v1	link
2024-06-11	Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation	Diwei Sheng et.al.	2406.07202v1	null
2024-06-11	Unlocking the Potential of the Metaverse for Innovative and Immersive Digital Care	Fatemeh Ebrahimzadeh et.al.	2406.07114v1	null
2024-06-11	A Neck Orthosis with Multi-Directional Variable Stiffness for Persons with Dropped Head Syndrome	Santiago Price Torrendell et.al.	2406.07074v1	null
2024-06-11	pVACview: an interactive visualization tool for efficient neoantigen prioritization and selection	Huiming Xia et.al.	2406.06985v1	null
2024-06-11	Non-autoregressive Personalized Bundle Generation	Wenchuan Yang et.al.	2406.06925v1	null
2024-06-11	Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots	Xiang Zhi Tan et.al.	2406.06904v1	null
2024-06-10	Personalized Binomial DAGs Learning with Network Structured Covariates	Boxin Zhao et.al.	2406.06829v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Video-based Exercise Classification and Activated Muscle Group Prediction with Hybrid X3D-SlowFast Network	Manvik Pasula et.al.	2406.06703v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Towards a Personal Health Large Language Model	Justin Cosentino et.al.	2406.06474v1	null
2024-06-10	Transforming Wearable Data into Health Insights using Large Language Model Agents	Mike A. Merrill et.al.	2406.06464v2	null
2024-06-10	Estimating Heterogeneous Treatment Effects by Combining Weak Instruments and Observational Data	Miruna Oprescu et.al.	2406.06452v1	link
2024-06-10	Biomarker-Guided Adaptive Enrichment Design with Threshold Detection for Clinical Trials with Time-to-Event Outcome	Kaiyuan Hua et.al.	2406.06426v1	null
2024-06-10	Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models	Marek Wodzinski et.al.	2406.06372v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Human Gaze and Head Rotation during Navigation, Exploration and Object Manipulation in Shared Environments with Robots	Tim Schreiter et.al.	2406.06300v1	null
2024-06-10	Tuning-Free Visual Customization via View Iterative Self-Attention Control	Xiaojie Li et.al.	2406.06258v2	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Link Prediction in Bipartite Networks	Şükrü Demir İnan Özer et.al.	2406.06658v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null

Optimization

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Privacy

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Asynchronous

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Dataset

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Benchmark

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Efficient

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Heterogeneous

Publish Date	Title	Authors	PDF	Code
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway	Hamed Babaei Giglou et.al.	2406.07257v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-10	Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints	T. Tony Cai et.al.	2406.06755v1	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749v1	null
2024-06-10	Decentralized Personalized Federated Learning	Salma Kharrat et.al.	2406.06520v1	null
2024-06-10	Optimisation of federated learning settings under statistical heterogeneity variations	Basem Suleiman et.al.	2406.06340v1	null
2024-06-10	Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning	Xiaoting Lyu et.al.	2406.06207v1	null
2024-06-10	Federated learning in food research	Zuzanna Fendor et.al.	2406.06202v1	null
2024-06-10	Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm	Ahmed Elbakary et.al.	2406.06655v1	null
2024-06-10	Federated Machine Reasoning for Resource Provisioning in 6G O-RAN	Swastika Roy et.al.	2406.06128v1	null
2024-06-09	Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis"	Mahtab Talaei et.al.	2406.05858v1	null
2024-06-08	Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey	Shinu M. Rajagopal et.al.	2406.05517v1	null
2024-06-08	PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System	Wei Yuan et.al.	2406.05387v1	null
2024-06-07	Federated LoRA with Sparse Communication	Kevin Kuo et.al.	2406.05233v1	null
2024-06-07	The Russian Legislative Corpus	Denis Saveliev et.al.	2406.04855v1	link
2024-06-07	FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models	Rui Ye et.al.	2406.04845v1	link
2024-06-07	When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain	Lei Xu et.al.	2406.04743v1	null
2024-06-07	Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems	Zhen Cai et.al.	2406.04702v1	null
2024-06-07	Federated Representation Learning in the Under-Parameterized Regime	Renpu Liu et.al.	2406.04596v3	null
2024-06-06	Data Measurements for Decentralized Data Markets	Charles Lu et.al.	2406.04257v1	null
2024-06-06	R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients	Tamer Ahmed Eltaras et.al.	2406.04227v1	null
2024-06-06	Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning	Xuhan Zuo et.al.	2406.04076v1	null
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933v1	link
2024-06-06	1-D CNN-Based Online Signature Verification with Federated Learning	Lingfeng Zhang et.al.	2406.06597v1	null
2024-06-06	Stochastic Dynamic Network Utility Maximization with Application to Disaster Response	Anna Scaglione et.al.	2406.03750v1	null
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611v1	link
2024-06-05	Fantastyc: Blockchain-based Federated Learning Made Secure and Practical	William Boitier et.al.	2406.03608v1	null
2024-06-05	Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning	Saber Malekmohammadi et.al.	2406.03519v1	link

Few-shot Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale	William Giarè et.al.	2406.07493v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	The end of multiple choice tests: using AI to enhance assessment	Michael Klymkowsky et.al.	2406.07481v1	null

One-shot Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale	William Giarè et.al.	2406.07493v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null
2024-06-11	The end of multiple choice tests: using AI to enhance assessment	Michael Klymkowsky et.al.	2406.07481v1	null

Meta Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Impact of the nuclear equation of state on the formation of twin stars	Nai-Bo Zhang et.al.	2406.07396v1	null
2024-06-11	Fast Adaptive Meta-Heuristic for Large-Scale Facility Location Problem	Bahram Alidaee et.al.	2406.07382v1	null
2024-06-11	A generic and robust quantum agent inspired by deep meta-reinforcement learning	Zibo Miao et.al.	2406.07225v1	null
2024-06-11	Agnostic Sharpness-Aware Minimization	Van-Anh Nguyen et.al.	2406.07107v2	null
2024-06-11	Meta-Backscatter: A New ISAC Paradigm for Battery-Free Internet of Things	Xu Liu et.al.	2406.07077v1	null
2024-06-11	HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation	Wen Luo et.al.	2406.07070v1	null
2024-06-11	Fairness-Aware Meta-Learning via Nash Bargaining	Yi Zeng et.al.	2406.07029v1	null
2024-06-11	Ollabench: Evaluating LLMs' Reasoning for Human-centric Interdependent Cybersecurity	Tam N. Nguyen et.al.	2406.06863v1	link
2024-06-10	Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness	Dingrong Wang et.al.	2406.06792v1	link
2024-06-10	Meta Learning Text-to-Speech Synthesis in over 7000 Languages	Florian Lux et.al.	2406.06403v1	link
2024-06-10	Characteristics and Energy Flux Distributions of Decayless Transverse Oscillations Depending on Coronal Regions	Daye Lim et.al.	2406.06368v1	null
2024-06-10	Data Augmentation in Earth Observation: A Diffusion Model Approach	Tiago Sousa et.al.	2406.06218v1	null
2024-06-10	Causality-inspired Latent Feature Augmentation for Single Domain Generalization	Jian Xu et.al.	2406.05980v1	null
2024-06-10	Data Caching for Enterprise-Grade Petabyte-Scale OLAP	Chunxu Tang et.al.	2406.05962v1	null
2024-06-09	Async Learned User Embeddings for Ads Delivery Optimization	Mingwei Tang et.al.	2406.05898v1	null
2024-06-09	Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach	Georgios Tsoumplekas et.al.	2406.05887v1	null
2024-06-08	Synergizing Deep Learning and Phase Change Materials for Four-state Broadband Multifunctional Metasurfaces in the Visible Range	Md. Ehsanul Karim et.al.	2406.05519v1	null
2024-06-08	Gradient-based algorithms for multi-objective bi-level optimization	Xinmin Yang et.al.	2406.05455v1	null
2024-06-08	A Survey of Meta-features Used for Automated Selection of Algorithms for Black-box Single-objective Continuous Optimization	Gjorgjina Cenikj et.al.	2406.06629v1	null
2024-06-08	Large Language Model Assisted Adversarial Robustness Neural Architecture Search	Rui Zhong et.al.	2406.05433v1	link
2024-06-07	Massively Multiagent Minigames for Training Generalist Agents	Kyoung Whan Choe et.al.	2406.05071v1	link
2024-06-07	Scenarios and Approaches for Situated Natural Language Explanations	Pengshuo Qiu et.al.	2406.05035v1	null
2024-06-07	Unraveling Trace Anomaly of Supradense Matter via Neutron Star Compactness Scaling	Bao-Jun Cai et.al.	2406.05025v1	null
2024-06-07	Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs	Fan Liu et.al.	2406.06622v1	null
2024-06-07	Cactus-like Metamaterial Structures for Electromagnetically Induced Transparency at THz frequencies	Savvas Papamakarios et.al.	2406.04862v1	null
2024-06-07	Black Box Differential Privacy Auditing Using Total Variation Distance	Antti Koskela et.al.	2406.04827v1	null
2024-06-07	Graph Mining under Data scarcity	Appan Rakaraddi et.al.	2406.04825v2	null
2024-06-07	Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning	Xuehui Yu et.al.	2406.04815v1	link
2024-06-07	Cooperative Meta-Learning with Gradient Augmentation	Jongyun Shin et.al.	2406.04639v1	link

Transfer Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale	William Giarè et.al.	2406.07493v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null

Unsupervised Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale	William Giarè et.al.	2406.07493v1	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null

GAN

Publish Date	Title	Authors	PDF	Code
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention	Mingshuai Liu et.al.	2406.07498v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks	Ted Edward Holmberg et.al.	2406.07473v1	null
2024-06-11	Microbiomes Through The Looking Glass	Jacopo Pasqualini et.al.	2406.07465v1	null
2024-06-11	Reconfigurable Intelligent Surfaces in Dynamic Rich Scattering Environments: BiLSTM-Based Optimization for Accurate User Localization	Anum Umer et.al.	2406.07463v1	null
2024-06-11	fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions	Alireza Afzal Aghaei et.al.	2406.07456v1	link
2024-06-11	HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms	Josse Van Delm et.al.	2406.07453v1	null
2024-06-11	Boosted Conformal Prediction Intervals	Ran Xie et.al.	2406.07449v1	null
2024-06-11	Metastability in networks of nonlinear stochastic integrate-and-fire neurons	Siddharth Paliwal et.al.	2406.07445v1	null
2024-06-11	DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting	Yuxuan Shu et.al.	2406.07438v1	null
2024-06-11	Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration	Shashank Agnihotri et.al.	2406.07435v1	null
2024-06-11	Matryoshka Representation Learning for Recommendation	Riwei Lai et.al.	2406.07432v1	link
2024-06-11	GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning	Tonghan Wang et.al.	2406.07428v1	null
2024-06-11	Graph Reasoning for Explainable Cold Start Recommendation	Jibril Frej et.al.	2406.07420v1	null
2024-06-11	Average-exact mixed anomalies and compatible phases	Yichen Xu et.al.	2406.07417v1	null
2024-06-11	Holistic Memory Diversification for Incremental Learning in Growing Graphs	Ziyue Qiao et.al.	2406.07413v1	null
2024-06-11	Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance	Ruxin Zheng et.al.	2406.07399v1	null
2024-06-11	Holographic reconstruction of black hole spacetime: machine learning and entanglement entropy	Byoungjoon Ahn et.al.	2406.07395v1	null
2024-06-11	DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling	Sixian Wang et.al.	2406.07390v1	null
2024-06-11	Disrupting Bipartite Trading Networks: Matching for Revenue Maximization	Luca D'Amico-Wong et.al.	2406.07385v1	null
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	COLoRIS: Localization-agnostic Smart Surfaces Enabling Opportunistic ISAC in 6G Networks	Guillermo Encinas-Lago et.al.	2406.07377v1	null
2024-06-11	Decoding planetary surfaces by counting cracks	S. Silver et.al.	2406.07376v1	null
2024-06-11	Improving the realism of robotic surgery simulation through injection of learning-based estimated errors	Juan Antonio Barragan et.al.	2406.07375v1	null
2024-06-11	Deep Implicit Optimization for Robust and Flexible Image Registration	Rohit Jena et.al.	2406.07361v1	null
2024-06-11	Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities	Delfina Sol Martinez Pandiani et.al.	2406.07353v1	null
2024-06-11	Stochastic Analysis of Homogeneous Wireless Networks Assisted by Intelligent Reflecting Surfaces	Ali H. Abdollahi Bafghi et.al.	2406.07352v1	null

Multi-modal

Vision-Language

Publish Date	Title	Authors	PDF	Code

Image Caption

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing	Bhaskar Gaur et.al.	2406.07486v1	null
2024-06-11	Image Neural Field Diffusion Models	Yinbo Chen et.al.	2406.07480v1	null
2024-06-11	VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs	Zesen Cheng et.al.	2406.07476v1	link
2024-06-11	fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions	Alireza Afzal Aghaei et.al.	2406.07456v1	link
2024-06-11	An Optimism-based Approach to Online Evaluation of Generative Models	Xiaoyan Hu et.al.	2406.07451v1	null
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450v1	link
2024-06-11	Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration	Shashank Agnihotri et.al.	2406.07435v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	Learning Domain-Invariant Features for Out-of-Context News Detection	Yimeng Gu et.al.	2406.07430v1	null
2024-06-11	DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses	Abdurrahim Yilmaz et.al.	2406.07426v1	null
2024-06-11	Persistent currents in mesoscopic spin-orbit coupled rings due to an applied Zeeman field	Bijay Kumar Sahoo et.al.	2406.07405v1	null
2024-06-11	Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance	Ruxin Zheng et.al.	2406.07399v1	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling	Sixian Wang et.al.	2406.07390v1	null

Multi-modal

Publish Date	Title	Authors	PDF	Code
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	Multimodal Belief Prediction	John Murzaku et.al.	2406.07466v1	null
2024-06-11	World Models with Hints of Large Language Models for Goal Achieving	Zeyuan Liu et.al.	2406.07381v1	null
2024-06-11	Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities	Delfina Sol Martinez Pandiani et.al.	2406.07353v1	null
2024-06-11	Transferring Knowledge from Large Foundation Models to Small Downstream Models	Shikai Qiu et.al.	2406.07337v1	null
2024-06-11	MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting	Zhiqi Ai et.al.	2406.07310v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-11	Open-World Human-Object Interaction Detection via Multi-modal Prompts	Jie Yang et.al.	2406.07221v1	null
2024-06-11	Target Speech Diarization with Multimodal Prompts	Yidi Jiang et.al.	2406.07198v1	null
2024-06-11	RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker	Yunfeng Li et.al.	2406.07189v1	link
2024-06-11	Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology	Huahui Yi et.al.	2406.07078v1	link
2024-06-11	Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study	Yichi Zhang et.al.	2406.07057v1	null
2024-06-11	Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey	Ping Liu et.al.	2406.06965v1	null
2024-06-11	Missingness-resilient Video-enhanced Multimodal Disfluency Detection	Payal Mohapatra et.al.	2406.06964v1	null
2024-06-11	Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems	Mohammed Elhenawy et.al.	2406.06865v1	null
2024-06-10	FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors	Jason Wu et.al.	2406.06796v1	link
2024-06-10	BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification	June-Woo Kim et.al.	2406.06786v1	null
2024-06-10	MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension	Khiem Le et.al.	2406.06777v1	null
2024-06-10	Merlin: A Vision Language Foundation Model for 3D Computed Tomography	Louis Blankemeier et.al.	2406.06512v1	null
2024-06-10	AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction	Zhen Xing et.al.	2406.06465v1	null
2024-06-10	VCR: Visual Caption Restoration	Tianyu Zhang et.al.	2406.06462v1	link
2024-06-10	Margin-aware Preference Optimization for Aligning Diffusion Models without Reference	Jiwoo Hong et.al.	2406.06424v1	null
2024-06-10	STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics	Jiawen Chen et.al.	2406.06393v1	link
2024-06-10	Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization	Yi Gu et.al.	2406.06382v1	link
2024-06-10	ASTRA: Aligning Speech and Text Representations for Asr without Sampling	Neeraj Gaur et.al.	2406.06664v1	null
2024-06-10	MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing	Yu-Fen Huang et.al.	2406.06375v1	link
2024-06-10	A Guide to Stochastic Optimisation for Large-Scale Inverse Problems	Matthias J. Ehrhardt et.al.	2406.06342v1	null

VQA

Publish Date	Title	Authors	PDF	Code
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale	William Giarè et.al.	2406.07493v1	null
2024-06-11	Paraphrasing in Affirmative Terms Improves Negation Understanding	MohammadHossein Rezaei et.al.	2406.07492v1	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	PITCH: Productivity and Mental Well-being Coaching through Daily Conversational Interaction	Adnan Abbas et.al.	2406.07485v1	null
2024-06-11	The end of multiple choice tests: using AI to enhance assessment	Michael Klymkowsky et.al.	2406.07481v1	null
2024-06-11	VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs	Zesen Cheng et.al.	2406.07476v1	link
2024-06-11	Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior	Anming Gu et.al.	2406.07475v1	null
2024-06-11	Estimating the Hallucination Rate of Generative AI	Andrew Jesson et.al.	2406.07457v1	null
2024-06-11	Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis	Qining Zhang et.al.	2406.07455v1	null
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450v1	link
2024-06-11	Constructions of Turán systems that are tight up to a multiplicative constant	Oleg Pikhurko et.al.	2406.07443v1	null
2024-06-11	Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling	Denis Blessing et.al.	2406.07423v1	link
2024-06-11	Holistic Memory Diversification for Incremental Learning in Growing Graphs	Ziyue Qiao et.al.	2406.07413v1	null
2024-06-11	Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control	William Murphy et.al.	2406.07400v1	null
2024-06-11	Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance	Ruxin Zheng et.al.	2406.07399v1	null
2024-06-11	Limited Out-of-Context Knowledge Reasoning in Large Language Models	Peng Hu et.al.	2406.07393v1	null
2024-06-11	AI.vs.Clinician: Unveiling Intricate Interactions Between AI and Clinicians through an Open-Access Database	Wanling Gao et.al.	2406.07362v1	null
2024-06-11	Deep Implicit Optimization for Robust and Flexible Image Registration	Rohit Jena et.al.	2406.07361v1	null
2024-06-11	PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering	Longlong Lin et.al.	2406.07357v1	null
2024-06-11	DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering	Zijian Hei et.al.	2406.07348v2	null
2024-06-11	Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling	Constantin Waubert de Puiseau et.al.	2406.07325v1	null
2024-06-11	The magic of entangled top quarks	Chris D. White et.al.	2406.07321v2	null
2024-06-11	Rethinking the impact of noisy labels in graph classification: A utility and privacy perspective	De Li et.al.	2406.07314v1	null
2024-06-11	BertaQA: How Much Do Language Models Know About Local Culture?	Julen Etxaniz et.al.	2406.07302v1	link

Text and Image Generation

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	On the potential of probing the neutron star composition in accreting X-ray binaries	Kaiser Arf et.al.	2406.07534v1	null
2024-06-11	Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect?	Ioannis D. Gialamas et.al.	2406.07533v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity	Ozgur Akarsu et.al.	2406.07526v1	null
2024-06-11	Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions	Haibo Wang et.al.	2406.07525v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null

Alignment

Publish Date	Title	Authors	PDF	Code
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	Multimodal Belief Prediction	John Murzaku et.al.	2406.07466v1	null
2024-06-11	World Models with Hints of Large Language Models for Goal Achieving	Zeyuan Liu et.al.	2406.07381v1	null
2024-06-11	Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities	Delfina Sol Martinez Pandiani et.al.	2406.07353v1	null
2024-06-11	Transferring Knowledge from Large Foundation Models to Small Downstream Models	Shikai Qiu et.al.	2406.07337v1	null
2024-06-11	MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting	Zhiqi Ai et.al.	2406.07310v1	null
2024-06-11	Which Country Is This? Automatic Country Ranking of Street View Photos	Tim Menzner et.al.	2406.07227v1	null
2024-06-11	Open-World Human-Object Interaction Detection via Multi-modal Prompts	Jie Yang et.al.	2406.07221v1	null
2024-06-11	Target Speech Diarization with Multimodal Prompts	Yidi Jiang et.al.	2406.07198v1	null
2024-06-11	RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker	Yunfeng Li et.al.	2406.07189v1	link
2024-06-11	Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology	Huahui Yi et.al.	2406.07078v1	link
2024-06-11	Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study	Yichi Zhang et.al.	2406.07057v1	null
2024-06-11	Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey	Ping Liu et.al.	2406.06965v1	null
2024-06-11	Missingness-resilient Video-enhanced Multimodal Disfluency Detection	Payal Mohapatra et.al.	2406.06964v1	null
2024-06-11	Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems	Mohammed Elhenawy et.al.	2406.06865v1	null
2024-06-10	FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors	Jason Wu et.al.	2406.06796v1	link
2024-06-10	BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification	June-Woo Kim et.al.	2406.06786v1	null
2024-06-10	MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension	Khiem Le et.al.	2406.06777v1	null
2024-06-10	Merlin: A Vision Language Foundation Model for 3D Computed Tomography	Louis Blankemeier et.al.	2406.06512v1	null
2024-06-10	AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction	Zhen Xing et.al.	2406.06465v1	null
2024-06-10	VCR: Visual Caption Restoration	Tianyu Zhang et.al.	2406.06462v1	link
2024-06-10	Margin-aware Preference Optimization for Aligning Diffusion Models without Reference	Jiwoo Hong et.al.	2406.06424v1	null
2024-06-10	STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics	Jiawen Chen et.al.	2406.06393v1	link
2024-06-10	Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization	Yi Gu et.al.	2406.06382v1	link
2024-06-10	ASTRA: Aligning Speech and Text Representations for Asr without Sampling	Neeraj Gaur et.al.	2406.06664v1	null
2024-06-10	MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing	Yu-Fen Huang et.al.	2406.06375v1	link
2024-06-10	A Guide to Stochastic Optimisation for Large-Scale Inverse Problems	Matthias J. Ehrhardt et.al.	2406.06342v1	null

Transformer

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis	Qining Zhang et.al.	2406.07455v1	null
2024-06-11	Textual Similarity as a Key Metric in Machine Translation Quality Estimation	Kun Sun et.al.	2406.07440v1	null
2024-06-11	DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting	Yuxuan Shu et.al.	2406.07438v1	null
2024-06-11	Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration	Shashank Agnihotri et.al.	2406.07435v1	null
2024-06-11	Making 'syscall' a Privilege not a Right	Fangfei Yang et.al.	2406.07429v1	null
2024-06-11	GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning	Tonghan Wang et.al.	2406.07428v1	null
2024-06-11	Entropy, slicing problem and functional Mahler's conjecture	Matthieu Fradelizi et.al.	2406.07406v1	null
2024-06-11	Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy	Xiaohan Huang et.al.	2406.07404v1	null
2024-06-11	A Survey on Recent Random Walk-based Methods for Embedding Knowledge Graphs	Elika Bozorgi et.al.	2406.07402v1	null
2024-06-11	Fast and accurate evaluation of Biot-Savart integrals over spatial curves	Juan Ignacio Polanco et.al.	2406.07366v1	null
2024-06-11	Chebyshev Approximated Variational Coupled Cluster for Quantum Computing	Luca Erhart et.al.	2406.07364v1	null
2024-06-11	Deep Implicit Optimization for Robust and Flexible Image Registration	Rohit Jena et.al.	2406.07361v1	null
2024-06-11	Global-Regularized Neighborhood Regression for Efficient Zero-Shot Texture Anomaly Detection	Haiming Yao et.al.	2406.07333v1	null
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332v1	null
2024-06-11	Instruct Large Language Models to Drive like Humans	Ruijun Zhang et.al.	2406.07296v1	link
2024-06-11	$\mathscr{D}$-modules on the basic affine space and large $\mathfrak{g}$-modules	Masatoshi Kitagawa et.al.	2406.07279v1	null
2024-06-11	Are Protein Language Models Compute Optimal?	Yaiza Serrano et.al.	2406.07249v1	null
2024-06-11	Dynamical Mean-Field Theory of Self-Attention Neural Networks	Ángel Poc-López et.al.	2406.07247v1	null

Vision Transformer

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	ReduceFormer: Attention with Tensor Reduction by Summation	John Yang et.al.	2406.07488v1	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null

Reinforcement Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis	Qining Zhang et.al.	2406.07455v1	null
2024-06-11	Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization	Weiliang Zhang et.al.	2406.07418v1	null
2024-06-11	Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy	Xiaohan Huang et.al.	2406.07404v1	null
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	World Models with Hints of Large Language Models for Goal Achieving	Zeyuan Liu et.al.	2406.07381v1	null
2024-06-11	EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning	Yijun Hao et.al.	2406.07342v1	null
2024-06-11	Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling	Constantin Waubert de Puiseau et.al.	2406.07325v1	null
2024-06-11	Multi-objective Reinforcement learning from AI Feedback	Marcus Williams et.al.	2406.07295v2	null
2024-06-11	Hybrid Reinforcement Learning from Offline Observation Alone	Yuda Song et.al.	2406.07253v1	null
2024-06-11	A generic and robust quantum agent inspired by deep meta-reinforcement learning	Zibo Miao et.al.	2406.07225v1	null
2024-06-11	Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning	Zhiyu Shao et.al.	2406.07213v1	link
2024-06-11	Machine learning potential for the Cu-W system	Manura Liyanage et.al.	2406.07157v1	null
2024-06-11	Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models	Som Sagar et.al.	2406.07145v1	null
2024-06-11	CHARME: A chain-based reinforcement learning approach for the minor embedding problem	Hoang M. Ngo et.al.	2406.07124v1	null
2024-06-11	Augmenting Offline RL with Unlabeled Data	Zhao Wang et.al.	2406.07117v1	null
2024-06-11	Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning	Xuezhi Niu et.al.	2406.07069v1	null
2024-06-11	Integrating Domain Knowledge for handling Limited Data in Offline RL	Briti Gangopadhyay et.al.	2406.07041v1	null
2024-06-11	Entropy-Reinforced Planning with Large Language Models for Drug Discovery	Xuefeng Liu et.al.	2406.07025v1	null
2024-06-11	Delving into ChatGPT usage in academic writing through excess vocabulary	Dmitry Kobak et.al.	2406.07016v1	null
2024-06-11	DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach	Zhang Liu et.al.	2406.06986v1	null
2024-06-11	Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback	Chenliang Li et.al.	2406.06874v1	null
2024-06-11	Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning	Adhyyan Narang et.al.	2406.06856v1	null
2024-06-10	Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness	Dingrong Wang et.al.	2406.06792v1	link
2024-06-10	Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation	Michelle Pan et.al.	2406.06714v1	null
2024-06-10	Verification-Guided Shielding for Deep Reinforcement Learning	Davide Corsi et.al.	2406.06507v1	null
2024-06-10	Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation	Mohidul Haque Mridul et.al.	2406.06500v1	null
2024-06-10	Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity	Calarina Muslimani et.al.	2406.06495v1	null
2024-06-10	Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots	Bahador Beigomi et.al.	2406.06460v1	link

Robotics

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding	Ming Hu et.al.	2406.07471v2	null
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398v1	null
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	Improving the realism of robotic surgery simulation through injection of learning-based estimated errors	Juan Antonio Barragan et.al.	2406.07375v1	null
2024-06-11	iMESA: Incremental Distributed Optimization for Collaborative Simultaneous Localization and Mapping	Daniel McGann et.al.	2406.07371v1	null
2024-06-11	Realistic Data Generation for 6D Pose Estimation of Surgical Instruments	Juan Antonio Barragan et.al.	2406.07328v1	null
2024-06-11	Should XAI Nudge Human Decisions with Explanation Biasing?	Yosuke Fukuchi et.al.	2406.07323v1	null
2024-06-11	Experimental Modeling of Chiral Active Robots and a Minimal Model of Non-Gaussian Displacements	Yuxuan Zhou et.al.	2406.07313v1	null
2024-06-11	Instruct Large Language Models to Drive like Humans	Ruijun Zhang et.al.	2406.07296v1	link
2024-06-11	OTO Planner: An Efficient Only Travelling Once Exploration Planner for Complex and Unknown Environments	Bo Zhou et.al.	2406.07294v1	null
2024-06-11	3D Voxel Maps to 2D Occupancy Maps for Efficient Path Planning for Aerial and Ground Robots	Scott Fredriksson et.al.	2406.07270v1	null
2024-06-11	Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph	Sergey Linok et.al.	2406.07113v1	null
2024-06-11	A Neck Orthosis with Multi-Directional Variable Stiffness for Persons with Dropped Head Syndrome	Santiago Price Torrendell et.al.	2406.07074v1	null
2024-06-11	Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning	Xuezhi Niu et.al.	2406.07069v1	null
2024-06-11	Optimal Gait Design for a Soft Quadruped Robot via Multi-fidelity Bayesian Optimization	Kaige Tan et.al.	2406.07065v1	null
2024-06-11	GPU-Accelerated Optimization-Based Collision Avoidance	Zeming Wu et.al.	2406.07048v1	null
2024-06-11	Neural Visibility Field for Uncertainty-Driven Active Mapping	Shangjie Xue et.al.	2406.06948v1	null
2024-06-11	CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only	Junhee Cho et.al.	2406.06947v1	link
2024-06-11	Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots	Xiang Zhi Tan et.al.	2406.06904v1	null
2024-06-11	Developing, Analyzing, and Evaluating Vehicular Lane Keeping Algorithms Under Dynamic Lighting and Weather Conditions Using Electric Vehicles	Michael Khalfin et.al.	2406.06899v1	null
2024-06-11	Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback	Chenliang Li et.al.	2406.06874v1	null
2024-06-10	HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction	Jikai Wang et.al.	2406.06843v1	null
2024-06-10	FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors	Jason Wu et.al.	2406.06796v1	link
2024-06-10	Starling Formation-Flying Optical Experiment: Initial Operations and Flight Results	Justin Kruger et.al.	2406.06748v1	null
2024-06-10	Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents	Federico Rossi et.al.	2406.06724v1	null
2024-06-10	Verification-Guided Shielding for Deep Reinforcement Learning	Davide Corsi et.al.	2406.06507v1	null
2024-06-10	Demonstrating HumanTHOR: A Simulation Platform and Benchmark for Human-Robot Collaboration in a Shared Workspace	Chenxu Wang et.al.	2406.06498v1	null

SFM

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect?	Ioannis D. Gialamas et.al.	2406.07533v1	null
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Cosmological constraints on $\Lambda_{\rm s}$CDM scenario in a type II minimally modified gravity	Ozgur Akarsu et.al.	2406.07526v1	null
2024-06-11	Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions	Haibo Wang et.al.	2406.07525v1	null
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	The canonical trace of Cohen-Macaulay algebras of codimension 2	Antonino Ficarra et.al.	2406.07517v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum	N. -O. Stutzer et.al.	2406.07511v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null

SLAM

Publish Date	Title	Authors	PDF	Code
2024-06-10	Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF)	Gyubeom Im et.al.	2406.06427v1	null
2024-06-10	Notes on Various Errors and Jacobian Derivations for SLAM	Gyubeom Im et.al.	2406.06422v1	null
2024-06-10	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374v1	link
2024-06-10	Visual-Inertial SLAM as Simple as A, B, VINS	Nathaniel Merrill et.al.	2406.05969v1	null
2024-06-09	MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps	Jianhao Zheng et.al.	2406.05849v1	null
2024-06-06	Open Problem: Active Representation Learning	Nikola Milosevic et.al.	2406.03845v1	null
2024-06-04	ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization	Chen Mao et.al.	2406.01906v1	link
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929v1	null
2024-06-02	Visual place recognition for aerial imagery: A survey	Ivan Moskalenko et.al.	2406.00885v1	link
2024-05-30	Structure Gaussian SLAM with Manhattan World Hypothesis	Shuhong Liu et.al.	2405.20031v1	null
2024-05-30	Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar	Wouter Jansen et.al.	2405.19869v1	null
2024-05-30	SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization	Jiang Wang et.al.	2405.19813v1	link
2024-05-30	TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM	Peifeng Jiang et.al.	2405.19614v1	null
2024-05-27	CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy	Richard Elvira et.al.	2405.16932v1	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544v1	link
2024-05-24	NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes	Lizhi Bai et.al.	2405.15151v1	null
2024-05-23	ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization	Han Song et.al.	2405.15082v2	null
2024-05-23	Synergistic Global-space Camera and Human Reconstruction from Videos	Yizhou Zhao et.al.	2405.14855v1	null
2024-05-23	CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments	Yang Zhou et.al.	2405.14731v1	link
2024-05-23	Efficient Robot Learning for Perception and Mapping	Niclas Vödisch et.al.	2405.14688v1	null
2024-05-22	Monocular Gaussian SLAM with Language Extended Loop Closure	Tian Lan et.al.	2405.13748v1	null
2024-05-21	NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments	Dongha Chung et.al.	2405.12563v2	link
2024-05-18	Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation	Hyungtae Lim et.al.	2405.11176v3	null
2024-05-18	MotionGS: Compact Gaussian Splatting SLAM by Motion Filter	Xinli Guo et.al.	2405.11129v2	link
2024-05-17	CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion	Gang Wang et.al.	2405.10793v2	null
2024-05-17	Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map	Liang Zhao et.al.	2405.10743v1	null
2024-05-14	IPC: Incremental Probabilistic Consensus-based Consistent Set Maximization for SLAM Backends	Emilio Olivastri et.al.	2405.08503v1	link
2024-05-13	OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition	Qiuchi Xiang et.al.	2405.07966v1	link
2024-05-13	SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling	Yijun Yuan et.al.	2405.07847v1	null
2024-05-12	NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU	Yuhao Zhang et.al.	2405.07392v1	link

Visual Localization

Publish Date	Title	Authors	PDF	Code
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field	Chao Wang et.al.	2406.07329v1	null
2024-06-11	Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs	Kamil Jeziorek et.al.	2406.07318v1	null
2024-06-11	Let Go of Your Labels with Unsupervised Transfer	Artyom Gadetsky et.al.	2406.07236v1	link
2024-06-11	RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker	Yunfeng Li et.al.	2406.07189v1	link
2024-06-11	Increased accuracy and signal-to-noise ratio through recent improvements in Infra-Red Video Bolometer fabrication and calibration	Fabio Federici et.al.	2406.07139v1	null
2024-06-11	PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Yining Shi et.al.	2406.07037v1	null
2024-06-11	MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results	Xin Jin et.al.	2406.07006v1	null
2024-06-11	Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion	Xin Yuan et.al.	2406.06972v1	null
2024-06-11	Neural Visibility Field for Uncertainty-Driven Active Mapping	Shangjie Xue et.al.	2406.06948v1	null
2024-06-11	High-velocity blue-shifted Fe XXV He$\alpha$ line during a superflare of the RS CVn-type star IM Peg	Shun Inoue et.al.	2406.06940v1	null
2024-06-10	The PAU Survey: Photometric Calibration of Narrow Band Images	F. J. Castander et.al.	2406.06850v1	null
2024-06-10	HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction	Jikai Wang et.al.	2406.06843v1	null
2024-06-10	Starling Formation-Flying Optical Experiment: Initial Operations and Flight Results	Justin Kruger et.al.	2406.06748v1	null
2024-06-10	PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction	Danpeng Chen et.al.	2406.06521v1	null
2024-06-10	SYM3D: Learning Symmetric Triplanes for Better 3D-Awareness of GANs	Jing Yang et.al.	2406.06432v1	null
2024-06-10	Notes on Various Errors and Jacobian Derivations for SLAM	Gyubeom Im et.al.	2406.06422v1	null
2024-06-10	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374v1	link
2024-06-10	Relativistic and wide-angle corrections to galaxy power spectra	Sheean Jolicoeur et.al.	2406.06274v1	null
2024-06-10	DualAD: Disentangling the Dynamic and Static World for End-to-End Driving	Simon Doll et.al.	2406.06264v1	null
2024-06-10	Vript: A Video Is Worth Thousands of Words	Dongjie Yang et.al.	2406.06040v1	link
2024-06-09	SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving	Chen Ma et.al.	2406.05800v1	null
2024-06-09	Region of Interest Loss for Anonymizing Learned Image Compression	Christoph Liebender et.al.	2406.05726v1	null
2024-06-09	Enhancing the light yield of He:CF$_4$ based gaseous detector	F. D. Amaro et.al.	2406.05713v1	null
2024-06-09	MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation	Yan Ma et.al.	2406.05690v1	link
2024-06-08	The PLATO Mission	Heike Rauer et.al.	2406.05447v1	null
2024-06-08	MotionClone: Training-Free Motion Cloning for Controllable Video Generation	Pengyang Ling et.al.	2406.05338v2	null
2024-06-07	Lessons from the Cruise Robotaxi Pedestrian Dragging Mishap	Philip Koopman et.al.	2406.05281v1	null
2024-06-07	A Tensor Decomposition Perspective on Second-order RNNs	Maude Lizaire et.al.	2406.05045v1	link

Contrastive Learning

Publish Date	Title	Authors	PDF	Code
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Change of numeraire for weak martingale transport	Mathias Beiglböck et.al.	2406.07523v1	null
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid	Naser Souri et.al.	2406.07503v1	null
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale	William Giarè et.al.	2406.07493v1	null
2024-06-11	Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction	Bekir Z. Demiray et.al.	2406.07484v1	null
2024-06-11	Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery	Biplov Bhandari et.al.	2406.07482v1	null

Medical Application

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum	N. -O. Stutzer et.al.	2406.07511v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	$(J/\psi, J/\psi)$, and $(\eta_c, \eta_c)$ production through two intermediate photons in electron-positron annihilation at B-factories	Shashank Bhatnagar et.al.	2406.07508v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention	Mingshuai Liu et.al.	2406.07498v1	null
2024-06-11	A pilot protocol and cohort for the investigation of non-pathological variability in speech	Nicholas Cummins et.al.	2406.07497v1	null
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing	Bhaskar Gaur et.al.	2406.07486v1	null

Medical Image Analysis

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Interpolating between Hausdorff and box dimension	Amlan Banaji et.al.	2406.07527v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum	N. -O. Stutzer et.al.	2406.07511v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	$(J/\psi, J/\psi)$, and $(\eta_c, \eta_c)$ production through two intermediate photons in electron-positron annihilation at B-factories	Shashank Bhatnagar et.al.	2406.07508v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention	Mingshuai Liu et.al.	2406.07498v1	null
2024-06-11	A pilot protocol and cohort for the investigation of non-pathological variability in speech	Nicholas Cummins et.al.	2406.07497v1	null
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection	Hang Yao et.al.	2406.07487v1	null
2024-06-11	Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing	Bhaskar Gaur et.al.	2406.07486v1	null

Medical Multi-modal

Publish Date	Title	Authors	PDF	Code
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions	Haibo Wang et.al.	2406.07525v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks	Ted Edward Holmberg et.al.	2406.07473v1	null
2024-06-11	4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models	Heng Yu et.al.	2406.07472v1	null
2024-06-11	Multimodal Belief Prediction	John Murzaku et.al.	2406.07466v1	null
2024-06-11	Resummation of Multi-Stress Tensors in Higher Dimensions	Kuo-Wei Huang et.al.	2406.07458v1	null
2024-06-11	An Optimism-based Approach to Online Evaluation of Generative Models	Xiaoyan Hu et.al.	2406.07451v1	null
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450v1	link
2024-06-11	Graph-based multi-Feature fusion method for speech emotion recognition	Xueyu Liu et.al.	2406.07437v1	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning	Tonghan Wang et.al.	2406.07428v1	null
2024-06-11	DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses	Abdurrahim Yilmaz et.al.	2406.07426v1	null
2024-06-11	Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation	Hanzhao Li et.al.	2406.07422v1	null
2024-06-11	Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks	Bjarke Madsen et.al.	2406.07383v1	null
2024-06-11	World Models with Hints of Large Language Models for Goal Achieving	Zeyuan Liu et.al.	2406.07381v1	null
2024-06-11	Improving the realism of robotic surgery simulation through injection of learning-based estimated errors	Juan Antonio Barragan et.al.	2406.07375v1	null
2024-06-11	iMESA: Incremental Distributed Optimization for Collaborative Simultaneous Localization and Mapping	Daniel McGann et.al.	2406.07371v1	null
2024-06-11	BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction	Yinhao Bai et.al.	2406.07365v1	link
2024-06-11	AI.vs.Clinician: Unveiling Intricate Interactions Between AI and Clinicians through an Open-Access Database	Wanling Gao et.al.	2406.07362v1	null
2024-06-11	Deep Implicit Optimization for Robust and Flexible Image Registration	Rohit Jena et.al.	2406.07361v1	null
2024-06-11	GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews	Maxime Darrin et.al.	2406.07359v1	null
2024-06-11	Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities	Delfina Sol Martinez Pandiani et.al.	2406.07353v1	null
2024-06-11	DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering	Zijian Hei et.al.	2406.07348v2	null
2024-06-11	Few-Body Quantum Chaos, Localization, and Multi-Photon Entanglement in Optical Synthetic Frequency Dimension	Junlin Wang et.al.	2406.07346v1	null

Graph Neural Network

Publish Date	Title	Authors	PDF	Code
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention	Mingshuai Liu et.al.	2406.07498v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Image Neural Field Diffusion Models	Yinbo Chen et.al.	2406.07480v1	null
2024-06-11	Lower bounds for sphere packing in arbitrary norms	Carl Schildkraut et.al.	2406.07479v1	null
2024-06-11	Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks	Ted Edward Holmberg et.al.	2406.07473v1	null
2024-06-11	Microbiomes Through The Looking Glass	Jacopo Pasqualini et.al.	2406.07465v1	null
2024-06-11	Reconfigurable Intelligent Surfaces in Dynamic Rich Scattering Environments: BiLSTM-Based Optimization for Accurate User Localization	Anum Umer et.al.	2406.07463v1	null
2024-06-11	fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions	Alireza Afzal Aghaei et.al.	2406.07456v1	link
2024-06-11	HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms	Josse Van Delm et.al.	2406.07453v1	null
2024-06-11	Boosted Conformal Prediction Intervals	Ran Xie et.al.	2406.07449v1	null
2024-06-11	Metastability in networks of nonlinear stochastic integrate-and-fire neurons	Siddharth Paliwal et.al.	2406.07445v1	null
2024-06-11	DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting	Yuxuan Shu et.al.	2406.07438v1	null
2024-06-11	Graph-based multi-Feature fusion method for speech emotion recognition	Xueyu Liu et.al.	2406.07437v1	null
2024-06-11	Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration	Shashank Agnihotri et.al.	2406.07435v1	null
2024-06-11	Matryoshka Representation Learning for Recommendation	Riwei Lai et.al.	2406.07432v1	link
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431v1	null
2024-06-11	GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning	Tonghan Wang et.al.	2406.07428v1	null
2024-06-11	Graph Reasoning for Explainable Cold Start Recommendation	Jibril Frej et.al.	2406.07420v1	null
2024-06-11	Average-exact mixed anomalies and compatible phases	Yichen Xu et.al.	2406.07417v1	null
2024-06-11	Heat operators and isometry groups of Cuntz-Krieger algebras	Dimitris Michail Gerontogiannis et.al.	2406.07416v1	null
2024-06-11	Holistic Memory Diversification for Incremental Learning in Growing Graphs	Ziyue Qiao et.al.	2406.07413v1	null
2024-06-11	Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy	Xiaohan Huang et.al.	2406.07404v1	null
2024-06-11	A Survey on Recent Random Walk-based Methods for Embedding Knowledge Graphs	Elika Bozorgi et.al.	2406.07402v1	null
2024-06-11	Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance	Ruxin Zheng et.al.	2406.07399v1	null

Large-Language Model

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect?	Ioannis D. Gialamas et.al.	2406.07533v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity	Ozgur Akarsu et.al.	2406.07526v1	null
2024-06-11	Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions	Haibo Wang et.al.	2406.07525v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics	Reuben R. W. Wang et.al.	2406.07519v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven"	D. T. Chung et.al.	2406.07512v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	Uniqueness on average of large isoperimetric sets in noncompact manifolds with nonnegative Ricci curvature	Gioacchino Antonelli et.al.	2406.07509v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null

Edge Computing

Privacy

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report	KBTG Labs et.al.	2406.07505v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Paraphrasing in Affirmative Terms Improves Negation Understanding	MohammadHossein Rezaei et.al.	2406.07492v1	null

Efficient

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System	SBND Collaboration et.al.	2406.07514v1	null
2024-06-11	Accurate Current Sharing in a DC Microgrid Using Modified Droop Control Algorithm	Naser Souri et.al.	2406.07513v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report	KBTG Labs et.al.	2406.07505v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null

Scalability

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report	KBTG Labs et.al.	2406.07505v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Paraphrasing in Affirmative Terms Improves Negation Understanding	MohammadHossein Rezaei et.al.	2406.07492v1	null

Performance

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect?	Ioannis D. Gialamas et.al.	2406.07533v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity	Ozgur Akarsu et.al.	2406.07526v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System	SBND Collaboration et.al.	2406.07514v1	null
2024-06-11	Accurate Current Sharing in a DC Microgrid Using Modified Droop Control Algorithm	Naser Souri et.al.	2406.07513v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum	N. -O. Stutzer et.al.	2406.07511v1	null
2024-06-11	COMAP Pathfinder -- Season 2 results I. Improved data selection and processing	J. G. S. Lunde et.al.	2406.07510v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null

Reliability

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report	KBTG Labs et.al.	2406.07505v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Paraphrasing in Affirmative Terms Improves Negation Understanding	MohammadHossein Rezaei et.al.	2406.07492v1	null

Trust

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report	KBTG Labs et.al.	2406.07505v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Paraphrasing in Affirmative Terms Improves Negation Understanding	MohammadHossein Rezaei et.al.	2406.07492v1	null

Secure

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null
2024-06-11	Instant 3D Human Avatar Generation using Image Diffusion Models	Nikos Kolotouros et.al.	2406.07516v1	null
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515v1	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507v1	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506v1	link
2024-06-11	THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report	KBTG Labs et.al.	2406.07505v1	null
2024-06-11	Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Atli Sigurgeirsson et.al.	2406.07504v1	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499v1	null
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496v1	link
2024-06-11	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494v2	null
2024-06-11	Paraphrasing in Affirmative Terms Improves Negation Understanding	MohammadHossein Rezaei et.al.	2406.07492v1	null

Edge Computing

Publish Date	Title	Authors	PDF	Code
2024-06-11	An Image is Worth 32 Tokens for Reconstruction and Generation	Qihang Yu et.al.	2406.07550v1	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551v1	link
2024-06-11	A3VLM: Actionable Articulation-Aware Vision Language Model	Siyuan Huang et.al.	2406.07549v1	link
2024-06-11	Image and Video Tokenization with Binary Spherical Quantization	Yue Zhao et.al.	2406.07548v1	link
2024-06-11	Zero-shot Image Editing with Reference Imitation	Xi Chen et.al.	2406.07547v1	null
2024-06-11	Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?	Xingyu Fu et.al.	2406.07546v1	null
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545v1	link
2024-06-11	Situational Awareness Matters in 3D Vision Language Reasoning	Yunze Man et.al.	2406.07544v1	null
2024-06-11	Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	Chenyu Yang et.al.	2406.07543v1	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542v1	link
2024-06-11	CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning	Zeyuan Liu et.al.	2406.07541v1	null
2024-06-11	Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance	Kuan Heng Lin et.al.	2406.07540v1	null
2024-06-11	BAKU: An Efficient Transformer for Multi-Task Policy Learning	Siddhant Haldar et.al.	2406.07539v1	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538v1	null
2024-06-11	Autoregressive Pretraining with Mamba in Vision	Sucheng Ren et.al.	2406.07537v1	null
2024-06-11	Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection	Wenxiao Wang et.al.	2406.07536v1	null
2024-06-11	Dynamics of the non-radial energy-critical inhomogeneous NLS	Carlos M. Guzmán et.al.	2406.07535v1	null
2024-06-11	On the potential of probing the neutron star composition in accreting X-ray binaries	Kaiser Arf et.al.	2406.07534v1	null
2024-06-11	Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect?	Ioannis D. Gialamas et.al.	2406.07533v1	null
2024-06-11	Hearing Anything Anywhere	Mason Wang et.al.	2406.07532v1	link
2024-06-11	Interacting-bath dynamical embedding for capturing non-local electron correlation in solids	Jiachen Li et.al.	2406.07531v1	null
2024-06-11	Coherent Three-Photon Excitation of the Strontium Clock Transition	Junyu He et.al.	2406.07530v1	null
2024-06-11	MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation	Lu Li et.al.	2406.07529v1	null
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528v1	link
2024-06-11	Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity	Ozgur Akarsu et.al.	2406.07526v1	null
2024-06-11	Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions	Haibo Wang et.al.	2406.07525v1	null
2024-06-11	Simple and Effective Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2406.07524v1	link
2024-06-11	Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Liliang Ren et.al.	2406.07522v1	null
2024-06-11	Faster Spectral Density Estimation and Sparsification in the Nuclear Norm	Yujia Jin et.al.	2406.07521v1	null
2024-06-11	Neural Gaffer: Relighting Any Object via Diffusion	Haian Jin et.al.	2406.07520v1	null

beiyuouo/arxiv-daily

Computer Vision

Object Tracking

Image Classification

Multi-Object Tracking

Object Detection

Image Matching

Semantic Segmentation

Instance Segmentation

Keypoint Detection

3D Vision

Point Cloud Matching

3D Object Tracking

3D Object Detection

Point Cloud Segmentation

Point Cloud Registration

Point Cloud Completion

Point Cloud

Federated Learning

Federated Learning

Framework

Communication

Personalized

Optimization

Privacy

Asynchronous

Dataset

Benchmark

Efficient

Heterogeneous

Few-shot Learning

Few-shot Learning

One-shot Learning

Meta Learning

Transfer Learning

Transfer Learning

Unsupervised Learning

Unsupervised Learning

GAN

Multi-modal

Vision-Language

Image Caption

Multi-modal

VQA

Text and Image Generation

Alignment

Transformer

Transformer

Vision Transformer

Reinforcement Learning

Reinforcement Learning

Robotics

Robotics

SFM

SLAM

Visual Localization

Contrastive Learning

Contrastive Learning

Medical Application

Medical Application

Medical Image Analysis

Medical Multi-modal

Graph Neural Network

Graph Neural Network

Large-Language Model

Large-Language Model

Edge Computing

Privacy

Efficient

Scalability

Performance

Reliability

Trust

Secure