🎓 Automatically update papers in selected fields daily using GitHub Actions (updated every 12 hours)

beiyuouo/arxiv-daily

arxiv-daily

Automated deployment @ 2024-06-13 09:04:46 Asia/Shanghai

Contributions are welcome! Add your topics and keywords in topic.yml. You can also view historical data through the storage.
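As a rough illustration of how a topic and its keywords could be turned into a search, here is a minimal sketch that builds a query URL for the public arXiv API. The `TOPICS` mapping and the `build_query_url` helper are hypothetical names introduced only for this example; the actual schema of topic.yml and the query logic used by this repository are not shown on this page and may differ.

```python
from urllib.parse import urlencode

# Hypothetical topic -> keyword mapping, mirroring the headings on this page.
# The real structure of topic.yml is an assumption and may look different.
TOPICS = {
    "Computer Vision": ["Object Tracking", "Image Classification"],
    "3D Vision": ["Point Cloud Matching"],
}

# Base endpoint of the public arXiv query API.
ARXIV_API = "http://export.arxiv.org/api/query"


def build_query_url(keyword: str, max_results: int = 30) -> str:
    """Return an arXiv API URL for the newest papers matching a quoted phrase."""
    params = {
        "search_query": f'all:"{keyword}"',  # search all fields for the phrase
        "sortBy": "submittedDate",           # newest submissions first
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urlencode(params)}"


if __name__ == "__main__":
    # Print one query URL per keyword; no network request is made here.
    for topic, keywords in TOPICS.items():
        for keyword in keywords:
            print(topic, "|", keyword, "|", build_query_url(keyword))
```

The sketch only constructs URLs; fetching and parsing the Atom feed that the API returns is left out on purpose.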

Computer Vision

Object Tracking

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 A pilot protocol and cohort for the investigation of non-pathological variability in speech Nicholas Cummins et.al. 2406.07497v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior Anming Gu et.al. 2406.07475v1 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472v1 null
2024-06-11 Exploring non-radial oscillation modes in dark matter admixed neutron stars Pratik Thakur et.al. 2406.07470v1 null
2024-06-11 Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control Jacob Thrän et.al. 2406.07454v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Single and merger soliton dynamics in scalar field dark matter with and without self-interactions Matthias Stallovits et.al. 2406.07419v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 Operad of posets 101: The Wixarika posets José Antonio Arciniega-Nevárez et.al. 2406.07370v1 null
2024-06-11 Fast and accurate evaluation of Biot-Savart integrals over spatial curves Juan Ignacio Polanco et.al. 2406.07366v1 null
2024-06-11 Deep Implicit Optimization for Robust and Flexible Image Registration Rohit Jena et.al. 2406.07361v1 null
2024-06-11 Machine Learning approaches to classical density functional theory Alessandro Simon et.al. 2406.07345v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 The representation and computational efficiency of the Tolman-Oppenheimer-Volkoff equations in isotropic coordinates Dániel Barta et.al. 2406.07319v1 null
2024-06-11 A directional total variation minimization algorithm for isotropic resolution in digital breast tomosynthesis Emil Y. Sidky et.al. 2406.07306v1 null
2024-06-11 Shadows, rings and optical appearance of a magnetically charged regular black hole illuminated by various accretion disks Soroush Zare et.al. 2406.07300v1 null
2024-06-11 Multi-objective Reinforcement learning from AI Feedback Marcus Williams et.al. 2406.07295v2 null
2024-06-11 Joint Learning of Context and Feedback Embeddings in Spoken Dialogue Livia Qian et.al. 2406.07291v1 null
2024-06-11 Unsupervised Object Detection with Theoretical Guarantees Marian Longa et.al. 2406.07284v1 null

Image Classification

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing Bhaskar Gaur et.al. 2406.07486v1 null
2024-06-11 Image Neural Field Diffusion Models Yinbo Chen et.al. 2406.07480v1 null
2024-06-11 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei et.al. 2406.07456v1 link
2024-06-11 An Optimism-based Approach to Online Evaluation of Generative Models Xiaoyan Hu et.al. 2406.07451v1 null
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450v1 link
2024-06-11 Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration Shashank Agnihotri et.al. 2406.07435v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Learning Domain-Invariant Features for Out-of-Context News Detection Yimeng Gu et.al. 2406.07430v1 null
2024-06-11 DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses Abdurrahim Yilmaz et.al. 2406.07426v1 null
2024-06-11 MINERS: Multilingual Language Models as Semantic Retrievers Genta Indra Winata et.al. 2406.07424v1 null
2024-06-11 Holistic Memory Diversification for Incremental Learning in Growing Graphs Ziyue Qiao et.al. 2406.07413v1 null

Multi-Object Tracking

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 A pilot protocol and cohort for the investigation of non-pathological variability in speech Nicholas Cummins et.al. 2406.07497v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior Anming Gu et.al. 2406.07475v1 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472v1 null
2024-06-11 Exploring non-radial oscillation modes in dark matter admixed neutron stars Pratik Thakur et.al. 2406.07470v1 null
2024-06-11 Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control Jacob Thrän et.al. 2406.07454v1 null
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450v1 link
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Single and merger soliton dynamics in scalar field dark matter with and without self-interactions Matthias Stallovits et.al. 2406.07419v1 null
2024-06-11 Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization Weiliang Zhang et.al. 2406.07418v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling Sixian Wang et.al. 2406.07390v1 null
2024-06-11 Operad of posets 101: The Wixarika posets José Antonio Arciniega-Nevárez et.al. 2406.07370v1 null

Object Detection

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System SBND Collaboration et.al. 2406.07514v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum N. -O. Stutzer et.al. 2406.07511v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 A pilot protocol and cohort for the investigation of non-pathological variability in speech Nicholas Cummins et.al. 2406.07497v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 Neutrino magnetic dipole portal with low energy neutrino nucleus scattering data Ying-Ying Li et.al. 2406.07477v1 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472v1 null
2024-06-11 Exploring non-radial oscillation modes in dark matter admixed neutron stars Pratik Thakur et.al. 2406.07470v1 null
2024-06-11 Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control Jacob Thrän et.al. 2406.07454v1 null
2024-06-11 Search for photons above 10$^{18}$ eV by simultaneously measuring the atmospheric depth and the muon content of air showers at the Pierre Auger Observatory The Pierre Auger Collaboration et.al. 2406.07439v1 null
2024-06-11 Single and merger soliton dynamics in scalar field dark matter with and without self-interactions Matthias Stallovits et.al. 2406.07419v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 Operad of posets 101: The Wixarika posets José Antonio Arciniega-Nevárez et.al. 2406.07370v1 null
2024-06-11 Fast and accurate evaluation of Biot-Savart integrals over spatial curves Juan Ignacio Polanco et.al. 2406.07366v1 null
2024-06-11 Deep Implicit Optimization for Robust and Flexible Image Registration Rohit Jena et.al. 2406.07361v1 null
2024-06-11 Machine Learning approaches to classical density functional theory Alessandro Simon et.al. 2406.07345v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 The representation and computational efficiency of the Tolman-Oppenheimer-Volkoff equations in isotropic coordinates Dániel Barta et.al. 2406.07319v1 null
2024-06-11 A directional total variation minimization algorithm for isotropic resolution in digital breast tomosynthesis Emil Y. Sidky et.al. 2406.07306v1 null
2024-06-11 Shadows, rings and optical appearance of a magnetically charged regular black hole illuminated by various accretion disks Soroush Zare et.al. 2406.07300v1 null
2024-06-11 Multi-objective Reinforcement learning from AI Feedback Marcus Williams et.al. 2406.07295v2 null
2024-06-11 Joint Learning of Context and Feedback Embeddings in Spoken Dialogue Livia Qian et.al. 2406.07291v1 null
2024-06-11 Unsupervised Object Detection with Theoretical Guarantees Marian Longa et.al. 2406.07284v1 null

Image Matching

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing Bhaskar Gaur et.al. 2406.07486v1 null
2024-06-11 Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing Mao Li et.al. 2406.07483v1 null
2024-06-11 Image Neural Field Diffusion Models Yinbo Chen et.al. 2406.07480v1 null
2024-06-11 Resummation of Multi-Stress Tensors in Higher Dimensions Kuo-Wei Huang et.al. 2406.07458v1 null
2024-06-11 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei et.al. 2406.07456v1 link
2024-06-11 An Optimism-based Approach to Online Evaluation of Generative Models Xiaoyan Hu et.al. 2406.07451v1 null
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450v1 link
2024-06-11 Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration Shashank Agnihotri et.al. 2406.07435v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Learning Domain-Invariant Features for Out-of-Context News Detection Yimeng Gu et.al. 2406.07430v1 null
2024-06-11 DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses Abdurrahim Yilmaz et.al. 2406.07426v1 null
2024-06-11 Optimal Marital Strategies: How Couples Develop Successful Interaction Styles Micah Henson et.al. 2406.07403v1 null

Semantic Segmentation

Publish Date Title Authors PDF Code
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Textual Similarity as a Key Metric in Machine Translation Quality Estimation Kun Sun et.al. 2406.07440v1 null
2024-06-11 MINERS: Multilingual Language Models as Semantic Retrievers Genta Indra Winata et.al. 2406.07424v1 null
2024-06-11 Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech Yin-Long Liu et.al. 2406.07410v1 null
2024-06-11 Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance Ruxin Zheng et.al. 2406.07399v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 A Banach space whose set of norm-attaining functionals is algebraically trivial Miguel Martin et.al. 2406.07273v1 null
2024-06-11 Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation Jinyuan Li et.al. 2406.07268v1 null
2024-06-11 Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning Zhiyu Shao et.al. 2406.07213v1 link
2024-06-11 Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation Diwei Sheng et.al. 2406.07202v1 null
2024-06-11 Target Speech Diarization with Multimodal Prompts Yidi Jiang et.al. 2406.07198v1 null
2024-06-11 RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker Yunfeng Li et.al. 2406.07189v1 link
2024-06-11 TernaryLLM: Ternarized Large Language Model Tianqi Chen et.al. 2406.07177v1 null
2024-06-11 ULog: Unsupervised Log Parsing with Large Language Models through Log Contrastive Units Junjie Huang et.al. 2406.07174v1 null
2024-06-11 FaceGPT: Self-supervised Learning to Chat about 3D Human Faces Haoran Wang et.al. 2406.07163v1 null
2024-06-11 EEG classification for visual brain decoding with spatio-temporal and transformer based paradigms Akanksha Sharma et.al. 2406.07153v1 null
2024-06-11 Translating speech with just images Dan Oneata et.al. 2406.07133v1 null
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113v1 null
2024-06-11 AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding Xing Zhang et.al. 2406.07091v1 null
2024-06-11 CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation Zhongzhen Huang et.al. 2406.07085v1 null
2024-06-11 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation Mingqi Gao et.al. 2406.07043v1 link
2024-06-11 EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network Yining Shi et.al. 2406.07042v1 link
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037v1 null
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032v1 null
2024-06-11 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023v2 null
2024-06-11 Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models Sooyeon Go et.al. 2406.07008v1 null

Instance Segmentation

Publish Date Title Authors PDF Code
2024-06-11 Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang et.al. 2406.07455v1 null
2024-06-11 On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations Shiao Meng et.al. 2406.07444v1 null
2024-06-11 Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech Yin-Long Liu et.al. 2406.07410v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering Longlong Lin et.al. 2406.07357v1 null
2024-06-11 The Theory of Intrinsic Time: A Primer James B. Glattfelder et.al. 2406.07354v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 A Banach space whose set of norm-attaining functionals is algebraically trivial Miguel Martin et.al. 2406.07273v1 null
2024-06-11 Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation Jinyuan Li et.al. 2406.07268v1 null
2024-06-11 Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation Diwei Sheng et.al. 2406.07202v1 null
2024-06-11 Quantum repeaters based on stationary Gottesman-Kitaev-Preskill qubits Stefan Häussler et.al. 2406.07158v1 null
2024-06-11 Scaling Large-Language-Model-based Multi-Agent Collaboration Chen Qian et.al. 2406.07155v1 link
2024-06-11 CHARME: A chain-based reinforcement learning approach for the minor embedding problem Hoang M. Ngo et.al. 2406.07124v1 null
2024-06-11 The Treatment of Ties in Rank-Biased Overlap Matteo Corsi et.al. 2406.07121v1 null
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113v1 null
2024-06-11 Large amplitude quasi-periodic traveling waves in two dimensional forced rotating fluids Roberta Bianchini et.al. 2406.07099v1 null
2024-06-11 Edge Rendering Architecture for multiuser XR Experiences and E2E Performance Assessment Inhar Yeregui et.al. 2406.07087v1 null
2024-06-11 CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation Zhongzhen Huang et.al. 2406.07085v1 null
2024-06-11 Triage of 3D pathology data via 2.5D multiple-instance learning to guide pathologist assessments Gan Gao et.al. 2406.07061v1 link
2024-06-11 Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study Yichi Zhang et.al. 2406.07057v1 null
2024-06-11 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation Mingqi Gao et.al. 2406.07043v1 link
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037v1 null
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032v1 null
2024-06-11 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023v2 null
2024-06-11 Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples Kailas Dayanandan et.al. 2406.06967v1 link
2024-06-11 Distributional MIPLIB: a Multi-Domain Library for Advancing ML-Guided MILP Methods Weimin Huang et.al. 2406.06954v1 null
2024-06-11 Sparse Bayesian Networks: Efficient Uncertainty Quantification in Medical Image Analysis Zeinab Abboud et.al. 2406.06946v1 null
2024-06-11 UVIS: Unsupervised Video Instance Segmentation Shuaiyi Huang et.al. 2406.06908v1 null
2024-06-11 Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots Xiang Zhi Tan et.al. 2406.06904v1 null
2024-06-11 Universal spatial inflation of human mobility Lu Zhong et.al. 2406.06889v1 null

Keypoint Detection

Publish Date Title Authors PDF Code
2024-06-11 Differentiability and Optimization of Multiparameter Persistent Homology Luis Scoccola et.al. 2406.07224v1 null
2024-06-10 Relative descriptors for quantum agents David Möckli et.al. 2406.06719v1 null
2024-06-08 Unsupervised learning of Data-driven Facial Expression Coding System (DFECS) using keypoint tracking Shivansh Chandra Tripathi et.al. 2406.05434v1 null
2024-06-07 Expected Lipschitz-Killing curvatures for spin random fields and other non-isotropic fields Francesca Pistolato et.al. 2406.04850v1 null
2024-06-07 LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model Dongkai Wang et.al. 2406.04659v1 link
2024-06-06 Monocular Localization with Semantics Map for Autonomous Vehicles Jixiang Wan et.al. 2406.03835v1 null
2024-06-05 Image Copy-Move Forgery Detection and Localization Scheme: How to Avoid Missed Detection and False Alarm Li Jiang et.al. 2406.03271v1 null
2024-06-05 Topological Neural Networks go Persistent, Equivariant, and Continuous Yogesh Verma et.al. 2406.03164v1 null
2024-06-05 How precisely are solute clusters in RPV steels characterized by atom probe experiments? N. Castin et.al. 2406.02973v1 null
2024-06-05 Homotopic Path Set Planning for Robot Manipulation and Navigation Jing Huang et.al. 2406.02885v1 link
2024-06-05 Controllable Talking Face Generation by Implicit Facial Keypoints Editing Dong Zhao et.al. 2406.02880v1 null
2024-06-04 Machine learning Hubbard parameters with equivariant neural networks Martin Uhrin et.al. 2406.02457v1 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315v1 link
2024-06-03 MoFormer: Multi-objective Antimicrobial Peptide Generation Based on Conditional Transformer Joint Multi-modal Fusion Descriptor Li Wang et.al. 2406.02610v1 null
2024-06-02 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676v1 null
2024-06-02 SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection Yun Peng et.al. 2406.00625v2 null
2024-06-01 CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation Matan Rusanovsky et.al. 2406.00384v1 link
2024-05-31 Learning from metastable grain boundaries Avanish Mishra et.al. 2406.00204v1 null
2024-05-30 Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach Muhammad Saif Ullah Khan et.al. 2405.20084v1 null
2024-05-30 KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation Fengyuan Yang et.al. 2405.19833v1 link
2024-05-30 Automatic Dance Video Segmentation for Understanding Choreography Koki Endo et.al. 2405.19727v1 null
2024-05-30 SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations Yujiao Jiang et.al. 2405.19609v1 null
2024-05-29 SDPRLayers: Certifiable Backpropagation Through Polynomial Optimization Problems in Robotics Connor Holmes et.al. 2405.19309v1 null
2024-05-29 Greedy Kernel Methods for Approximating Breakthrough Curves for Reactive Flow from 3D Porous Geometry Data Robin Herkert et.al. 2405.19170v1 null
2024-05-29 PointNetPGAP-SLC: A 3D LiDAR-based Place Recognition Approach with Segment-level Consistency Training for Mobile Robots in Horticulture T. Barros et.al. 2405.19038v1 link
2024-05-29 Classification analysis of transition-metal chalcogenides and oxides using quantum machine learning Kurudi V Vedavyasa et.al. 2405.18989v1 null
2024-05-29 Diffeomorphic interpolation for efficient persistence-based topological optimization Mathieu Carriere et.al. 2405.18820v1 null
2024-05-28 Temperature-Dependent Chirality in Halide Perovskites Mike Pols et.al. 2405.18643v1 null
2024-05-28 What can machine learning help with microstructure-informed materials modeling and design? Xiang-Long Peng et.al. 2405.18396v1 null
2024-05-28 Relational Self-supervised Distillation with Compact Descriptors for Image Copy Detection Juntae Kim et.al. 2405.17928v3 null

3D Vision

Point Cloud Matching

Publish Date Title Authors PDF Code
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing Mao Li et.al. 2406.07483v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 Resummation of Multi-Stress Tensors in Higher Dimensions Kuo-Wei Huang et.al. 2406.07458v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup Takahiro Ueda et.al. 2406.07427v1 null
2024-06-11 Adic curves: stable reduction, skeletons and metric structure Katharina Hübner et.al. 2406.07414v1 null
2024-06-11 Private Geometric Median Mahdi Haghifam et.al. 2406.07407v1 null
2024-06-11 Optimal Marital Strategies: How Couples Develop Successful Interaction Styles Micah Henson et.al. 2406.07403v1 null
2024-06-11 Disrupting Bipartite Trading Networks: Matching for Revenue Maximization Luca D'Amico-Wong et.al. 2406.07385v1 null
2024-06-11 Closing the Computational-Query Depth Gap in Parallel Stochastic Convex Optimization Arun Jambulapati et.al. 2406.07373v1 null
2024-06-11 Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories An-Yi Huang et.al. 2406.07341v1 null
2024-06-11 Searching for gravitational waves from stellar-mass binary black holes early inspiral Xue-Ting Zhang et.al. 2406.07336v1 null
2024-06-11 Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold Mrinmoy Datta et.al. 2406.07326v1 null
2024-06-11 Lyapunov equations: a (fixed) point of view Richard Pates et.al. 2406.07324v1 null
2024-06-11 Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs Kamil Jeziorek et.al. 2406.07318v1 null
2024-06-11 Morse Index Stability for the Ginzburg-Landau Approximation Francesca Da Lio et.al. 2406.07317v1 null
2024-06-11 Sum the Probabilities to $m$ and Stop Zakaria Derbazi et.al. 2406.07283v1 null
2024-06-11 Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport Ross Irwin et.al. 2406.07266v1 null
2024-06-11 Coupled-channel $J^{--}$ meson resonances from lattice QCD Jozef J. Dudek et.al. 2406.07261v1 null
2024-06-11 Hybrid Reinforcement Learning from Offline Observation Alone Yuda Song et.al. 2406.07253v1 null

3D Object Tracking

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System SBND Collaboration et.al. 2406.07514v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum N. -O. Stutzer et.al. 2406.07511v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 A pilot protocol and cohort for the investigation of non-pathological variability in speech Nicholas Cummins et.al. 2406.07497v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior Anming Gu et.al. 2406.07475v1 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472v1 null
2024-06-11 Exploring non-radial oscillation modes in dark matter admixed neutron stars Pratik Thakur et.al. 2406.07470v1 null
2024-06-11 Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains Kush Kinra et.al. 2406.07460v1 null
2024-06-11 Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control Jacob Thrän et.al. 2406.07454v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Single and merger soliton dynamics in scalar field dark matter with and without self-interactions Matthias Stallovits et.al. 2406.07419v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 Operad of posets 101: The Wixarika posets José Antonio Arciniega-Nevárez et.al. 2406.07370v1 null

3D Object Detection

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System SBND Collaboration et.al. 2406.07514v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum N. -O. Stutzer et.al. 2406.07511v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 A pilot protocol and cohort for the investigation of non-pathological variability in speech Nicholas Cummins et.al. 2406.07497v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 Prospects for the detection of Dark Matter with Long-lived Mediators in the Sun using the Southern Wide-field Gamma-ray Observatory Micael Andrade et.al. 2406.07489v1 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing Mao Li et.al. 2406.07483v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 Neutrino magnetic dipole portal with low energy neutrino nucleus scattering data Ying-Ying Li et.al. 2406.07477v1 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472v1 null
2024-06-11 Exploring non-radial oscillation modes in dark matter admixed neutron stars Pratik Thakur et.al. 2406.07470v1 null
2024-06-11 Anomaly Detection on Unstable Logs with GPT Models Fatemeh Hadadi et.al. 2406.07467v1 null
2024-06-11 Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains Kush Kinra et.al. 2406.07460v1 null
2024-06-11 Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control Jacob Thrän et.al. 2406.07454v1 null

Point Cloud Segmentation

Publish Date Title Authors PDF Code
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 Resummation of Multi-Stress Tensors in Higher Dimensions Kuo-Wei Huang et.al. 2406.07458v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup Takahiro Ueda et.al. 2406.07427v1 null
2024-06-11 Adic curves: stable reduction, skeletons and metric structure Katharina Hübner et.al. 2406.07414v1 null
2024-06-11 Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech Yin-Long Liu et.al. 2406.07410v1 null
2024-06-11 Private Geometric Median Mahdi Haghifam et.al. 2406.07407v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories An-Yi Huang et.al. 2406.07341v1 null
2024-06-11 Searching for gravitational waves from stellar-mass binary black holes early inspiral Xue-Ting Zhang et.al. 2406.07336v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold Mrinmoy Datta et.al. 2406.07326v1 null
2024-06-11 Lyapunov equations: a (fixed) point of view Richard Pates et.al. 2406.07324v1 null
2024-06-11 Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs Kamil Jeziorek et.al. 2406.07318v1 null
2024-06-11 Morse Index Stability for the Ginzburg-Landau Approximation Francesca Da Lio et.al. 2406.07317v1 null
2024-06-11 Sum the Probabilities to $m$ and Stop Zakaria Derbazi et.al. 2406.07283v1 null
2024-06-11 A Banach space whose set of norm-attaining functionals is algebraically trivial Miguel Martin et.al. 2406.07273v1 null
2024-06-11 Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation Jinyuan Li et.al. 2406.07268v1 null
2024-06-11 Coupled-channel $J^{--}$ meson resonances from lattice QCD Jozef J. Dudek et.al. 2406.07261v1 null
2024-06-11 Even dimensional Fermat cubics are rational over any field Alex Massarenti et.al. 2406.07223v1 null
2024-06-11 Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation Diwei Sheng et.al. 2406.07202v1 null
2024-06-11 A Multi-step Approach for Minimizing Risk in Decentralized Exchanges Daniele Maria Di Nosse et.al. 2406.07200v2 null
2024-06-11 TernaryLLM: Ternarized Large Language Model Tianqi Chen et.al. 2406.07177v1 null

Point Cloud Registration

Publish Date Title Authors PDF Code
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 Resummation of Multi-Stress Tensors in Higher Dimensions Kuo-Wei Huang et.al. 2406.07458v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup Takahiro Ueda et.al. 2406.07427v1 null
2024-06-11 Adic curves: stable reduction, skeletons and metric structure Katharina Hübner et.al. 2406.07414v1 null
2024-06-11 Private Geometric Median Mahdi Haghifam et.al. 2406.07407v1 null
2024-06-11 Deep Implicit Optimization for Robust and Flexible Image Registration Rohit Jena et.al. 2406.07361v1 null
2024-06-11 Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories An-Yi Huang et.al. 2406.07341v1 null
2024-06-11 Searching for gravitational waves from stellar-mass binary black holes early inspiral Xue-Ting Zhang et.al. 2406.07336v1 null
2024-06-11 Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold Mrinmoy Datta et.al. 2406.07326v1 null
2024-06-11 Lyapunov equations: a (fixed) point of view Richard Pates et.al. 2406.07324v1 null
2024-06-11 Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs Kamil Jeziorek et.al. 2406.07318v1 null
2024-06-11 Morse Index Stability for the Ginzburg-Landau Approximation Francesca Da Lio et.al. 2406.07317v1 null
2024-06-11 Sum the Probabilities to $m$ and Stop Zakaria Derbazi et.al. 2406.07283v1 null
2024-06-11 Coupled-channel $J^{--}$ meson resonances from lattice QCD Jozef J. Dudek et.al. 2406.07261v1 null
2024-06-11 Even dimensional Fermat cubics are rational over any field Alex Massarenti et.al. 2406.07223v1 null
2024-06-11 A Multi-step Approach for Minimizing Risk in Decentralized Exchanges Daniele Maria Di Nosse et.al. 2406.07200v2 null
2024-06-11 TernaryLLM: Ternarized Large Language Model Tianqi Chen et.al. 2406.07177v1 null
2024-06-11 Ultrametric-preserving functions as monoid endomorphisms Oleksiy Dovgoshey et.al. 2406.07166v2 null
2024-06-11 ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators Jun Yin et.al. 2406.07161v1 null
2024-06-11 Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO Ali Elkeshawy et.al. 2406.07160v1 null
2024-06-11 A portrait of the rotation of Ultra-Cool Dwarfs revealed by TESS D. O. Fontinele et.al. 2406.07154v1 null
2024-06-11 High-performance in-vacuum optical system for quantum optics experiments in a Penning-trap Joaquín Berrocal et.al. 2406.07152v1 null

Point Cloud Completion

Publish Date Title Authors PDF Code
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 Uniqueness on average of large isoperimetric sets in noncompact manifolds with nonnegative Ricci curvature Gioacchino Antonelli et.al. 2406.07509v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 Resummation of Multi-Stress Tensors in Higher Dimensions Kuo-Wei Huang et.al. 2406.07458v1 null
2024-06-11 McEval: Massively Multilingual Code Evaluation Linzheng Chai et.al. 2406.07436v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup Takahiro Ueda et.al. 2406.07427v1 null
2024-06-11 Adic curves: stable reduction, skeletons and metric structure Katharina Hübner et.al. 2406.07414v1 null
2024-06-11 VersiCode: Towards Version-controllable Code Generation Tongtong Wu et.al. 2406.07411v1 null
2024-06-11 Private Geometric Median Mahdi Haghifam et.al. 2406.07407v1 null
2024-06-11 Limited Out-of-Context Knowledge Reasoning in Large Language Models Peng Hu et.al. 2406.07393v1 null
2024-06-11 A mechanical qubit Yu Yang et.al. 2406.07360v1 null
2024-06-11 Finite $W$-algebra invariants via Lax type operators Jonathan S. Brown et.al. 2406.07350v1 null
2024-06-11 Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories An-Yi Huang et.al. 2406.07341v1 null
2024-06-11 Searching for gravitational waves from stellar-mass binary black holes early inspiral Xue-Ting Zhang et.al. 2406.07336v1 null
2024-06-11 Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold Mrinmoy Datta et.al. 2406.07326v1 null
2024-06-11 Lyapunov equations: a (fixed) point of view Richard Pates et.al. 2406.07324v1 null
2024-06-11 Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs Kamil Jeziorek et.al. 2406.07318v1 null
2024-06-11 Morse Index Stability for the Ginzburg-Landau Approximation Francesca Da Lio et.al. 2406.07317v1 null
2024-06-11 Sum the Probabilities to $m$ and Stop Zakaria Derbazi et.al. 2406.07283v1 null
2024-06-11 Coupled-channel $J^{--}$ meson resonances from lattice QCD Jozef J. Dudek et.al. 2406.07261v1 null
2024-06-11 Hybrid Reinforcement Learning from Offline Observation Alone Yuda Song et.al. 2406.07253v1 null
2024-06-11 Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Tomoya Nishida et.al. 2406.07250v1 null
2024-06-11 Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces Salvatore Federico et.al. 2406.07242v1 null

Point Cloud

Publish Date Title Authors PDF Code
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds Gabriella Tarantello et.al. 2406.07518v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 Resummation of Multi-Stress Tensors in Higher Dimensions Kuo-Wei Huang et.al. 2406.07458v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup Takahiro Ueda et.al. 2406.07427v1 null
2024-06-11 Adic curves: stable reduction, skeletons and metric structure Katharina Hübner et.al. 2406.07414v1 null
2024-06-11 Private Geometric Median Mahdi Haghifam et.al. 2406.07407v1 null
2024-06-11 Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories An-Yi Huang et.al. 2406.07341v1 null
2024-06-11 Searching for gravitational waves from stellar-mass binary black holes early inspiral Xue-Ting Zhang et.al. 2406.07336v1 null
2024-06-11 Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold Mrinmoy Datta et.al. 2406.07326v1 null
2024-06-11 Lyapunov equations: a (fixed) point of view Richard Pates et.al. 2406.07324v1 null
2024-06-11 Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs Kamil Jeziorek et.al. 2406.07318v1 null
2024-06-11 Morse Index Stability for the Ginzburg-Landau Approximation Francesca Da Lio et.al. 2406.07317v1 null
2024-06-11 Sum the Probabilities to $m$ and Stop Zakaria Derbazi et.al. 2406.07283v1 null
2024-06-11 Coupled-channel $J^{--}$ meson resonances from lattice QCD Jozef J. Dudek et.al. 2406.07261v1 null
2024-06-11 Even dimensional Fermat cubics are rational over any field Alex Massarenti et.al. 2406.07223v1 null
2024-06-11 A Multi-step Approach for Minimizing Risk in Decentralized Exchanges Daniele Maria Di Nosse et.al. 2406.07200v2 null
2024-06-11 TernaryLLM: Ternarized Large Language Model Tianqi Chen et.al. 2406.07177v1 null
2024-06-11 Ultrametric-preserving functions as monoid endomorphisms Oleksiy Dovgoshey et.al. 2406.07166v2 null
2024-06-11 ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators Jun Yin et.al. 2406.07161v1 null
2024-06-11 Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO Ali Elkeshawy et.al. 2406.07160v1 null
2024-06-11 A portrait of the rotation of Ultra-Cool Dwarfs revealed by TESS D. O. Fontinele et.al. 2406.07154v1 null
2024-06-11 High-performance in-vacuum optical system for quantum optics experiments in a Penning-trap Joaquín Berrocal et.al. 2406.07152v1 null
2024-06-11 Partial yet definite emergence of the Kardar-Parisi-Zhang class in isotropic spin chains Kazumasa A. Takeuchi et.al. 2406.07150v1 null

Federated Learning

Federated Learning

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale William Giarè et.al. 2406.07493v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 The end of multiple choice tests: using AI to enhance assessment Michael Klymkowsky et.al. 2406.07481v1 null

Framework

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Communication

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Personalized

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Should XAI Nudge Human Decisions with Explanation Biasing? Yosuke Fukuchi et.al. 2406.07323v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-11 A Synthetic Dataset for Personal Attribute Inference Hanna Yukhymenko et.al. 2406.07217v1 null
2024-06-11 MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance X. Wang et.al. 2406.07209v1 link
2024-06-11 Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation Diwei Sheng et.al. 2406.07202v1 null
2024-06-11 Unlocking the Potential of the Metaverse for Innovative and Immersive Digital Care Fatemeh Ebrahimzadeh et.al. 2406.07114v1 null
2024-06-11 A Neck Orthosis with Multi-Directional Variable Stiffness for Persons with Dropped Head Syndrome Santiago Price Torrendell et.al. 2406.07074v1 null
2024-06-11 pVACview: an interactive visualization tool for efficient neoantigen prioritization and selection Huiming Xia et.al. 2406.06985v1 null
2024-06-11 Non-autoregressive Personalized Bundle Generation Wenchuan Yang et.al. 2406.06925v1 null
2024-06-11 Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots Xiang Zhi Tan et.al. 2406.06904v1 null
2024-06-10 Personalized Binomial DAGs Learning with Network Structured Covariates Boxin Zhao et.al. 2406.06829v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Video-based Exercise Classification and Activated Muscle Group Prediction with Hybrid X3D-SlowFast Network Manvik Pasula et.al. 2406.06703v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Towards a Personal Health Large Language Model Justin Cosentino et.al. 2406.06474v1 null
2024-06-10 Transforming Wearable Data into Health Insights using Large Language Model Agents Mike A. Merrill et.al. 2406.06464v2 null
2024-06-10 Estimating Heterogeneous Treatment Effects by Combining Weak Instruments and Observational Data Miruna Oprescu et.al. 2406.06452v1 link
2024-06-10 Biomarker-Guided Adaptive Enrichment Design with Threshold Detection for Clinical Trials with Time-to-Event Outcome Kaiyuan Hua et.al. 2406.06426v1 null
2024-06-10 Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models Marek Wodzinski et.al. 2406.06372v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Human Gaze and Head Rotation during Navigation, Exploration and Object Manipulation in Shared Environments with Robots Tim Schreiter et.al. 2406.06300v1 null
2024-06-10 Tuning-Free Visual Customization via View Iterative Self-Attention Control Xiaojie Li et.al. 2406.06258v2 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Link Prediction in Bipartite Networks Şükrü Demir İnan Özer et.al. 2406.06658v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null

Optimization

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Privacy

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Asynchronous

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Dataset

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Benchmark

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Efficient

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Heterogeneous

Publish Date Title Authors PDF Code
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Hamed Babaei Giglou et.al. 2406.07257v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-10 Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints T. Tony Cai et.al. 2406.06755v1 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749v1 null
2024-06-10 Decentralized Personalized Federated Learning Salma Kharrat et.al. 2406.06520v1 null
2024-06-10 Optimisation of federated learning settings under statistical heterogeneity variations Basem Suleiman et.al. 2406.06340v1 null
2024-06-10 Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Xiaoting Lyu et.al. 2406.06207v1 null
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202v1 null
2024-06-10 Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm Ahmed Elbakary et.al. 2406.06655v1 null
2024-06-10 Federated Machine Reasoning for Resource Provisioning in 6G O-RAN Swastika Roy et.al. 2406.06128v1 null
2024-06-09 Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" Mahtab Talaei et.al. 2406.05858v1 null
2024-06-08 Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey Shinu M. Rajagopal et.al. 2406.05517v1 null
2024-06-08 PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System Wei Yuan et.al. 2406.05387v1 null
2024-06-07 Federated LoRA with Sparse Communication Kevin Kuo et.al. 2406.05233v1 null
2024-06-07 The Russian Legislative Corpus Denis Saveliev et.al. 2406.04855v1 link
2024-06-07 FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models Rui Ye et.al. 2406.04845v1 link
2024-06-07 When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain Lei Xu et.al. 2406.04743v1 null
2024-06-07 Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems Zhen Cai et.al. 2406.04702v1 null
2024-06-07 Federated Representation Learning in the Under-Parameterized Regime Renpu Liu et.al. 2406.04596v3 null
2024-06-06 Data Measurements for Decentralized Data Markets Charles Lu et.al. 2406.04257v1 null
2024-06-06 R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients Tamer Ahmed Eltaras et.al. 2406.04227v1 null
2024-06-06 Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning Xuhan Zuo et.al. 2406.04076v1 null
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933v1 link
2024-06-06 1-D CNN-Based Online Signature Verification with Federated Learning Lingfeng Zhang et.al. 2406.06597v1 null
2024-06-06 Stochastic Dynamic Network Utility Maximization with Application to Disaster Response Anna Scaglione et.al. 2406.03750v1 null
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611v1 link
2024-06-05 Fantastyc: Blockchain-based Federated Learning Made Secure and Practical William Boitier et.al. 2406.03608v1 null
2024-06-05 Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning Saber Malekmohammadi et.al. 2406.03519v1 link

Few-shot Learning

Few-shot Learning

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale William Giarè et.al. 2406.07493v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 The end of multiple choice tests: using AI to enhance assessment Michael Klymkowsky et.al. 2406.07481v1 null

One-shot Learning

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale William Giarè et.al. 2406.07493v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null
2024-06-11 The end of multiple choice tests: using AI to enhance assessment Michael Klymkowsky et.al. 2406.07481v1 null

Meta Learning

Publish Date Title Authors PDF Code
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Impact of the nuclear equation of state on the formation of twin stars Nai-Bo Zhang et.al. 2406.07396v1 null
2024-06-11 Fast Adaptive Meta-Heuristic for Large-Scale Facility Location Problem Bahram Alidaee et.al. 2406.07382v1 null
2024-06-11 A generic and robust quantum agent inspired by deep meta-reinforcement learning Zibo Miao et.al. 2406.07225v1 null
2024-06-11 Agnostic Sharpness-Aware Minimization Van-Anh Nguyen et.al. 2406.07107v2 null
2024-06-11 Meta-Backscatter: A New ISAC Paradigm for Battery-Free Internet of Things Xu Liu et.al. 2406.07077v1 null
2024-06-11 HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation Wen Luo et.al. 2406.07070v1 null
2024-06-11 Fairness-Aware Meta-Learning via Nash Bargaining Yi Zeng et.al. 2406.07029v1 null
2024-06-11 Ollabench: Evaluating LLMs' Reasoning for Human-centric Interdependent Cybersecurity Tam n. Nguyen et.al. 2406.06863v1 link
2024-06-10 Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness Dingrong Wang et.al. 2406.06792v1 link
2024-06-10 Meta Learning Text-to-Speech Synthesis in over 7000 Languages Florian Lux et.al. 2406.06403v1 link
2024-06-10 Characteristics and Energy Flux Distributions of Decayless Transverse Oscillations Depending on Coronal Regions Daye Lim et.al. 2406.06368v1 null
2024-06-10 Data Augmentation in Earth Observation: A Diffusion Model Approach Tiago Sousa et.al. 2406.06218v1 null
2024-06-10 Causality-inspired Latent Feature Augmentation for Single Domain Generalization Jian Xu et.al. 2406.05980v1 null
2024-06-10 Data Caching for Enterprise-Grade Petabyte-Scale OLAP Chunxu Tang et.al. 2406.05962v1 null
2024-06-09 Async Learned User Embeddings for Ads Delivery Optimization Mingwei Tang et.al. 2406.05898v1 null
2024-06-09 Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach Georgios Tsoumplekas et.al. 2406.05887v1 null
2024-06-08 Synergizing Deep Learning and Phase Change Materials for Four-state Broadband Multifunctional Metasurfaces in the Visible Range Md. Ehsanul Karim et.al. 2406.05519v1 null
2024-06-08 Gradient-based algorithms for multi-objective bi-level optimization Xinmin Yang et.al. 2406.05455v1 null
2024-06-08 A Survey of Meta-features Used for Automated Selection of Algorithms for Black-box Single-objective Continuous Optimization Gjorgjina Cenikj et.al. 2406.06629v1 null
2024-06-08 Large Language Model Assisted Adversarial Robustness Neural Architecture Search Rui Zhong et.al. 2406.05433v1 link
2024-06-07 Massively Multiagent Minigames for Training Generalist Agents Kyoung Whan Choe et.al. 2406.05071v1 link
2024-06-07 Scenarios and Approaches for Situated Natural Language Explanations Pengshuo Qiu et.al. 2406.05035v1 null
2024-06-07 Unraveling Trace Anomaly of Supradense Matter via Neutron Star Compactness Scaling Bao-Jun Cai et.al. 2406.05025v1 null
2024-06-07 Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs Fan Liu et.al. 2406.06622v1 null
2024-06-07 Cactus-like Metamaterial Structures for Electromagnetically Induced Transparency at THz frequencies Savvas Papamakarios et.al. 2406.04862v1 null
2024-06-07 Black Box Differential Privacy Auditing Using Total Variation Distance Antti Koskela et.al. 2406.04827v1 null
2024-06-07 Graph Mining under Data scarcity Appan Rakaraddi et.al. 2406.04825v2 null
2024-06-07 Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning Xuehui Yu et.al. 2406.04815v1 link
2024-06-07 Cooperative Meta-Learning with Gradient Augmentation Jongyun Shin et.al. 2406.04639v1 link

Transfer Learning

Transfer Learning

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale William Giarè et.al. 2406.07493v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null

Unsupervised Learning

Unsupervised Learning

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale William Giarè et.al. 2406.07493v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null

GAN

Publish Date Title Authors PDF Code
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention Mingshuai Liu et.al. 2406.07498v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks Ted Edward Holmberg et.al. 2406.07473v1 null
2024-06-11 Microbiomes Through The Looking Glass Jacopo Pasqualini et.al. 2406.07465v1 null
2024-06-11 Reconfigurable Intelligent Surfaces in Dynamic Rich Scattering Environments: BiLSTM-Based Optimization for Accurate User Localization Anum Umer et.al. 2406.07463v1 null
2024-06-11 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei et.al. 2406.07456v1 link
2024-06-11 HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms Josse Van Delm et.al. 2406.07453v1 null
2024-06-11 Boosted Conformal Prediction Intervals Ran Xie et.al. 2406.07449v1 null
2024-06-11 Metastability in networks of nonlinear stochastic integrate-and-fire neurons Siddharth Paliwal et.al. 2406.07445v1 null
2024-06-11 DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting Yuxuan Shu et.al. 2406.07438v1 null
2024-06-11 Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration Shashank Agnihotri et.al. 2406.07435v1 null
2024-06-11 Matryoshka Representation Learning for Recommendation Riwei Lai et.al. 2406.07432v1 link
2024-06-11 GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning Tonghan Wang et.al. 2406.07428v1 null
2024-06-11 Graph Reasoning for Explainable Cold Start Recommendation Jibril Frej et.al. 2406.07420v1 null
2024-06-11 Average-exact mixed anomalies and compatible phases Yichen Xu et.al. 2406.07417v1 null
2024-06-11 Holistic Memory Diversification for Incremental Learning in Growing Graphs Ziyue Qiao et.al. 2406.07413v1 null
2024-06-11 Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance Ruxin Zheng et.al. 2406.07399v1 null
2024-06-11 Holographic reconstruction of black hole spacetime: machine learning and entanglement entropy Byoungjoon Ahn et.al. 2406.07395v1 null
2024-06-11 DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling Sixian Wang et.al. 2406.07390v1 null
2024-06-11 Disrupting Bipartite Trading Networks: Matching for Revenue Maximization Luca D'Amico-Wong et.al. 2406.07385v1 null
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 COLoRIS: Localization-agnostic Smart Surfaces Enabling Opportunistic ISAC in 6G Networks Guillermo Encinas-Lago et.al. 2406.07377v1 null
2024-06-11 Decoding planetary surfaces by counting cracks S. Silver et.al. 2406.07376v1 null
2024-06-11 Improving the realism of robotic surgery simulation through injection of learning-based estimated errors Juan Antonio Barragan et.al. 2406.07375v1 null
2024-06-11 Deep Implicit Optimization for Robust and Flexible Image Registration Rohit Jena et.al. 2406.07361v1 null
2024-06-11 Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities Delfina Sol Martinez Pandiani et.al. 2406.07353v1 null
2024-06-11 Stochastic Analysis of Homogeneous Wireless Networks Assisted by Intelligent Reflecting Surfaces Ali H. Abdollahi Bafghi et.al. 2406.07352v1 null

Multi-modal

Vision-Language

Publish Date Title Authors PDF Code

Image Caption

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing Bhaskar Gaur et.al. 2406.07486v1 null
2024-06-11 Image Neural Field Diffusion Models Yinbo Chen et.al. 2406.07480v1 null
2024-06-11 VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Zesen Cheng et.al. 2406.07476v1 link
2024-06-11 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei et.al. 2406.07456v1 link
2024-06-11 An Optimism-based Approach to Online Evaluation of Generative Models Xiaoyan Hu et.al. 2406.07451v1 null
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450v1 link
2024-06-11 Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration Shashank Agnihotri et.al. 2406.07435v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 Learning Domain-Invariant Features for Out-of-Context News Detection Yimeng Gu et.al. 2406.07430v1 null
2024-06-11 DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses Abdurrahim Yilmaz et.al. 2406.07426v1 null
2024-06-11 Persistent currents in mesoscopic spin-orbit coupled rings due to an applied Zeeman field Bijay Kumar Sahoo et.al. 2406.07405v1 null
2024-06-11 Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance Ruxin Zheng et.al. 2406.07399v1 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling Sixian Wang et.al. 2406.07390v1 null

Multi-modal

Publish Date Title Authors PDF Code
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 Multimodal Belief Prediction John Murzaku et.al. 2406.07466v1 null
2024-06-11 World Models with Hints of Large Language Models for Goal Achieving Zeyuan Liu et.al. 2406.07381v1 null
2024-06-11 Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities Delfina Sol Martinez Pandiani et.al. 2406.07353v1 null
2024-06-11 Transferring Knowledge from Large Foundation Models to Small Downstream Models Shikai Qiu et.al. 2406.07337v1 null
2024-06-11 MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting Zhiqi Ai et.al. 2406.07310v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-11 Open-World Human-Object Interaction Detection via Multi-modal Prompts Jie Yang et.al. 2406.07221v1 null
2024-06-11 Target Speech Diarization with Multimodal Prompts Yidi Jiang et.al. 2406.07198v1 null
2024-06-11 RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker Yunfeng Li et.al. 2406.07189v1 link
2024-06-11 Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology Huahui Yi et.al. 2406.07078v1 link
2024-06-11 Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study Yichi Zhang et.al. 2406.07057v1 null
2024-06-11 Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey Ping Liu et.al. 2406.06965v1 null
2024-06-11 Missingness-resilient Video-enhanced Multimodal Disfluency Detection Payal Mohapatra et.al. 2406.06964v1 null
2024-06-11 Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems Mohammed Elhenawy et.al. 2406.06865v1 null
2024-06-10 FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors Jason Wu et.al. 2406.06796v1 link
2024-06-10 BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification June-Woo Kim et.al. 2406.06786v1 null
2024-06-10 MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension Khiem Le et.al. 2406.06777v1 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512v1 null
2024-06-10 AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Zhen Xing et.al. 2406.06465v1 null
2024-06-10 VCR: Visual Caption Restoration Tianyu Zhang et.al. 2406.06462v1 link
2024-06-10 Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Jiwoo Hong et.al. 2406.06424v1 null
2024-06-10 STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics Jiawen Chen et.al. 2406.06393v1 link
2024-06-10 Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization Yi Gu et.al. 2406.06382v1 link
2024-06-10 ASTRA: Aligning Speech and Text Representations for Asr without Sampling Neeraj Gaur et.al. 2406.06664v1 null
2024-06-10 MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing Yu-Fen Huang et.al. 2406.06375v1 link
2024-06-10 A Guide to Stochastic Optimisation for Large-Scale Inverse Problems Matthias J. Ehrhardt et.al. 2406.06342v1 null

VQA

Publish Date Title Authors PDF Code
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale William Giarè et.al. 2406.07493v1 null
2024-06-11 Paraphrasing in Affirmative Terms Improves Negation Understanding MohammadHossein Rezaei et.al. 2406.07492v1 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 PITCH: Productivity and Mental Well-being Coaching through Daily Conversational Interaction Adnan Abbas et.al. 2406.07485v1 null
2024-06-11 The end of multiple choice tests: using AI to enhance assessment Michael Klymkowsky et.al. 2406.07481v1 null
2024-06-11 VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Zesen Cheng et.al. 2406.07476v1 link
2024-06-11 Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior Anming Gu et.al. 2406.07475v1 null
2024-06-11 Estimating the Hallucination Rate of Generative AI Andrew Jesson et.al. 2406.07457v1 null
2024-06-11 Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang et.al. 2406.07455v1 null
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450v1 link
2024-06-11 Constructions of Turán systems that are tight up to a multiplicative constant Oleg Pikhurko et.al. 2406.07443v1 null
2024-06-11 Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling Denis Blessing et.al. 2406.07423v1 link
2024-06-11 Holistic Memory Diversification for Incremental Learning in Growing Graphs Ziyue Qiao et.al. 2406.07413v1 null
2024-06-11 Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control William Murphy et.al. 2406.07400v1 null
2024-06-11 Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance Ruxin Zheng et.al. 2406.07399v1 null
2024-06-11 Limited Out-of-Context Knowledge Reasoning in Large Language Models Peng Hu et.al. 2406.07393v1 null
2024-06-11 AI.vs.Clinician: Unveiling Intricate Interactions Between AI and Clinicians through an Open-Access Database Wanling Gao et.al. 2406.07362v1 null
2024-06-11 Deep Implicit Optimization for Robust and Flexible Image Registration Rohit Jena et.al. 2406.07361v1 null
2024-06-11 PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering Longlong Lin et.al. 2406.07357v1 null
2024-06-11 DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering Zijian Hei et.al. 2406.07348v2 null
2024-06-11 Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling Constantin Waubert de Puiseau et.al. 2406.07325v1 null
2024-06-11 The magic of entangled top quarks Chris D. White et.al. 2406.07321v2 null
2024-06-11 Rethinking the impact of noisy labels in graph classification: A utility and privacy perspective De Li et.al. 2406.07314v1 null
2024-06-11 BertaQA: How Much Do Language Models Know About Local Culture? Julen Etxaniz et.al. 2406.07302v1 link

Text and Image Generation

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 On the potential of probing the neutron star composition in accreting X-ray binaries Kaiser Arf et.al. 2406.07534v1 null
2024-06-11 Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? Ioannis D. Gialamas et.al. 2406.07533v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 Cosmological constraints on $\Lambda_{\rm s}$CDM scenario in a type II minimally modified gravity Ozgur Akarsu et.al. 2406.07526v1 null
2024-06-11 Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions Haibo Wang et.al. 2406.07525v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null

Alignment

Publish Date Title Authors PDF Code
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 Multimodal Belief Prediction John Murzaku et.al. 2406.07466v1 null
2024-06-11 World Models with Hints of Large Language Models for Goal Achieving Zeyuan Liu et.al. 2406.07381v1 null
2024-06-11 Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities Delfina Sol Martinez Pandiani et.al. 2406.07353v1 null
2024-06-11 Transferring Knowledge from Large Foundation Models to Small Downstream Models Shikai Qiu et.al. 2406.07337v1 null
2024-06-11 MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting Zhiqi Ai et.al. 2406.07310v1 null
2024-06-11 Which Country Is This? Automatic Country Ranking of Street View Photos Tim Menzner et.al. 2406.07227v1 null
2024-06-11 Open-World Human-Object Interaction Detection via Multi-modal Prompts Jie Yang et.al. 2406.07221v1 null
2024-06-11 Target Speech Diarization with Multimodal Prompts Yidi Jiang et.al. 2406.07198v1 null
2024-06-11 RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker Yunfeng Li et.al. 2406.07189v1 link
2024-06-11 Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology Huahui Yi et.al. 2406.07078v1 link
2024-06-11 Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study Yichi Zhang et.al. 2406.07057v1 null
2024-06-11 Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey Ping Liu et.al. 2406.06965v1 null
2024-06-11 Missingness-resilient Video-enhanced Multimodal Disfluency Detection Payal Mohapatra et.al. 2406.06964v1 null
2024-06-11 Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems Mohammed Elhenawy et.al. 2406.06865v1 null
2024-06-10 FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors Jason Wu et.al. 2406.06796v1 link
2024-06-10 BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification June-Woo Kim et.al. 2406.06786v1 null
2024-06-10 MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension Khiem Le et.al. 2406.06777v1 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512v1 null
2024-06-10 AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Zhen Xing et.al. 2406.06465v1 null
2024-06-10 VCR: Visual Caption Restoration Tianyu Zhang et.al. 2406.06462v1 link
2024-06-10 Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Jiwoo Hong et.al. 2406.06424v1 null
2024-06-10 STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics Jiawen Chen et.al. 2406.06393v1 link
2024-06-10 Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization Yi Gu et.al. 2406.06382v1 link
2024-06-10 ASTRA: Aligning Speech and Text Representations for Asr without Sampling Neeraj Gaur et.al. 2406.06664v1 null
2024-06-10 MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing Yu-Fen Huang et.al. 2406.06375v1 link
2024-06-10 A Guide to Stochastic Optimisation for Large-Scale Inverse Problems Matthias J. Ehrhardt et.al. 2406.06342v1 null

Transformer

Transformer

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang et.al. 2406.07455v1 null
2024-06-11 Textual Similarity as a Key Metric in Machine Translation Quality Estimation Kun Sun et.al. 2406.07440v1 null
2024-06-11 DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting Yuxuan Shu et.al. 2406.07438v1 null
2024-06-11 Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration Shashank Agnihotri et.al. 2406.07435v1 null
2024-06-11 Making 'syscall' a Privilege not a Right Fangfei Yang et.al. 2406.07429v1 null
2024-06-11 GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning Tonghan Wang et.al. 2406.07428v1 null
2024-06-11 Entropy, slicing problem and functional Mahler's conjecture Matthieu Fradelizi et.al. 2406.07406v1 null
2024-06-11 Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy Xiaohan Huang et.al. 2406.07404v1 null
2024-06-11 A Survey on Recent Random Walk-based Methods for Embedding Knowledge Graphs Elika Bozorgi et.al. 2406.07402v1 null
2024-06-11 Fast and accurate evaluation of Biot-Savart integrals over spatial curves Juan Ignacio Polanco et.al. 2406.07366v1 null
2024-06-11 Chebyshev Approximated Variational Coupled Cluster for Quantum Computing Luca Erhart et.al. 2406.07364v1 null
2024-06-11 Deep Implicit Optimization for Robust and Flexible Image Registration Rohit Jena et.al. 2406.07361v1 null
2024-06-11 Global-Regularized Neighborhood Regression for Efficient Zero-Shot Texture Anomaly Detection Haiming Yao et.al. 2406.07333v1 null
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332v1 null
2024-06-11 Instruct Large Language Models to Drive like Humans Ruijun Zhang et.al. 2406.07296v1 link
2024-06-11 $\mathscr{D}$-modules on the basic affine space and large $\mathfrak{g}$-modules Masatoshi Kitagawa et.al. 2406.07279v1 null
2024-06-11 Are Protein Language Models Compute Optimal? Yaiza Serrano et.al. 2406.07249v1 null
2024-06-11 Dynamical Mean-Field Theory of Self-Attention Neural Networks Ángel Poc-López et.al. 2406.07247v1 null

Vision Transformer

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 ReduceFormer: Attention with Tensor Reduction by Summation John Yang et.al. 2406.07488v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null

Reinforcement Learning

Reinforcement Learning

Publish Date Title Authors PDF Code
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang et.al. 2406.07455v1 null
2024-06-11 Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization Weiliang Zhang et.al. 2406.07418v1 null
2024-06-11 Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy Xiaohan Huang et.al. 2406.07404v1 null
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 World Models with Hints of Large Language Models for Goal Achieving Zeyuan Liu et.al. 2406.07381v1 null
2024-06-11 EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning Yijun Hao et.al. 2406.07342v1 null
2024-06-11 Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling Constantin Waubert de Puiseau et.al. 2406.07325v1 null
2024-06-11 Multi-objective Reinforcement learning from AI Feedback Marcus Williams et.al. 2406.07295v2 null
2024-06-11 Hybrid Reinforcement Learning from Offline Observation Alone Yuda Song et.al. 2406.07253v1 null
2024-06-11 A generic and robust quantum agent inspired by deep meta-reinforcement learning Zibo Miao et.al. 2406.07225v1 null
2024-06-11 Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning Zhiyu Shao et.al. 2406.07213v1 link
2024-06-11 Machine learning potential for the Cu-W system Manura Liyanage et.al. 2406.07157v1 null
2024-06-11 Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models Som Sagar et.al. 2406.07145v1 null
2024-06-11 CHARME: A chain-based reinforcement learning approach for the minor embedding problem Hoang M. Ngo et.al. 2406.07124v1 null
2024-06-11 Augmenting Offline RL with Unlabeled Data Zhao Wang et.al. 2406.07117v1 null
2024-06-11 Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning Xuezhi Niu et.al. 2406.07069v1 null
2024-06-11 Integrating Domain Knowledge for handling Limited Data in Offline RL Briti Gangopadhyay et.al. 2406.07041v1 null
2024-06-11 Entropy-Reinforced Planning with Large Language Models for Drug Discovery Xuefeng Liu et.al. 2406.07025v1 null
2024-06-11 Delving into ChatGPT usage in academic writing through excess vocabulary Dmitry Kobak et.al. 2406.07016v1 null
2024-06-11 DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach Zhang Liu et.al. 2406.06986v1 null
2024-06-11 Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback Chenliang Li et.al. 2406.06874v1 null
2024-06-11 Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning Adhyyan Narang et.al. 2406.06856v1 null
2024-06-10 Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness Dingrong Wang et.al. 2406.06792v1 link
2024-06-10 Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation Michelle Pan et.al. 2406.06714v1 null
2024-06-10 Verification-Guided Shielding for Deep Reinforcement Learning Davide Corsi et.al. 2406.06507v1 null
2024-06-10 Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation Mohidul Haque Mridul et.al. 2406.06500v1 null
2024-06-10 Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity Calarina Muslimani et.al. 2406.06495v1 null
2024-06-10 Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots Bahador Beigomi et.al. 2406.06460v1 link

Robotics

Robotics

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding Ming Hu et.al. 2406.07471v2 null
2024-06-11 Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang et.al. 2406.07398v1 null
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 Improving the realism of robotic surgery simulation through injection of learning-based estimated errors Juan Antonio Barragan et.al. 2406.07375v1 null
2024-06-11 iMESA: Incremental Distributed Optimization for Collaborative Simultaneous Localization and Mapping Daniel McGann et.al. 2406.07371v1 null
2024-06-11 Realistic Data Generation for 6D Pose Estimation of Surgical Instruments Juan Antonio Barragan et.al. 2406.07328v1 null
2024-06-11 Should XAI Nudge Human Decisions with Explanation Biasing? Yosuke Fukuchi et.al. 2406.07323v1 null
2024-06-11 Experimental Modeling of Chiral Active Robots and a Minimal Model of Non-Gaussian Displacements Yuxuan Zhou et.al. 2406.07313v1 null
2024-06-11 Instruct Large Language Models to Drive like Humans Ruijun Zhang et.al. 2406.07296v1 link
2024-06-11 OTO Planner: An Efficient Only Travelling Once Exploration Planner for Complex and Unknown Environments Bo Zhou et.al. 2406.07294v1 null
2024-06-11 3D Voxel Maps to 2D Occupancy Maps for Efficient Path Planning for Aerial and Ground Robots Scott Fredriksson et.al. 2406.07270v1 null
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113v1 null
2024-06-11 A Neck Orthosis with Multi-Directional Variable Stiffness for Persons with Dropped Head Syndrome Santiago Price Torrendell et.al. 2406.07074v1 null
2024-06-11 Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning Xuezhi Niu et.al. 2406.07069v1 null
2024-06-11 Optimal Gait Design for a Soft Quadruped Robot via Multi-fidelity Bayesian Optimization Kaige Tan et.al. 2406.07065v1 null
2024-06-11 GPU-Accelerated Optimization-Based Collision Avoidance Zeming Wu et.al. 2406.07048v1 null
2024-06-11 Neural Visibility Field for Uncertainty-Driven Active Mapping Shangjie Xue et.al. 2406.06948v1 null
2024-06-11 CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only Junhee Cho et.al. 2406.06947v1 link
2024-06-11 Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots Xiang Zhi Tan et.al. 2406.06904v1 null
2024-06-11 Developing, Analyzing, and Evaluating Vehicular Lane Keeping Algorithms Under Dynamic Lighting and Weather Conditions Using Electric Vehicles Michael Khalfin et.al. 2406.06899v1 null
2024-06-11 Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback Chenliang Li et.al. 2406.06874v1 null
2024-06-10 HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction Jikai Wang et.al. 2406.06843v1 null
2024-06-10 FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors Jason Wu et.al. 2406.06796v1 link
2024-06-10 Starling Formation-Flying Optical Experiment: Initial Operations and Flight Results Justin Kruger et.al. 2406.06748v1 null
2024-06-10 Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents Federico Rossi et.al. 2406.06724v1 null
2024-06-10 Verification-Guided Shielding for Deep Reinforcement Learning Davide Corsi et.al. 2406.06507v1 null
2024-06-10 Demonstrating HumanTHOR: A Simulation Platform and Benchmark for Human-Robot Collaboration in a Shared Workspace Chenxu Wang et.al. 2406.06498v1 null

SFM

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? Ioannis D. Gialamas et.al. 2406.07533v1 null
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity Ozgur Akarsu et.al. 2406.07526v1 null
2024-06-11 Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions Haibo Wang et.al. 2406.07525v1 null
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 The canonical trace of Cohen-Macaulay algebras of codimension 2 Antonino Ficarra et.al. 2406.07517v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum N. -O. Stutzer et.al. 2406.07511v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null

SLAM

Publish Date Title Authors PDF Code
2024-06-10 Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) Gyubeom Im et.al. 2406.06427v1 null
2024-06-10 Notes on Various Errors and Jacobian Derivations for SLAM Gyubeom Im et.al. 2406.06422v1 null
2024-06-10 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374v1 link
2024-06-10 Visual-Inertial SLAM as Simple as A, B, VINS Nathaniel Merrill et.al. 2406.05969v1 null
2024-06-09 MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps Jianhao Zheng et.al. 2406.05849v1 null
2024-06-06 Open Problem: Active Representation Learning Nikola Milosevic et.al. 2406.03845v1 null
2024-06-04 ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization Chen Mao et.al. 2406.01906v1 link
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929v1 null
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885v1 link
2024-05-30 Structure Gaussian SLAM with Manhattan World Hypothesis Shuhong Liu et.al. 2405.20031v1 null
2024-05-30 Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar Wouter Jansen et.al. 2405.19869v1 null
2024-05-30 SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization Jiang Wang et.al. 2405.19813v1 link
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614v1 null
2024-05-27 CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy Richard Elvira et.al. 2405.16932v1 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544v1 link
2024-05-24 NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes Lizhi Bai et.al. 2405.15151v1 null
2024-05-23 ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization Han Song et.al. 2405.15082v2 null
2024-05-23 Synergistic Global-space Camera and Human Reconstruction from Videos Yizhou Zhao et.al. 2405.14855v1 null
2024-05-23 CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments Yang Zhou et.al. 2405.14731v1 link
2024-05-23 Efficient Robot Learning for Perception and Mapping Niclas Vödisch et.al. 2405.14688v1 null
2024-05-22 Monocular Gaussian SLAM with Language Extended Loop Closure Tian Lan et.al. 2405.13748v1 null
2024-05-21 NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments Dongha Chung et.al. 2405.12563v2 link
2024-05-18 Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation Hyungtae Lim et.al. 2405.11176v3 null
2024-05-18 MotionGS : Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129v2 link
2024-05-17 CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion Gang Wang et.al. 2405.10793v2 null
2024-05-17 Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map Liang Zhao et.al. 2405.10743v1 null
2024-05-14 IPC: Incremental Probabilistic Consensus-based Consistent Set Maximization for SLAM Backends Emilio Olivastri et.al. 2405.08503v1 link
2024-05-13 OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition Qiuchi Xiang et.al. 2405.07966v1 link
2024-05-13 SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling Yijun Yuan et.al. 2405.07847v1 null
2024-05-12 NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU Yuhao Zhang et.al. 2405.07392v1 link

Visual Localization

Publish Date Title Authors PDF Code
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field Chao Wang et.al. 2406.07329v1 null
2024-06-11 Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs Kamil Jeziorek et.al. 2406.07318v1 null
2024-06-11 Let Go of Your Labels with Unsupervised Transfer Artyom Gadetsky et.al. 2406.07236v1 link
2024-06-11 RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker Yunfeng Li et.al. 2406.07189v1 link
2024-06-11 Increased accuracy and signal-to-noise ratio through recent improvements in Infra-Red Video Bolometer fabrication and calibration Fabio Federici et.al. 2406.07139v1 null
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037v1 null
2024-06-11 MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results Xin Jin et.al. 2406.07006v1 null
2024-06-11 Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion Xin Yuan et.al. 2406.06972v1 null
2024-06-11 Neural Visibility Field for Uncertainty-Driven Active Mapping Shangjie Xue et.al. 2406.06948v1 null
2024-06-11 High-velocity blue-shifted Fe XXV He$α$ line during a superflare of the RS CVn-type star IM Peg Shun Inoue et.al. 2406.06940v1 null
2024-06-10 The PAU Survey: Photometric Calibration of Narrow Band Images F. J. Castander et.al. 2406.06850v1 null
2024-06-10 HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction Jikai Wang et.al. 2406.06843v1 null
2024-06-10 Starling Formation-Flying Optical Experiment: Initial Operations and Flight Results Justin Kruger et.al. 2406.06748v1 null
2024-06-10 PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction Danpeng Chen et.al. 2406.06521v1 null
2024-06-10 SYM3D: Learning Symmetric Triplanes for Better 3D-Awareness of GANs Jing Yang et.al. 2406.06432v1 null
2024-06-10 Notes on Various Errors and Jacobian Derivations for SLAM Gyubeom Im et.al. 2406.06422v1 null
2024-06-10 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374v1 link
2024-06-10 Relativistic and wide-angle corrections to galaxy power spectra Sheean Jolicoeur et.al. 2406.06274v1 null
2024-06-10 DualAD: Disentangling the Dynamic and Static World for End-to-End Driving Simon Doll et.al. 2406.06264v1 null
2024-06-10 Vript: A Video Is Worth Thousands of Words Dongjie Yang et.al. 2406.06040v1 link
2024-06-09 SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving Chen Ma et.al. 2406.05800v1 null
2024-06-09 Region of Interest Loss for Anonymizing Learned Image Compression Christoph Liebender et.al. 2406.05726v1 null
2024-06-09 Enhancing the light yield of He:CF$_4$ based gaseous detector F. D. Amaro et.al. 2406.05713v1 null
2024-06-09 MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation Yan Ma et.al. 2406.05690v1 link
2024-06-08 The PLATO Mission Heike Rauer et.al. 2406.05447v1 null
2024-06-08 MotionClone: Training-Free Motion Cloning for Controllable Video Generation Pengyang Ling et.al. 2406.05338v2 null
2024-06-07 Lessons from the Cruise Robotaxi Pedestrian Dragging Mishap Philip Koopman et.al. 2406.05281v1 null
2024-06-07 A Tensor Decomposition Perspective on Second-order RNNs Maude Lizaire et.al. 2406.05045v1 link

Contrastive Learning

Contrastive Learning

Publish Date Title Authors PDF Code
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Change of numeraire for weak martingale transport Mathias Beiglböck et.al. 2406.07523v1 null
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid Naser Souri et.al. 2406.07503v1 null
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale William Giarè et.al. 2406.07493v1 null
2024-06-11 Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction Bekir Z. Demiray et.al. 2406.07484v1 null
2024-06-11 Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery Biplov Bhandari et.al. 2406.07482v1 null

Medical Application

Medical Application

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum N. -O. Stutzer et.al. 2406.07511v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 $(J/ψ, J/ψ)$, and $(η_c, η_c)$ production through two intermediate photons in electron-positron annihilation at B-factories Shashank Bhatnagar et.al. 2406.07508v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention Mingshuai Liu et.al. 2406.07498v1 null
2024-06-11 A pilot protocol and cohort for the investigation of non-pathological variability in speech Nicholas Cummins et.al. 2406.07497v1 null
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing Bhaskar Gaur et.al. 2406.07486v1 null

Medical Image Analysis

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Interpolating between Hausdorff and box dimension Amlan Banaji et.al. 2406.07527v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum N. -O. Stutzer et.al. 2406.07511v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 $(J/ψ, J/ψ)$, and $(η_c, η_c)$ production through two intermediate photons in electron-positron annihilation at B-factories Shashank Bhatnagar et.al. 2406.07508v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention Mingshuai Liu et.al. 2406.07498v1 null
2024-06-11 A pilot protocol and cohort for the investigation of non-pathological variability in speech Nicholas Cummins et.al. 2406.07497v1 null
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing Bhaskar Gaur et.al. 2406.07486v1 null

Medical Multi-modal

Publish Date Title Authors PDF Code
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions Haibo Wang et.al. 2406.07525v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks Ted Edward Holmberg et.al. 2406.07473v1 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472v1 null
2024-06-11 Multimodal Belief Prediction John Murzaku et.al. 2406.07466v1 null
2024-06-11 Resummation of Multi-Stress Tensors in Higher Dimensions Kuo-Wei Huang et.al. 2406.07458v1 null
2024-06-11 An Optimism-based Approach to Online Evaluation of Generative Models Xiaoyan Hu et.al. 2406.07451v1 null
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450v1 link
2024-06-11 Graph-based multi-Feature fusion method for speech emotion recognition Xueyu Liu et.al. 2406.07437v1 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning Tonghan Wang et.al. 2406.07428v1 null
2024-06-11 DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses Abdurrahim Yilmaz et.al. 2406.07426v1 null
2024-06-11 Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation Hanzhao Li et.al. 2406.07422v1 null
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383v1 null
2024-06-11 World Models with Hints of Large Language Models for Goal Achieving Zeyuan Liu et.al. 2406.07381v1 null
2024-06-11 Improving the realism of robotic surgery simulation through injection of learning-based estimated errors Juan Antonio Barragan et.al. 2406.07375v1 null
2024-06-11 iMESA: Incremental Distributed Optimization for Collaborative Simultaneous Localization and Mapping Daniel McGann et.al. 2406.07371v1 null
2024-06-11 BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction Yinhao Bai et.al. 2406.07365v1 link
2024-06-11 AI.vs.Clinician: Unveiling Intricate Interactions Between AI and Clinicians through an Open-Access Database Wanling Gao et.al. 2406.07362v1 null
2024-06-11 Deep Implicit Optimization for Robust and Flexible Image Registration Rohit Jena et.al. 2406.07361v1 null
2024-06-11 GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews Maxime Darrin et.al. 2406.07359v1 null
2024-06-11 Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities Delfina Sol Martinez Pandiani et.al. 2406.07353v1 null
2024-06-11 DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering Zijian Hei et.al. 2406.07348v2 null
2024-06-11 Few-Body Quantum Chaos, Localization, and Multi-Photon Entanglement in Optical Synthetic Frequency Dimension Junlin Wang et.al. 2406.07346v1 null

Graph Neural Network

Graph Neural Network

Publish Date Title Authors PDF Code
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention Mingshuai Liu et.al. 2406.07498v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Image Neural Field Diffusion Models Yinbo Chen et.al. 2406.07480v1 null
2024-06-11 Lower bounds for sphere packing in arbitrary norms Carl Schildkraut et.al. 2406.07479v1 null
2024-06-11 Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks Ted Edward Holmberg et.al. 2406.07473v1 null
2024-06-11 Microbiomes Through The Looking Glass Jacopo Pasqualini et.al. 2406.07465v1 null
2024-06-11 Reconfigurable Intelligent Surfaces in Dynamic Rich Scattering Environments: BiLSTM-Based Optimization for Accurate User Localization Anum Umer et.al. 2406.07463v1 null
2024-06-11 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei et.al. 2406.07456v1 link
2024-06-11 HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms Josse Van Delm et.al. 2406.07453v1 null
2024-06-11 Boosted Conformal Prediction Intervals Ran Xie et.al. 2406.07449v1 null
2024-06-11 Metastability in networks of nonlinear stochastic integrate-and-fire neurons Siddharth Paliwal et.al. 2406.07445v1 null
2024-06-11 DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting Yuxuan Shu et.al. 2406.07438v1 null
2024-06-11 Graph-based multi-Feature fusion method for speech emotion recognition Xueyu Liu et.al. 2406.07437v1 null
2024-06-11 Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration Shashank Agnihotri et.al. 2406.07435v1 null
2024-06-11 Matryoshka Representation Learning for Recommendation Riwei Lai et.al. 2406.07432v1 link
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431v1 null
2024-06-11 GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning Tonghan Wang et.al. 2406.07428v1 null
2024-06-11 Graph Reasoning for Explainable Cold Start Recommendation Jibril Frej et.al. 2406.07420v1 null
2024-06-11 Average-exact mixed anomalies and compatible phases Yichen Xu et.al. 2406.07417v1 null
2024-06-11 Heat operators and isometry groups of Cuntz-Krieger algebras Dimitris Michail Gerontogiannis et.al. 2406.07416v1 null
2024-06-11 Holistic Memory Diversification for Incremental Learning in Growing Graphs Ziyue Qiao et.al. 2406.07413v1 null
2024-06-11 Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy Xiaohan Huang et.al. 2406.07404v1 null
2024-06-11 A Survey on Recent Random Walk-based Methods for Embedding Knowledge Graphs Elika Bozorgi et.al. 2406.07402v1 null
2024-06-11 Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance Ruxin Zheng et.al. 2406.07399v1 null

Large-Language Model

Large-Language Model

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? Ioannis D. Gialamas et.al. 2406.07533v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity Ozgur Akarsu et.al. 2406.07526v1 null
2024-06-11 Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions Haibo Wang et.al. 2406.07525v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics Reuben R. W. Wang et.al. 2406.07519v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" D. T. Chung et.al. 2406.07512v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 Uniqueness on average of large isoperimetric sets in noncompact manifolds with nonnegative Ricci curvature Gioacchino Antonelli et.al. 2406.07509v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null

Edge Computing

Privacy

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs et.al. 2406.07505v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Paraphrasing in Affirmative Terms Improves Negation Understanding MohammadHossein Rezaei et.al. 2406.07492v1 null

Efficient

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System SBND Collaboration et.al. 2406.07514v1 null
2024-06-11 Accurate Current Sharing in a DC Microgrid Using Modified Droop Control Algorithm Naser Souri et.al. 2406.07513v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs et.al. 2406.07505v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null

Scalability

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs et.al. 2406.07505v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Paraphrasing in Affirmative Terms Improves Negation Understanding MohammadHossein Rezaei et.al. 2406.07492v1 null

Performance

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? Ioannis D. Gialamas et.al. 2406.07533v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity Ozgur Akarsu et.al. 2406.07526v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System SBND Collaboration et.al. 2406.07514v1 null
2024-06-11 Accurate Current Sharing in a DC Microgrid Using Modified Droop Control Algorithm Naser Souri et.al. 2406.07513v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum N. -O. Stutzer et.al. 2406.07511v1 null
2024-06-11 COMAP Pathfinder -- Season 2 results I. Improved data selection and processing J. G. S. Lunde et.al. 2406.07510v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null

Reliability

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs et.al. 2406.07505v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Paraphrasing in Affirmative Terms Improves Negation Understanding MohammadHossein Rezaei et.al. 2406.07492v1 null

Trust

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs et.al. 2406.07505v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Paraphrasing in Affirmative Terms Improves Negation Understanding MohammadHossein Rezaei et.al. 2406.07492v1 null

Secure

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-11 THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs et.al. 2406.07505v1 null
2024-06-11 Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Atli Sigurgeirsson et.al. 2406.07504v1 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 SPIN: Spacecraft Imagery for Navigation Javier Montalvo et.al. 2406.07500v2 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499v1 null
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496v1 link
2024-06-11 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494v2 null
2024-06-11 Paraphrasing in Affirmative Terms Improves Negation Understanding MohammadHossein Rezaei et.al. 2406.07492v1 null

Edge Computing

Publish Date Title Authors PDF Code
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546v1 null
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545v1 link
2024-06-11 Situational Awareness Matters in 3D Vision Language Reasoning Yunze Man et.al. 2406.07544v1 null
2024-06-11 Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning Chenyu Yang et.al. 2406.07543v1 link
2024-06-11 Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis David Ortiz-Perez et.al. 2406.07542v1 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Autoregressive Pretraining with Mamba in Vision Sucheng Ren et.al. 2406.07537v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 On the potential of probing the neutron star composition in accreting X-ray binaries Kaiser Arf et.al. 2406.07534v1 null
2024-06-11 Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? Ioannis D. Gialamas et.al. 2406.07533v1 null
2024-06-11 Hearing Anything Anywhere Mason Wang et.al. 2406.07532v1 link
2024-06-11 Interacting-bath dynamical embedding for capturing non-local electron correlation in solids Jiachen Li et.al. 2406.07531v1 null
2024-06-11 Coherent Three-Photon Excitation of the Strontium Clock Transition Junyu He et.al. 2406.07530v1 null
2024-06-11 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Lu Li et.al. 2406.07529v1 null
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528v1 link
2024-06-11 Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity Ozgur Akarsu et.al. 2406.07526v1 null
2024-06-11 Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions Haibo Wang et.al. 2406.07525v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522v1 null
2024-06-11 Faster Spectral Density Estimation and Sparsification in the Nuclear Norm Yujia Jin et.al. 2406.07521v1 null
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null