Automated deployment @ 2024-06-13 09:04:46 Asia/Shanghai
Contributions are welcome! Add your topics and keywords in topic.yml. You can also view historical data through the storage.
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | A pilot protocol and cohort for the investigation of non-pathological variability in speech | Nicholas Cummins et.al. | 2406.07497v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior | Anming Gu et.al. | 2406.07475v1 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472v1 | null |
2024-06-11 | Exploring non-radial oscillation modes in dark matter admixed neutron stars | Pratik Thakur et.al. | 2406.07470v1 | null |
2024-06-11 | Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control | Jacob Thrän et.al. | 2406.07454v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Single and merger soliton dynamics in scalar field dark matter with and without self-interactions | Matthias Stallovits et.al. | 2406.07419v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | Operad of posets 101: The Wixarika posets | José Antonio Arciniega-Nevárez et.al. | 2406.07370v1 | null |
2024-06-11 | Fast and accurate evaluation of Biot-Savart integrals over spatial curves | Juan Ignacio Polanco et.al. | 2406.07366v1 | null |
2024-06-11 | Deep Implicit Optimization for Robust and Flexible Image Registration | Rohit Jena et.al. | 2406.07361v1 | null |
2024-06-11 | Machine Learning approaches to classical density functional theory | Alessandro Simon et.al. | 2406.07345v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | The representation and computational efficiency of the Tolman-Oppenheimer-Volkoff equations in isotropic coordinates | Dániel Barta et.al. | 2406.07319v1 | null |
2024-06-11 | A directional total variation minimization algorithm for isotropic resolution in digital breast tomosynthesis | Emil Y. Sidky et.al. | 2406.07306v1 | null |
2024-06-11 | Shadows, rings and optical appearance of a magnetically charged regular black hole illuminated by various accretion disks | Soroush Zare et.al. | 2406.07300v1 | null |
2024-06-11 | Multi-objective Reinforcement learning from AI Feedback | Marcus Williams et.al. | 2406.07295v2 | null |
2024-06-11 | Joint Learning of Context and Feedback Embeddings in Spoken Dialogue | Livia Qian et.al. | 2406.07291v1 | null |
2024-06-11 | Unsupervised Object Detection with Theoretical Guarantees | Marian Longa et.al. | 2406.07284v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Novel Optimized Designs of Modulo | Bhaskar Gaur et.al. | 2406.07486v1 | null |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480v1 | null |
2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456v1 | link |
2024-06-11 | An Optimism-based Approach to Online Evaluation of Generative Models | Xiaoyan Hu et.al. | 2406.07451v1 | null |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450v1 | link |
2024-06-11 | Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration | Shashank Agnihotri et.al. | 2406.07435v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Learning Domain-Invariant Features for Out-of-Context News Detection | Yimeng Gu et.al. | 2406.07430v1 | null |
2024-06-11 | DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses | Abdurrahim Yilmaz et.al. | 2406.07426v1 | null |
2024-06-11 | MINERS: Multilingual Language Models as Semantic Retrievers | Genta Indra Winata et.al. | 2406.07424v1 | null |
2024-06-11 | Holistic Memory Diversification for Incremental Learning in Growing Graphs | Ziyue Qiao et.al. | 2406.07413v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | A pilot protocol and cohort for the investigation of non-pathological variability in speech | Nicholas Cummins et.al. | 2406.07497v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior | Anming Gu et.al. | 2406.07475v1 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472v1 | null |
2024-06-11 | Exploring non-radial oscillation modes in dark matter admixed neutron stars | Pratik Thakur et.al. | 2406.07470v1 | null |
2024-06-11 | Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control | Jacob Thrän et.al. | 2406.07454v1 | null |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450v1 | link |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Single and merger soliton dynamics in scalar field dark matter with and without self-interactions | Matthias Stallovits et.al. | 2406.07419v1 | null |
2024-06-11 | Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization | Weiliang Zhang et.al. | 2406.07418v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling | Sixian Wang et.al. | 2406.07390v1 | null |
2024-06-11 | Operad of posets 101: The Wixarika posets | José Antonio Arciniega-Nevárez et.al. | 2406.07370v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System | SBND Collaboration et.al. | 2406.07514v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | A pilot protocol and cohort for the investigation of non-pathological variability in speech | Nicholas Cummins et.al. | 2406.07497v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | Neutrino magnetic dipole portal with low energy neutrino nucleus scattering data | Ying-Ying Li et.al. | 2406.07477v1 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472v1 | null |
2024-06-11 | Exploring non-radial oscillation modes in dark matter admixed neutron stars | Pratik Thakur et.al. | 2406.07470v1 | null |
2024-06-11 | Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control | Jacob Thrän et.al. | 2406.07454v1 | null |
2024-06-11 | Search for photons above 10$^{18}$ eV by simultaneously measuring the atmospheric depth and the muon content of air showers at the Pierre Auger Observatory | The Pierre Auger Collaboration et.al. | 2406.07439v1 | null |
2024-06-11 | Single and merger soliton dynamics in scalar field dark matter with and without self-interactions | Matthias Stallovits et.al. | 2406.07419v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | Operad of posets 101: The Wixarika posets | José Antonio Arciniega-Nevárez et.al. | 2406.07370v1 | null |
2024-06-11 | Fast and accurate evaluation of Biot-Savart integrals over spatial curves | Juan Ignacio Polanco et.al. | 2406.07366v1 | null |
2024-06-11 | Deep Implicit Optimization for Robust and Flexible Image Registration | Rohit Jena et.al. | 2406.07361v1 | null |
2024-06-11 | Machine Learning approaches to classical density functional theory | Alessandro Simon et.al. | 2406.07345v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | The representation and computational efficiency of the Tolman-Oppenheimer-Volkoff equations in isotropic coordinates | Dániel Barta et.al. | 2406.07319v1 | null |
2024-06-11 | A directional total variation minimization algorithm for isotropic resolution in digital breast tomosynthesis | Emil Y. Sidky et.al. | 2406.07306v1 | null |
2024-06-11 | Shadows, rings and optical appearance of a magnetically charged regular black hole illuminated by various accretion disks | Soroush Zare et.al. | 2406.07300v1 | null |
2024-06-11 | Multi-objective Reinforcement learning from AI Feedback | Marcus Williams et.al. | 2406.07295v2 | null |
2024-06-11 | Joint Learning of Context and Feedback Embeddings in Spoken Dialogue | Livia Qian et.al. | 2406.07291v1 | null |
2024-06-11 | Unsupervised Object Detection with Theoretical Guarantees | Marian Longa et.al. | 2406.07284v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Novel Optimized Designs of Modulo | Bhaskar Gaur et.al. | 2406.07486v1 | null |
2024-06-11 | Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing | Mao Li et.al. | 2406.07483v1 | null |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480v1 | null |
2024-06-11 | Resummation of Multi-Stress Tensors in Higher Dimensions | Kuo-Wei Huang et.al. | 2406.07458v1 | null |
2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456v1 | link |
2024-06-11 | An Optimism-based Approach to Online Evaluation of Generative Models | Xiaoyan Hu et.al. | 2406.07451v1 | null |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450v1 | link |
2024-06-11 | Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration | Shashank Agnihotri et.al. | 2406.07435v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Learning Domain-Invariant Features for Out-of-Context News Detection | Yimeng Gu et.al. | 2406.07430v1 | null |
2024-06-11 | DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses | Abdurrahim Yilmaz et.al. | 2406.07426v1 | null |
2024-06-11 | Optimal Marital Strategies: How Couples Develop Successful Interaction Styles | Micah Henson et.al. | 2406.07403v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | Textual Similarity as a Key Metric in Machine Translation Quality Estimation | Kun Sun et.al. | 2406.07440v1 | null |
2024-06-11 | MINERS: Multilingual Language Models as Semantic Retrievers | Genta Indra Winata et.al. | 2406.07424v1 | null |
2024-06-11 | Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech | Yin-Long Liu et.al. | 2406.07410v1 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | A Banach space whose set of norm-attaining functionals is algebraically trivial | Miguel Martin et.al. | 2406.07273v1 | null |
2024-06-11 | Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jinyuan Li et.al. | 2406.07268v1 | null |
2024-06-11 | Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning | Zhiyu Shao et.al. | 2406.07213v1 | link |
2024-06-11 | Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation | Diwei Sheng et.al. | 2406.07202v1 | null |
2024-06-11 | Target Speech Diarization with Multimodal Prompts | Yidi Jiang et.al. | 2406.07198v1 | null |
2024-06-11 | RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker | Yunfeng Li et.al. | 2406.07189v1 | link |
2024-06-11 | TernaryLLM: Ternarized Large Language Model | Tianqi Chen et.al. | 2406.07177v1 | null |
2024-06-11 | ULog: Unsupervised Log Parsing with Large Language Models through Log Contrastive Units | Junjie Huang et.al. | 2406.07174v1 | null |
2024-06-11 | FaceGPT: Self-supervised Learning to Chat about 3D Human Faces | Haoran Wang et.al. | 2406.07163v1 | null |
2024-06-11 | EEG classification for visual brain decoding with spatio-temporal and transformer based paradigms | Akanksha Sharma et.al. | 2406.07153v1 | null |
2024-06-11 | Translating speech with just images | Dan Oneata et.al. | 2406.07133v1 | null |
2024-06-11 | Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph | Sergey Linok et.al. | 2406.07113v1 | null |
2024-06-11 | AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding | Xing Zhang et.al. | 2406.07091v1 | null |
2024-06-11 | CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation | Zhongzhen Huang et.al. | 2406.07085v1 | null |
2024-06-11 | 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Mingqi Gao et.al. | 2406.07043v1 | link |
2024-06-11 | EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Yining Shi et.al. | 2406.07042v1 | link |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037v1 | null |
2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032v1 | null |
2024-06-11 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023v2 | null |
2024-06-11 | Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models | Sooyeon Go et.al. | 2406.07008v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis | Qining Zhang et.al. | 2406.07455v1 | null |
2024-06-11 | On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations | Shiao Meng et.al. | 2406.07444v1 | null |
2024-06-11 | Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech | Yin-Long Liu et.al. | 2406.07410v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering | Longlong Lin et.al. | 2406.07357v1 | null |
2024-06-11 | The Theory of Intrinsic Time: A Primer | James B. Glattfelder et.al. | 2406.07354v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | A Banach space whose set of norm-attaining functionals is algebraically trivial | Miguel Martin et.al. | 2406.07273v1 | null |
2024-06-11 | Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jinyuan Li et.al. | 2406.07268v1 | null |
2024-06-11 | Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation | Diwei Sheng et.al. | 2406.07202v1 | null |
2024-06-11 | Quantum repeaters based on stationary Gottesman-Kitaev-Preskill qubits | Stefan Häussler et.al. | 2406.07158v1 | null |
2024-06-11 | Scaling Large-Language-Model-based Multi-Agent Collaboration | Chen Qian et.al. | 2406.07155v1 | link |
2024-06-11 | CHARME: A chain-based reinforcement learning approach for the minor embedding problem | Hoang M. Ngo et.al. | 2406.07124v1 | null |
2024-06-11 | The Treatment of Ties in Rank-Biased Overlap | Matteo Corsi et.al. | 2406.07121v1 | null |
2024-06-11 | Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph | Sergey Linok et.al. | 2406.07113v1 | null |
2024-06-11 | Large amplitude quasi-periodic traveling waves in two dimensional forced rotating fluids | Roberta Bianchini et.al. | 2406.07099v1 | null |
2024-06-11 | Edge Rendering Architecture for multiuser XR Experiences and E2E Performance Assessment | Inhar Yeregui et.al. | 2406.07087v1 | null |
2024-06-11 | CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation | Zhongzhen Huang et.al. | 2406.07085v1 | null |
2024-06-11 | Triage of 3D pathology data via 2.5D multiple-instance learning to guide pathologist assessments | Gan Gao et.al. | 2406.07061v1 | link |
2024-06-11 | Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study | Yichi Zhang et.al. | 2406.07057v1 | null |
2024-06-11 | 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Mingqi Gao et.al. | 2406.07043v1 | link |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037v1 | null |
2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032v1 | null |
2024-06-11 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023v2 | null |
2024-06-11 | Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples | Kailas Dayanandan et.al. | 2406.06967v1 | link |
2024-06-11 | Distributional MIPLIB: a Multi-Domain Library for Advancing ML-Guided MILP Methods | Weimin Huang et.al. | 2406.06954v1 | null |
2024-06-11 | Sparse Bayesian Networks: Efficient Uncertainty Quantification in Medical Image Analysis | Zeinab Abboud et.al. | 2406.06946v1 | null |
2024-06-11 | UVIS: Unsupervised Video Instance Segmentation | Shuaiyi Huang et.al. | 2406.06908v1 | null |
2024-06-11 | Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots | Xiang Zhi Tan et.al. | 2406.06904v1 | null |
2024-06-11 | Universal spatial inflation of human mobility | Lu Zhong et.al. | 2406.06889v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | Differentiability and Optimization of Multiparameter Persistent Homology | Luis Scoccola et.al. | 2406.07224v1 | null |
2024-06-10 | Relative descriptors for quantum agents | David Möckli et.al. | 2406.06719v1 | null |
2024-06-08 | Unsupervised learning of Data-driven Facial Expression Coding System (DFECS) using keypoint tracking | Shivansh Chandra Tripathi et.al. | 2406.05434v1 | null |
2024-06-07 | Expected Lipschitz-Killing curvatures for spin random fields and other non-isotropic fields | Francesca Pistolato et.al. | 2406.04850v1 | null |
2024-06-07 | LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model | Dongkai Wang et.al. | 2406.04659v1 | link |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835v1 | null |
2024-06-05 | Image Copy-Move Forgery Detection and Localization Scheme: How to Avoid Missed Detection and False Alarm | Li Jiang et.al. | 2406.03271v1 | null |
2024-06-05 | Topological Neural Networks go Persistent, Equivariant, and Continuous | Yogesh Verma et.al. | 2406.03164v1 | null |
2024-06-05 | How precisely are solute clusters in RPV steels characterized by atom probe experiments? | N. Castin et.al. | 2406.02973v1 | null |
2024-06-05 | Homotopic Path Set Planning for Robot Manipulation and Navigation | Jing Huang et.al. | 2406.02885v1 | link |
2024-06-05 | Controllable Talking Face Generation by Implicit Facial Keypoints Editing | Dong Zhao et.al. | 2406.02880v1 | null |
2024-06-04 | Machine learning Hubbard parameters with equivariant neural networks | Martin Uhrin et.al. | 2406.02457v1 | link |
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315v1 | link |
2024-06-03 | MoFormer: Multi-objective Antimicrobial Peptide Generation Based on Conditional Transformer Joint Multi-modal Fusion Descriptor | Li Wang et.al. | 2406.02610v1 | null |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676v1 | null |
2024-06-02 | SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection | Yun Peng et.al. | 2406.00625v2 | null |
2024-06-01 | CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation | Matan Rusanovsky et.al. | 2406.00384v1 | link |
2024-05-31 | Learning from metastable grain boundaries | Avanish Mishra et.al. | 2406.00204v1 | null |
2024-05-30 | Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach | Muhammad Saif Ullah Khan et.al. | 2405.20084v1 | null |
2024-05-30 | KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation | Fengyuan Yang et.al. | 2405.19833v1 | link |
2024-05-30 | Automatic Dance Video Segmentation for Understanding Choreography | Koki Endo et.al. | 2405.19727v1 | null |
2024-05-30 | SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations | Yujiao Jiang et.al. | 2405.19609v1 | null |
2024-05-29 | SDPRLayers: Certifiable Backpropagation Through Polynomial Optimization Problems in Robotics | Connor Holmes et.al. | 2405.19309v1 | null |
2024-05-29 | Greedy Kernel Methods for Approximating Breakthrough Curves for Reactive Flow from 3D Porous Geometry Data | Robin Herkert et.al. | 2405.19170v1 | null |
2024-05-29 | PointNetPGAP-SLC: A 3D LiDAR-based Place Recognition Approach with Segment-level Consistency Training for Mobile Robots in Horticulture | T. Barros et.al. | 2405.19038v1 | link |
2024-05-29 | Classification analysis of transition-metal chalcogenides and oxides using quantum machine learning | Kurudi V Vedavyasa et.al. | 2405.18989v1 | null |
2024-05-29 | Diffeomorphic interpolation for efficient persistence-based topological optimization | Mathieu Carriere et.al. | 2405.18820v1 | null |
2024-05-28 | Temperature-Dependent Chirality in Halide Perovskites | Mike Pols et.al. | 2405.18643v1 | null |
2024-05-28 | What can machine learning help with microstructure-informed materials modeling and design? | Xiang-Long Peng et.al. | 2405.18396v1 | null |
2024-05-28 | Relational Self-supervised Distillation with Compact Descriptors for Image Copy Detection | Juntae Kim et.al. | 2405.17928v3 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing | Mao Li et.al. | 2406.07483v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | Resummation of Multi-Stress Tensors in Higher Dimensions | Kuo-Wei Huang et.al. | 2406.07458v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup | Takahiro Ueda et.al. | 2406.07427v1 | null |
2024-06-11 | Adic curves: stable reduction, skeletons and metric structure | Katharina Hübner et.al. | 2406.07414v1 | null |
2024-06-11 | Private Geometric Median | Mahdi Haghifam et.al. | 2406.07407v1 | null |
2024-06-11 | Optimal Marital Strategies: How Couples Develop Successful Interaction Styles | Micah Henson et.al. | 2406.07403v1 | null |
2024-06-11 | Disrupting Bipartite Trading Networks: Matching for Revenue Maximization | Luca D'Amico-Wong et.al. | 2406.07385v1 | null |
2024-06-11 | Closing the Computational-Query Depth Gap in Parallel Stochastic Convex Optimization | Arun Jambulapati et.al. | 2406.07373v1 | null |
2024-06-11 | Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories | An-Yi Huang et.al. | 2406.07341v1 | null |
2024-06-11 | Searching for gravitational waves from stellar-mass binary black holes early inspiral | Xue-Ting Zhang et.al. | 2406.07336v1 | null |
2024-06-11 | Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold | Mrinmoy Datta et.al. | 2406.07326v1 | null |
2024-06-11 | Lyapunov equations: a (fixed) point of view | Richard Pates et.al. | 2406.07324v1 | null |
2024-06-11 | Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs | Kamil Jeziorek et.al. | 2406.07318v1 | null |
2024-06-11 | Morse Index Stability for the Ginzburg-Landau Approximation | Francesca Da Lio et.al. | 2406.07317v1 | null |
2024-06-11 | Sum the Probabilities to | Zakaria Derbazi et.al. | 2406.07283v1 | null |
2024-06-11 | Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport | Ross Irwin et.al. | 2406.07266v1 | null |
2024-06-11 | Coupled-channel | Jozef J. Dudek et.al. | 2406.07261v1 | null |
2024-06-11 | Hybrid Reinforcement Learning from Offline Observation Alone | Yuda Song et.al. | 2406.07253v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System | SBND Collaboration et.al. | 2406.07514v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | A pilot protocol and cohort for the investigation of non-pathological variability in speech | Nicholas Cummins et.al. | 2406.07497v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior | Anming Gu et.al. | 2406.07475v1 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472v1 | null |
2024-06-11 | Exploring non-radial oscillation modes in dark matter admixed neutron stars | Pratik Thakur et.al. | 2406.07470v1 | null |
2024-06-11 | Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains | Kush Kinra et.al. | 2406.07460v1 | null |
2024-06-11 | Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control | Jacob Thrän et.al. | 2406.07454v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Single and merger soliton dynamics in scalar field dark matter with and without self-interactions | Matthias Stallovits et.al. | 2406.07419v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | Operad of posets 101: The Wixarika posets | José Antonio Arciniega-Nevárez et.al. | 2406.07370v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System | SBND Collaboration et.al. | 2406.07514v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid | Naser Souri et.al. | 2406.07503v1 | null |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | A pilot protocol and cohort for the investigation of non-pathological variability in speech | Nicholas Cummins et.al. | 2406.07497v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | Prospects for the detection of Dark Matter with Long-lived Mediators in the Sun using the Southern Wide-field Gamma-ray Observatory | Micael Andrade et.al. | 2406.07489v1 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing | Mao Li et.al. | 2406.07483v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | Neutrino magnetic dipole portal with low energy neutrino nucleus scattering data | Ying-Ying Li et.al. | 2406.07477v1 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472v1 | null |
2024-06-11 | Exploring non-radial oscillation modes in dark matter admixed neutron stars | Pratik Thakur et.al. | 2406.07470v1 | null |
2024-06-11 | Anomaly Detection on Unstable Logs with GPT Models | Fatemeh Hadadi et.al. | 2406.07467v1 | null |
2024-06-11 | Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains | Kush Kinra et.al. | 2406.07460v1 | null |
2024-06-11 | Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control | Jacob Thrän et.al. | 2406.07454v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | Resummation of Multi-Stress Tensors in Higher Dimensions | Kuo-Wei Huang et.al. | 2406.07458v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup | Takahiro Ueda et.al. | 2406.07427v1 | null |
2024-06-11 | Adic curves: stable reduction, skeletons and metric structure | Katharina Hübner et.al. | 2406.07414v1 | null |
2024-06-11 | Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech | Yin-Long Liu et.al. | 2406.07410v1 | null |
2024-06-11 | Private Geometric Median | Mahdi Haghifam et.al. | 2406.07407v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories | An-Yi Huang et.al. | 2406.07341v1 | null |
2024-06-11 | Searching for gravitational waves from stellar-mass binary black holes early inspiral | Xue-Ting Zhang et.al. | 2406.07336v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold | Mrinmoy Datta et.al. | 2406.07326v1 | null |
2024-06-11 | Lyapunov equations: a (fixed) point of view | Richard Pates et.al. | 2406.07324v1 | null |
2024-06-11 | Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs | Kamil Jeziorek et.al. | 2406.07318v1 | null |
2024-06-11 | Morse Index Stability for the Ginzburg-Landau Approximation | Francesca Da Lio et.al. | 2406.07317v1 | null |
2024-06-11 | Sum the Probabilities to | Zakaria Derbazi et.al. | 2406.07283v1 | null |
2024-06-11 | A Banach space whose set of norm-attaining functionals is algebraically trivial | Miguel Martin et.al. | 2406.07273v1 | null |
2024-06-11 | Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jinyuan Li et.al. | 2406.07268v1 | null |
2024-06-11 | Coupled-channel | Jozef J. Dudek et.al. | 2406.07261v1 | null |
2024-06-11 | Even dimensional Fermat cubics are rational over any field | Alex Massarenti et.al. | 2406.07223v1 | null |
2024-06-11 | Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation | Diwei Sheng et.al. | 2406.07202v1 | null |
2024-06-11 | A Multi-step Approach for Minimizing Risk in Decentralized Exchanges | Daniele Maria Di Nosse et.al. | 2406.07200v2 | null |
2024-06-11 | TernaryLLM: Ternarized Large Language Model | Tianqi Chen et.al. | 2406.07177v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | Resummation of Multi-Stress Tensors in Higher Dimensions | Kuo-Wei Huang et.al. | 2406.07458v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup | Takahiro Ueda et.al. | 2406.07427v1 | null |
2024-06-11 | Adic curves: stable reduction, skeletons and metric structure | Katharina Hübner et.al. | 2406.07414v1 | null |
2024-06-11 | Private Geometric Median | Mahdi Haghifam et.al. | 2406.07407v1 | null |
2024-06-11 | Deep Implicit Optimization for Robust and Flexible Image Registration | Rohit Jena et.al. | 2406.07361v1 | null |
2024-06-11 | Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories | An-Yi Huang et.al. | 2406.07341v1 | null |
2024-06-11 | Searching for gravitational waves from stellar-mass binary black holes early inspiral | Xue-Ting Zhang et.al. | 2406.07336v1 | null |
2024-06-11 | Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold | Mrinmoy Datta et.al. | 2406.07326v1 | null |
2024-06-11 | Lyapunov equations: a (fixed) point of view | Richard Pates et.al. | 2406.07324v1 | null |
2024-06-11 | Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs | Kamil Jeziorek et.al. | 2406.07318v1 | null |
2024-06-11 | Morse Index Stability for the Ginzburg-Landau Approximation | Francesca Da Lio et.al. | 2406.07317v1 | null |
2024-06-11 | Sum the Probabilities to | Zakaria Derbazi et.al. | 2406.07283v1 | null |
2024-06-11 | Coupled-channel | Jozef J. Dudek et.al. | 2406.07261v1 | null |
2024-06-11 | Even dimensional Fermat cubics are rational over any field | Alex Massarenti et.al. | 2406.07223v1 | null |
2024-06-11 | A Multi-step Approach for Minimizing Risk in Decentralized Exchanges | Daniele Maria Di Nosse et.al. | 2406.07200v2 | null |
2024-06-11 | TernaryLLM: Ternarized Large Language Model | Tianqi Chen et.al. | 2406.07177v1 | null |
2024-06-11 | Ultrametric-preserving functions as monoid endomorphisms | Oleksiy Dovgoshey et.al. | 2406.07166v2 | null |
2024-06-11 | ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators | Jun Yin et.al. | 2406.07161v1 | null |
2024-06-11 | Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO | Ali Elkeshawy et.al. | 2406.07160v1 | null |
2024-06-11 | A portrait of the rotation of Ultra-Cool Dwarfs revealed by TESS | D. O. Fontinele et.al. | 2406.07154v1 | null |
2024-06-11 | High-performance in-vacuum optical system for quantum optics experiments in a Penning-trap | Joaquín Berrocal et.al. | 2406.07152v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | Uniqueness on average of large isoperimetric sets in noncompact manifolds with nonnegative Ricci curvature | Gioacchino Antonelli et.al. | 2406.07509v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | Resummation of Multi-Stress Tensors in Higher Dimensions | Kuo-Wei Huang et.al. | 2406.07458v1 | null |
2024-06-11 | McEval: Massively Multilingual Code Evaluation | Linzheng Chai et.al. | 2406.07436v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup | Takahiro Ueda et.al. | 2406.07427v1 | null |
2024-06-11 | Adic curves: stable reduction, skeletons and metric structure | Katharina Hübner et.al. | 2406.07414v1 | null |
2024-06-11 | VersiCode: Towards Version-controllable Code Generation | Tongtong Wu et.al. | 2406.07411v1 | null |
2024-06-11 | Private Geometric Median | Mahdi Haghifam et.al. | 2406.07407v1 | null |
2024-06-11 | Limited Out-of-Context Knowledge Reasoning in Large Language Models | Peng Hu et.al. | 2406.07393v1 | null |
2024-06-11 | A mechanical qubit | Yu Yang et.al. | 2406.07360v1 | null |
2024-06-11 | Finite | Jonathan S. Brown et.al. | 2406.07350v1 | null |
2024-06-11 | Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories | An-Yi Huang et.al. | 2406.07341v1 | null |
2024-06-11 | Searching for gravitational waves from stellar-mass binary black holes early inspiral | Xue-Ting Zhang et.al. | 2406.07336v1 | null |
2024-06-11 | Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold | Mrinmoy Datta et.al. | 2406.07326v1 | null |
2024-06-11 | Lyapunov equations: a (fixed) point of view | Richard Pates et.al. | 2406.07324v1 | null |
2024-06-11 | Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs | Kamil Jeziorek et.al. | 2406.07318v1 | null |
2024-06-11 | Morse Index Stability for the Ginzburg-Landau Approximation | Francesca Da Lio et.al. | 2406.07317v1 | null |
2024-06-11 | Sum the Probabilities to | Zakaria Derbazi et.al. | 2406.07283v1 | null |
2024-06-11 | Coupled-channel | Jozef J. Dudek et.al. | 2406.07261v1 | null |
2024-06-11 | Hybrid Reinforcement Learning from Offline Observation Alone | Yuda Song et.al. | 2406.07253v1 | null |
2024-06-11 | Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring | Tomoya Nishida et.al. | 2406.07250v1 | null |
2024-06-11 | Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces | Salvatore Federico et.al. | 2406.07242v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | On constant mean curvature 1-immersions of surfaces into hyperbolic 3-manifolds | Gabriella Tarantello et.al. | 2406.07518v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | Resummation of Multi-Stress Tensors in Higher Dimensions | Kuo-Wei Huang et.al. | 2406.07458v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Support for fragile porous dust in a gravitationally self-regulated disk around IM Lup | Takahiro Ueda et.al. | 2406.07427v1 | null |
2024-06-11 | Adic curves: stable reduction, skeletons and metric structure | Katharina Hübner et.al. | 2406.07414v1 | null |
2024-06-11 | Private Geometric Median | Mahdi Haghifam et.al. | 2406.07407v1 | null |
2024-06-11 | Analytical Delta-V Approximation for Nonlinear Programming of Multi-target Rendezvous and Flyby Trajectories | An-Yi Huang et.al. | 2406.07341v1 | null |
2024-06-11 | Searching for gravitational waves from stellar-mass binary black holes early inspiral | Xue-Ting Zhang et.al. | 2406.07336v1 | null |
2024-06-11 | Maximum number of points on an intersection of a cubic threefold and a non-degenerate Hermitian threefold | Mrinmoy Datta et.al. | 2406.07326v1 | null |
2024-06-11 | Lyapunov equations: a (fixed) point of view | Richard Pates et.al. | 2406.07324v1 | null |
2024-06-11 | Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs | Kamil Jeziorek et.al. | 2406.07318v1 | null |
2024-06-11 | Morse Index Stability for the Ginzburg-Landau Approximation | Francesca Da Lio et.al. | 2406.07317v1 | null |
2024-06-11 | Sum the Probabilities to | Zakaria Derbazi et.al. | 2406.07283v1 | null |
2024-06-11 | Coupled-channel | Jozef J. Dudek et.al. | 2406.07261v1 | null |
2024-06-11 | Even dimensional Fermat cubics are rational over any field | Alex Massarenti et.al. | 2406.07223v1 | null |
2024-06-11 | A Multi-step Approach for Minimizing Risk in Decentralized Exchanges | Daniele Maria Di Nosse et.al. | 2406.07200v2 | null |
2024-06-11 | TernaryLLM: Ternarized Large Language Model | Tianqi Chen et.al. | 2406.07177v1 | null |
2024-06-11 | Ultrametric-preserving functions as monoid endomorphisms | Oleksiy Dovgoshey et.al. | 2406.07166v2 | null |
2024-06-11 | ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators | Jun Yin et.al. | 2406.07161v1 | null |
2024-06-11 | Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO | Ali Elkeshawy et.al. | 2406.07160v1 | null |
2024-06-11 | A portrait of the rotation of Ultra-Cool Dwarfs revealed by TESS | D. O. Fontinele et.al. | 2406.07154v1 | null |
2024-06-11 | High-performance in-vacuum optical system for quantum optics experiments in a Penning-trap | Joaquín Berrocal et.al. | 2406.07152v1 | null |
2024-06-11 | Partial yet definite emergence of the Kardar-Parisi-Zhang class in isotropic spin chains | Kazumasa A. Takeuchi et.al. | 2406.07150v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics | Reuben R. W. Wang et.al. | 2406.07519v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid | Naser Souri et.al. | 2406.07503v1 | null |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale | William Giarè et.al. | 2406.07493v1 | null |
2024-06-11 | Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction | Bekir Z. Demiray et.al. | 2406.07484v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | The end of multiple choice tests: using AI to enhance assessment | Michael Klymkowsky et.al. | 2406.07481v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Hamed Babaei Giglou et.al. | 2406.07257v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-10 | Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints | T. Tony Cai et.al. | 2406.06755v1 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | Optimisation of federated learning settings under statistical heterogeneity variations | Basem Suleiman et.al. | 2406.06340v1 | null |
2024-06-10 | Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning | Xiaoting Lyu et.al. | 2406.06207v1 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202v1 | null |
2024-06-10 | Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Ahmed Elbakary et.al. | 2406.06655v1 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128v1 | null |
2024-06-09 | Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" | Mahtab Talaei et.al. | 2406.05858v1 | null |
2024-06-08 | Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey | Shinu M. Rajagopal et.al. | 2406.05517v1 | null |
2024-06-08 | PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System | Wei Yuan et.al. | 2406.05387v1 | null |
2024-06-07 | Federated LoRA with Sparse Communication | Kevin Kuo et.al. | 2406.05233v1 | null |
2024-06-07 | The Russian Legislative Corpus | Denis Saveliev et.al. | 2406.04855v1 | link |
2024-06-07 | FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye et.al. | 2406.04845v1 | link |
2024-06-07 | When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu et.al. | 2406.04743v1 | null |
2024-06-07 | Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems | Zhen Cai et.al. | 2406.04702v1 | null |
2024-06-07 | Federated Representation Learning in the Under-Parameterized Regime | Renpu Liu et.al. | 2406.04596v3 | null |
2024-06-06 | Data Measurements for Decentralized Data Markets | Charles Lu et.al. | 2406.04257v1 | null |
2024-06-06 | R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients | Tamer Ahmed Eltaras et.al. | 2406.04227v1 | null |
2024-06-06 | Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning | Xuhan Zuo et.al. | 2406.04076v1 | null |
2024-06-06 | Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation | Honglei Zhang et.al. | 2406.03933v1 | link |
2024-06-06 | 1-D CNN-Based Online Signature Verification with Federated Learning | Lingfeng Zhang et.al. | 2406.06597v1 | null |
2024-06-06 | Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Anna Scaglione et.al. | 2406.03750v1 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611v1 | link |
2024-06-05 | Fantastyc: Blockchain-based Federated Learning Made Secure and Practical | William Boitier et.al. | 2406.03608v1 | null |
2024-06-05 | Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning | Saber Malekmohammadi et.al. | 2406.03519v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Should XAI Nudge Human Decisions with Explanation Biasing? | Yosuke Fukuchi et.al. | 2406.07323v1 | null |
2024-06-11 | Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Hamed Babaei Giglou et.al. | 2406.07257v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-11 | A Synthetic Dataset for Personal Attribute Inference | Hanna Yukhymenko et.al. | 2406.07217v1 | null |
2024-06-11 | MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance | X. Wang et.al. | 2406.07209v1 | link |
2024-06-11 | Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation | Diwei Sheng et.al. | 2406.07202v1 | null |
2024-06-11 | Unlocking the Potential of the Metaverse for Innovative and Immersive Digital Care | Fatemeh Ebrahimzadeh et.al. | 2406.07114v1 | null |
2024-06-11 | A Neck Orthosis with Multi-Directional Variable Stiffness for Persons with Dropped Head Syndrome | Santiago Price Torrendell et.al. | 2406.07074v1 | null |
2024-06-11 | pVACview: an interactive visualization tool for efficient neoantigen prioritization and selection | Huiming Xia et.al. | 2406.06985v1 | null |
2024-06-11 | Non-autoregressive Personalized Bundle Generation | Wenchuan Yang et.al. | 2406.06925v1 | null |
2024-06-11 | Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots | Xiang Zhi Tan et.al. | 2406.06904v1 | null |
2024-06-10 | Personalized Binomial DAGs Learning with Network Structured Covariates | Boxin Zhao et.al. | 2406.06829v1 | null |
2024-06-10 | Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints | T. Tony Cai et.al. | 2406.06755v1 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749v1 | null |
2024-06-10 | Video-based Exercise Classification and Activated Muscle Group Prediction with Hybrid X3D-SlowFast Network | Manvik Pasula et.al. | 2406.06703v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | Towards a Personal Health Large Language Model | Justin Cosentino et.al. | 2406.06474v1 | null |
2024-06-10 | Transforming Wearable Data into Health Insights using Large Language Model Agents | Mike A. Merrill et.al. | 2406.06464v2 | null |
2024-06-10 | Estimating Heterogeneous Treatment Effects by Combining Weak Instruments and Observational Data | Miruna Oprescu et.al. | 2406.06452v1 | link |
2024-06-10 | Biomarker-Guided Adaptive Enrichment Design with Threshold Detection for Clinical Trials with Time-to-Event Outcome | Kaiyuan Hua et.al. | 2406.06426v1 | null |
2024-06-10 | Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models | Marek Wodzinski et.al. | 2406.06372v1 | null |
2024-06-10 | Optimisation of federated learning settings under statistical heterogeneity variations | Basem Suleiman et.al. | 2406.06340v1 | null |
2024-06-10 | Human Gaze and Head Rotation during Navigation, Exploration and Object Manipulation in Shared Environments with Robots | Tim Schreiter et.al. | 2406.06300v1 | null |
2024-06-10 | Tuning-Free Visual Customization via View Iterative Self-Attention Control | Xiaojie Li et.al. | 2406.06258v2 | null |
2024-06-10 | Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning | Xiaoting Lyu et.al. | 2406.06207v1 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202v1 | null |
2024-06-10 | Link Prediction in Bipartite Networks | Şükrü Demir İnan Özer et.al. | 2406.06658v1 | null |
2024-06-10 | Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Ahmed Elbakary et.al. | 2406.06655v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Hamed Babaei Giglou et.al. | 2406.07257v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-10 | Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints | T. Tony Cai et.al. | 2406.06755v1 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | Optimisation of federated learning settings under statistical heterogeneity variations | Basem Suleiman et.al. | 2406.06340v1 | null |
2024-06-10 | Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning | Xiaoting Lyu et.al. | 2406.06207v1 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202v1 | null |
2024-06-10 | Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Ahmed Elbakary et.al. | 2406.06655v1 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128v1 | null |
2024-06-09 | Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" | Mahtab Talaei et.al. | 2406.05858v1 | null |
2024-06-08 | Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey | Shinu M. Rajagopal et.al. | 2406.05517v1 | null |
2024-06-08 | PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System | Wei Yuan et.al. | 2406.05387v1 | null |
2024-06-07 | Federated LoRA with Sparse Communication | Kevin Kuo et.al. | 2406.05233v1 | null |
2024-06-07 | The Russian Legislative Corpus | Denis Saveliev et.al. | 2406.04855v1 | link |
2024-06-07 | FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye et.al. | 2406.04845v1 | link |
2024-06-07 | When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu et.al. | 2406.04743v1 | null |
2024-06-07 | Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems | Zhen Cai et.al. | 2406.04702v1 | null |
2024-06-07 | Federated Representation Learning in the Under-Parameterized Regime | Renpu Liu et.al. | 2406.04596v3 | null |
2024-06-06 | Data Measurements for Decentralized Data Markets | Charles Lu et.al. | 2406.04257v1 | null |
2024-06-06 | R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients | Tamer Ahmed Eltaras et.al. | 2406.04227v1 | null |
2024-06-06 | Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning | Xuhan Zuo et.al. | 2406.04076v1 | null |
2024-06-06 | Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation | Honglei Zhang et.al. | 2406.03933v1 | link |
2024-06-06 | 1-D CNN-Based Online Signature Verification with Federated Learning | Lingfeng Zhang et.al. | 2406.06597v1 | null |
2024-06-06 | Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Anna Scaglione et.al. | 2406.03750v1 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien QuΓ©mΓ©neur et.al. | 2406.03611v1 | link |
2024-06-05 | Fantastyc: Blockchain-based Federated Learning Made Secure and Practical | William Boitier et.al. | 2406.03608v1 | null |
2024-06-05 | Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning | Saber Malekmohammadi et.al. | 2406.03519v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Hamed Babaei Giglou et.al. | 2406.07257v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-10 | Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints | T. Tony Cai et.al. | 2406.06755v1 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | Optimisation of federated learning settings under statistical heterogeneity variations | Basem Suleiman et.al. | 2406.06340v1 | null |
2024-06-10 | Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning | Xiaoting Lyu et.al. | 2406.06207v1 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202v1 | null |
2024-06-10 | Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Ahmed Elbakary et.al. | 2406.06655v1 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128v1 | null |
2024-06-09 | Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" | Mahtab Talaei et.al. | 2406.05858v1 | null |
2024-06-08 | Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey | Shinu M. Rajagopal et.al. | 2406.05517v1 | null |
2024-06-08 | PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System | Wei Yuan et.al. | 2406.05387v1 | null |
2024-06-07 | Federated LoRA with Sparse Communication | Kevin Kuo et.al. | 2406.05233v1 | null |
2024-06-07 | The Russian Legislative Corpus | Denis Saveliev et.al. | 2406.04855v1 | link |
2024-06-07 | FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye et.al. | 2406.04845v1 | link |
2024-06-07 | When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu et.al. | 2406.04743v1 | null |
2024-06-07 | Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems | Zhen Cai et.al. | 2406.04702v1 | null |
2024-06-07 | Federated Representation Learning in the Under-Parameterized Regime | Renpu Liu et.al. | 2406.04596v3 | null |
2024-06-06 | Data Measurements for Decentralized Data Markets | Charles Lu et.al. | 2406.04257v1 | null |
2024-06-06 | R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients | Tamer Ahmed Eltaras et.al. | 2406.04227v1 | null |
2024-06-06 | Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning | Xuhan Zuo et.al. | 2406.04076v1 | null |
2024-06-06 | Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation | Honglei Zhang et.al. | 2406.03933v1 | link |
2024-06-06 | 1-D CNN-Based Online Signature Verification with Federated Learning | Lingfeng Zhang et.al. | 2406.06597v1 | null |
2024-06-06 | Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Anna Scaglione et.al. | 2406.03750v1 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien QuΓ©mΓ©neur et.al. | 2406.03611v1 | link |
2024-06-05 | Fantastyc: Blockchain-based Federated Learning Made Secure and Practical | William Boitier et.al. | 2406.03608v1 | null |
2024-06-05 | Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning | Saber Malekmohammadi et.al. | 2406.03519v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Hamed Babaei Giglou et.al. | 2406.07257v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-10 | Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints | T. Tony Cai et.al. | 2406.06755v1 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | Optimisation of federated learning settings under statistical heterogeneity variations | Basem Suleiman et.al. | 2406.06340v1 | null |
2024-06-10 | Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning | Xiaoting Lyu et.al. | 2406.06207v1 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202v1 | null |
2024-06-10 | Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Ahmed Elbakary et.al. | 2406.06655v1 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128v1 | null |
2024-06-09 | Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" | Mahtab Talaei et.al. | 2406.05858v1 | null |
2024-06-08 | Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey | Shinu M. Rajagopal et.al. | 2406.05517v1 | null |
2024-06-08 | PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System | Wei Yuan et.al. | 2406.05387v1 | null |
2024-06-07 | Federated LoRA with Sparse Communication | Kevin Kuo et.al. | 2406.05233v1 | null |
2024-06-07 | The Russian Legislative Corpus | Denis Saveliev et.al. | 2406.04855v1 | link |
2024-06-07 | FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye et.al. | 2406.04845v1 | link |
2024-06-07 | When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu et.al. | 2406.04743v1 | null |
2024-06-07 | Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems | Zhen Cai et.al. | 2406.04702v1 | null |
2024-06-07 | Federated Representation Learning in the Under-Parameterized Regime | Renpu Liu et.al. | 2406.04596v3 | null |
2024-06-06 | Data Measurements for Decentralized Data Markets | Charles Lu et.al. | 2406.04257v1 | null |
2024-06-06 | R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients | Tamer Ahmed Eltaras et.al. | 2406.04227v1 | null |
2024-06-06 | Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning | Xuhan Zuo et.al. | 2406.04076v1 | null |
2024-06-06 | Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation | Honglei Zhang et.al. | 2406.03933v1 | link |
2024-06-06 | 1-D CNN-Based Online Signature Verification with Federated Learning | Lingfeng Zhang et.al. | 2406.06597v1 | null |
2024-06-06 | Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Anna Scaglione et.al. | 2406.03750v1 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien QuΓ©mΓ©neur et.al. | 2406.03611v1 | link |
2024-06-05 | Fantastyc: Blockchain-based Federated Learning Made Secure and Practical | William Boitier et.al. | 2406.03608v1 | null |
2024-06-05 | Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning | Saber Malekmohammadi et.al. | 2406.03519v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Hamed Babaei Giglou et.al. | 2406.07257v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-10 | Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints | T. Tony Cai et.al. | 2406.06755v1 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | Optimisation of federated learning settings under statistical heterogeneity variations | Basem Suleiman et.al. | 2406.06340v1 | null |
2024-06-10 | Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning | Xiaoting Lyu et.al. | 2406.06207v1 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202v1 | null |
2024-06-10 | Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Ahmed Elbakary et.al. | 2406.06655v1 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128v1 | null |
2024-06-09 | Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" | Mahtab Talaei et.al. | 2406.05858v1 | null |
2024-06-08 | Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey | Shinu M. Rajagopal et.al. | 2406.05517v1 | null |
2024-06-08 | PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System | Wei Yuan et.al. | 2406.05387v1 | null |
2024-06-07 | Federated LoRA with Sparse Communication | Kevin Kuo et.al. | 2406.05233v1 | null |
2024-06-07 | The Russian Legislative Corpus | Denis Saveliev et.al. | 2406.04855v1 | link |
2024-06-07 | FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye et.al. | 2406.04845v1 | link |
2024-06-07 | When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu et.al. | 2406.04743v1 | null |
2024-06-07 | Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems | Zhen Cai et.al. | 2406.04702v1 | null |
2024-06-07 | Federated Representation Learning in the Under-Parameterized Regime | Renpu Liu et.al. | 2406.04596v3 | null |
2024-06-06 | Data Measurements for Decentralized Data Markets | Charles Lu et.al. | 2406.04257v1 | null |
2024-06-06 | R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients | Tamer Ahmed Eltaras et.al. | 2406.04227v1 | null |
2024-06-06 | Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning | Xuhan Zuo et.al. | 2406.04076v1 | null |
2024-06-06 | Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation | Honglei Zhang et.al. | 2406.03933v1 | link |
2024-06-06 | 1-D CNN-Based Online Signature Verification with Federated Learning | Lingfeng Zhang et.al. | 2406.06597v1 | null |
2024-06-06 | Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Anna Scaglione et.al. | 2406.03750v1 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611v1 | link |
2024-06-05 | Fantastyc: Blockchain-based Federated Learning Made Secure and Practical | William Boitier et.al. | 2406.03608v1 | null |
2024-06-05 | Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning | Saber Malekmohammadi et.al. | 2406.03519v1 | link |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Hamed Babaei Giglou et.al. | 2406.07257v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-10 | Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints | T. Tony Cai et.al. | 2406.06755v1 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | Optimisation of federated learning settings under statistical heterogeneity variations | Basem Suleiman et.al. | 2406.06340v1 | null |
2024-06-10 | Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning | Xiaoting Lyu et.al. | 2406.06207v1 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202v1 | null |
2024-06-10 | Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Ahmed Elbakary et.al. | 2406.06655v1 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128v1 | null |
2024-06-09 | Comments on "Federated Learning with Differential Privacy: Algorithms and Performance Analysis" | Mahtab Talaei et.al. | 2406.05858v1 | null |
2024-06-08 | Blockchain Integrated Federated Learning in Edge-Fog-Cloud Systems for IoT based Healthcare Applications A Survey | Shinu M. Rajagopal et.al. | 2406.05517v1 | null |
2024-06-08 | PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System | Wei Yuan et.al. | 2406.05387v1 | null |
2024-06-07 | Federated LoRA with Sparse Communication | Kevin Kuo et.al. | 2406.05233v1 | null |
2024-06-07 | The Russian Legislative Corpus | Denis Saveliev et.al. | 2406.04855v1 | link |
2024-06-07 | FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye et.al. | 2406.04845v1 | link |
2024-06-07 | When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu et.al. | 2406.04743v1 | null |
2024-06-07 | Marking the Pace: A Blockchain-Enhanced Privacy-Traceable Strategy for Federated Recommender Systems | Zhen Cai et.al. | 2406.04702v1 | null |
2024-06-07 | Federated Representation Learning in the Under-Parameterized Regime | Renpu Liu et.al. | 2406.04596v3 | null |
2024-06-06 | Data Measurements for Decentralized Data Markets | Charles Lu et.al. | 2406.04257v1 | null |
2024-06-06 | R-CONV: An Analytical Approach for Efficient Data Reconstruction via Convolutional Gradients | Tamer Ahmed Eltaras et.al. | 2406.04227v1 | null |
2024-06-06 | Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning | Xuhan Zuo et.al. | 2406.04076v1 | null |
2024-06-06 | Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation | Honglei Zhang et.al. | 2406.03933v1 | link |
2024-06-06 | 1-D CNN-Based Online Signature Verification with Federated Learning | Lingfeng Zhang et.al. | 2406.06597v1 | null |
2024-06-06 | Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Anna Scaglione et.al. | 2406.03750v1 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611v1 | link |
2024-06-05 | Fantastyc: Blockchain-based Federated Learning Made Secure and Practical | William Boitier et.al. | 2406.03608v1 | null |
2024-06-05 | Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning | Saber Malekmohammadi et.al. | 2406.03519v1 | link |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics | Reuben R. W. Wang et.al. | 2406.07519v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid | Naser Souri et.al. | 2406.07503v1 | null |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale | William Giarè et.al. | 2406.07493v1 | null |
2024-06-11 | Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction | Bekir Z. Demiray et.al. | 2406.07484v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
2024-06-11 | The end of multiple choice tests: using AI to enhance assessment | Michael Klymkowsky et.al. | 2406.07481v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Impact of the nuclear equation of state on the formation of twin stars | Nai-Bo Zhang et.al. | 2406.07396v1 | null |
2024-06-11 | Fast Adaptive Meta-Heuristic for Large-Scale Facility Location Problem | Bahram Alidaee et.al. | 2406.07382v1 | null |
2024-06-11 | A generic and robust quantum agent inspired by deep meta-reinforcement learning | Zibo Miao et.al. | 2406.07225v1 | null |
2024-06-11 | Agnostic Sharpness-Aware Minimization | Van-Anh Nguyen et.al. | 2406.07107v2 | null |
2024-06-11 | Meta-Backscatter: A New ISAC Paradigm for Battery-Free Internet of Things | Xu Liu et.al. | 2406.07077v1 | null |
2024-06-11 | HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation | Wen Luo et.al. | 2406.07070v1 | null |
2024-06-11 | Fairness-Aware Meta-Learning via Nash Bargaining | Yi Zeng et.al. | 2406.07029v1 | null |
2024-06-11 | Ollabench: Evaluating LLMs' Reasoning for Human-centric Interdependent Cybersecurity | Tam n. Nguyen et.al. | 2406.06863v1 | link |
2024-06-10 | Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness | Dingrong Wang et.al. | 2406.06792v1 | link |
2024-06-10 | Meta Learning Text-to-Speech Synthesis in over 7000 Languages | Florian Lux et.al. | 2406.06403v1 | link |
2024-06-10 | Characteristics and Energy Flux Distributions of Decayless Transverse Oscillations Depending on Coronal Regions | Daye Lim et.al. | 2406.06368v1 | null |
2024-06-10 | Data Augmentation in Earth Observation: A Diffusion Model Approach | Tiago Sousa et.al. | 2406.06218v1 | null |
2024-06-10 | Causality-inspired Latent Feature Augmentation for Single Domain Generalization | Jian Xu et.al. | 2406.05980v1 | null |
2024-06-10 | Data Caching for Enterprise-Grade Petabyte-Scale OLAP | Chunxu Tang et.al. | 2406.05962v1 | null |
2024-06-09 | Async Learned User Embeddings for Ads Delivery Optimization | Mingwei Tang et.al. | 2406.05898v1 | null |
2024-06-09 | Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach | Georgios Tsoumplekas et.al. | 2406.05887v1 | null |
2024-06-08 | Synergizing Deep Learning and Phase Change Materials for Four-state Broadband Multifunctional Metasurfaces in the Visible Range | Md. Ehsanul Karim et.al. | 2406.05519v1 | null |
2024-06-08 | Gradient-based algorithms for multi-objective bi-level optimization | Xinmin Yang et.al. | 2406.05455v1 | null |
2024-06-08 | A Survey of Meta-features Used for Automated Selection of Algorithms for Black-box Single-objective Continuous Optimization | Gjorgjina Cenikj et.al. | 2406.06629v1 | null |
2024-06-08 | Large Language Model Assisted Adversarial Robustness Neural Architecture Search | Rui Zhong et.al. | 2406.05433v1 | link |
2024-06-07 | Massively Multiagent Minigames for Training Generalist Agents | Kyoung Whan Choe et.al. | 2406.05071v1 | link |
2024-06-07 | Scenarios and Approaches for Situated Natural Language Explanations | Pengshuo Qiu et.al. | 2406.05035v1 | null |
2024-06-07 | Unraveling Trace Anomaly of Supradense Matter via Neutron Star Compactness Scaling | Bao-Jun Cai et.al. | 2406.05025v1 | null |
2024-06-07 | Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs | Fan Liu et.al. | 2406.06622v1 | null |
2024-06-07 | Cactus-like Metamaterial Structures for Electromagnetically Induced Transparency at THz frequencies | Savvas Papamakarios et.al. | 2406.04862v1 | null |
2024-06-07 | Black Box Differential Privacy Auditing Using Total Variation Distance | Antti Koskela et.al. | 2406.04827v1 | null |
2024-06-07 | Graph Mining under Data scarcity | Appan Rakaraddi et.al. | 2406.04825v2 | null |
2024-06-07 | Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning | Xuehui Yu et.al. | 2406.04815v1 | link |
2024-06-07 | Cooperative Meta-Learning with Gradient Augmentation | Jongyun Shin et.al. | 2406.04639v1 | link |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics | Reuben R. W. Wang et.al. | 2406.07519v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid | Naser Souri et.al. | 2406.07503v1 | null |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale | William Giarè et.al. | 2406.07493v1 | null |
2024-06-11 | Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction | Bekir Z. Demiray et.al. | 2406.07484v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics | Reuben R. W. Wang et.al. | 2406.07519v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid | Naser Souri et.al. | 2406.07503v1 | null |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale | William Giarè et.al. | 2406.07493v1 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction | Bekir Z. Demiray et.al. | 2406.07484v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention | Mingshuai Liu et.al. | 2406.07498v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks | Ted Edward Holmberg et.al. | 2406.07473v1 | null |
2024-06-11 | Microbiomes Through The Looking Glass | Jacopo Pasqualini et.al. | 2406.07465v1 | null |
2024-06-11 | Reconfigurable Intelligent Surfaces in Dynamic Rich Scattering Environments: BiLSTM-Based Optimization for Accurate User Localization | Anum Umer et.al. | 2406.07463v1 | null |
2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456v1 | link |
2024-06-11 | HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms | Josse Van Delm et.al. | 2406.07453v1 | null |
2024-06-11 | Boosted Conformal Prediction Intervals | Ran Xie et.al. | 2406.07449v1 | null |
2024-06-11 | Metastability in networks of nonlinear stochastic integrate-and-fire neurons | Siddharth Paliwal et.al. | 2406.07445v1 | null |
2024-06-11 | DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting | Yuxuan Shu et.al. | 2406.07438v1 | null |
2024-06-11 | Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration | Shashank Agnihotri et.al. | 2406.07435v1 | null |
2024-06-11 | Matryoshka Representation Learning for Recommendation | Riwei Lai et.al. | 2406.07432v1 | link |
2024-06-11 | GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning | Tonghan Wang et.al. | 2406.07428v1 | null |
2024-06-11 | Graph Reasoning for Explainable Cold Start Recommendation | Jibril Frej et.al. | 2406.07420v1 | null |
2024-06-11 | Average-exact mixed anomalies and compatible phases | Yichen Xu et.al. | 2406.07417v1 | null |
2024-06-11 | Holistic Memory Diversification for Incremental Learning in Growing Graphs | Ziyue Qiao et.al. | 2406.07413v1 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399v1 | null |
2024-06-11 | Holographic reconstruction of black hole spacetime: machine learning and entanglement entropy | Byoungjoon Ahn et.al. | 2406.07395v1 | null |
2024-06-11 | DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling | Sixian Wang et.al. | 2406.07390v1 | null |
2024-06-11 | Disrupting Bipartite Trading Networks: Matching for Revenue Maximization | Luca D'Amico-Wong et.al. | 2406.07385v1 | null |
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | COLoRIS: Localization-agnostic Smart Surfaces Enabling Opportunistic ISAC in 6G Networks | Guillermo Encinas-Lago et.al. | 2406.07377v1 | null |
2024-06-11 | Decoding planetary surfaces by counting cracks | S. Silver et.al. | 2406.07376v1 | null |
2024-06-11 | Improving the realism of robotic surgery simulation through injection of learning-based estimated errors | Juan Antonio Barragan et.al. | 2406.07375v1 | null |
2024-06-11 | Deep Implicit Optimization for Robust and Flexible Image Registration | Rohit Jena et.al. | 2406.07361v1 | null |
2024-06-11 | Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities | Delfina Sol Martinez Pandiani et.al. | 2406.07353v1 | null |
2024-06-11 | Stochastic Analysis of Homogeneous Wireless Networks Assisted by Intelligent Reflecting Surfaces | Ali H. Abdollahi Bafghi et.al. | 2406.07352v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Novel Optimized Designs of Modulo | Bhaskar Gaur et.al. | 2406.07486v1 | null |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480v1 | null |
2024-06-11 | VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs | Zesen Cheng et.al. | 2406.07476v1 | link |
2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456v1 | link |
2024-06-11 | An Optimism-based Approach to Online Evaluation of Generative Models | Xiaoyan Hu et.al. | 2406.07451v1 | null |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450v1 | link |
2024-06-11 | Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration | Shashank Agnihotri et.al. | 2406.07435v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | Learning Domain-Invariant Features for Out-of-Context News Detection | Yimeng Gu et.al. | 2406.07430v1 | null |
2024-06-11 | DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses | Abdurrahim Yilmaz et.al. | 2406.07426v1 | null |
2024-06-11 | Persistent currents in mesoscopic spin-orbit coupled rings due to an applied Zeeman field | Bijay Kumar Sahoo et.al. | 2406.07405v1 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399v1 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling | Sixian Wang et.al. | 2406.07390v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | Multimodal Belief Prediction | John Murzaku et.al. | 2406.07466v1 | null |
2024-06-11 | World Models with Hints of Large Language Models for Goal Achieving | Zeyuan Liu et.al. | 2406.07381v1 | null |
2024-06-11 | Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities | Delfina Sol Martinez Pandiani et.al. | 2406.07353v1 | null |
2024-06-11 | Transferring Knowledge from Large Foundation Models to Small Downstream Models | Shikai Qiu et.al. | 2406.07337v1 | null |
2024-06-11 | MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting | Zhiqi Ai et.al. | 2406.07310v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-11 | Open-World Human-Object Interaction Detection via Multi-modal Prompts | Jie Yang et.al. | 2406.07221v1 | null |
2024-06-11 | Target Speech Diarization with Multimodal Prompts | Yidi Jiang et.al. | 2406.07198v1 | null |
2024-06-11 | RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker | Yunfeng Li et.al. | 2406.07189v1 | link |
2024-06-11 | Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology | Huahui Yi et.al. | 2406.07078v1 | link |
2024-06-11 | Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study | Yichi Zhang et.al. | 2406.07057v1 | null |
2024-06-11 | Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey | Ping Liu et.al. | 2406.06965v1 | null |
2024-06-11 | Missingness-resilient Video-enhanced Multimodal Disfluency Detection | Payal Mohapatra et.al. | 2406.06964v1 | null |
2024-06-11 | Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems | Mohammed Elhenawy et.al. | 2406.06865v1 | null |
2024-06-10 | FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors | Jason Wu et.al. | 2406.06796v1 | link |
2024-06-10 | BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification | June-Woo Kim et.al. | 2406.06786v1 | null |
2024-06-10 | MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension | Khiem Le et.al. | 2406.06777v1 | null |
2024-06-10 | Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Louis Blankemeier et.al. | 2406.06512v1 | null |
2024-06-10 | AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction | Zhen Xing et.al. | 2406.06465v1 | null |
2024-06-10 | VCR: Visual Caption Restoration | Tianyu Zhang et.al. | 2406.06462v1 | link |
2024-06-10 | Margin-aware Preference Optimization for Aligning Diffusion Models without Reference | Jiwoo Hong et.al. | 2406.06424v1 | null |
2024-06-10 | STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics | Jiawen Chen et.al. | 2406.06393v1 | link |
2024-06-10 | Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization | Yi Gu et.al. | 2406.06382v1 | link |
2024-06-10 | ASTRA: Aligning Speech and Text Representations for Asr without Sampling | Neeraj Gaur et.al. | 2406.06664v1 | null |
2024-06-10 | MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing | Yu-Fen Huang et.al. | 2406.06375v1 | link |
2024-06-10 | A Guide to Stochastic Optimisation for Large-Scale Inverse Problems | Matthias J. Ehrhardt et.al. | 2406.06342v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale | William Giarè et.al. | 2406.07493v1 | null |
2024-06-11 | Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei et.al. | 2406.07492v1 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | PITCH: Productivity and Mental Well-being Coaching through Daily Conversational Interaction | Adnan Abbas et.al. | 2406.07485v1 | null |
2024-06-11 | The end of multiple choice tests: using AI to enhance assessment | Michael Klymkowsky et.al. | 2406.07481v1 | null |
2024-06-11 | VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs | Zesen Cheng et.al. | 2406.07476v1 | link |
2024-06-11 | Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior | Anming Gu et.al. | 2406.07475v1 | null |
2024-06-11 | Estimating the Hallucination Rate of Generative AI | Andrew Jesson et.al. | 2406.07457v1 | null |
2024-06-11 | Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis | Qining Zhang et.al. | 2406.07455v1 | null |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450v1 | link |
2024-06-11 | Constructions of Turán systems that are tight up to a multiplicative constant | Oleg Pikhurko et.al. | 2406.07443v1 | null |
2024-06-11 | Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling | Denis Blessing et.al. | 2406.07423v1 | link |
2024-06-11 | Holistic Memory Diversification for Incremental Learning in Growing Graphs | Ziyue Qiao et.al. | 2406.07413v1 | null |
2024-06-11 | Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control | William Murphy et.al. | 2406.07400v1 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399v1 | null |
2024-06-11 | Limited Out-of-Context Knowledge Reasoning in Large Language Models | Peng Hu et.al. | 2406.07393v1 | null |
2024-06-11 | AI.vs.Clinician: Unveiling Intricate Interactions Between AI and Clinicians through an Open-Access Database | Wanling Gao et.al. | 2406.07362v1 | null |
2024-06-11 | Deep Implicit Optimization for Robust and Flexible Image Registration | Rohit Jena et.al. | 2406.07361v1 | null |
2024-06-11 | PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering | Longlong Lin et.al. | 2406.07357v1 | null |
2024-06-11 | DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering | Zijian Hei et.al. | 2406.07348v2 | null |
2024-06-11 | Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling | Constantin Waubert de Puiseau et.al. | 2406.07325v1 | null |
2024-06-11 | The magic of entangled top quarks | Chris D. White et.al. | 2406.07321v2 | null |
2024-06-11 | Rethinking the impact of noisy labels in graph classification: A utility and privacy perspective | De Li et.al. | 2406.07314v1 | null |
2024-06-11 | BertaQA: How Much Do Language Models Know About Local Culture? | Julen Etxaniz et.al. | 2406.07302v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | On the potential of probing the neutron star composition in accreting X-ray binaries | Kaiser Arf et.al. | 2406.07534v1 | null |
2024-06-11 | Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? | Ioannis D. Gialamas et.al. | 2406.07533v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Interacting-bath dynamical embedding for capturing non-local electron correlation in solids | Jiachen Li et.al. | 2406.07531v1 | null |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity | Ozgur Akarsu et.al. | 2406.07526v1 | null |
2024-06-11 | Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions | Haibo Wang et.al. | 2406.07525v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | Multimodal Belief Prediction | John Murzaku et.al. | 2406.07466v1 | null |
2024-06-11 | World Models with Hints of Large Language Models for Goal Achieving | Zeyuan Liu et.al. | 2406.07381v1 | null |
2024-06-11 | Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities | Delfina Sol Martinez Pandiani et.al. | 2406.07353v1 | null |
2024-06-11 | Transferring Knowledge from Large Foundation Models to Small Downstream Models | Shikai Qiu et.al. | 2406.07337v1 | null |
2024-06-11 | MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting | Zhiqi Ai et.al. | 2406.07310v1 | null |
2024-06-11 | Which Country Is This? Automatic Country Ranking of Street View Photos | Tim Menzner et.al. | 2406.07227v1 | null |
2024-06-11 | Open-World Human-Object Interaction Detection via Multi-modal Prompts | Jie Yang et.al. | 2406.07221v1 | null |
2024-06-11 | Target Speech Diarization with Multimodal Prompts | Yidi Jiang et.al. | 2406.07198v1 | null |
2024-06-11 | RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker | Yunfeng Li et.al. | 2406.07189v1 | link |
2024-06-11 | Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology | Huahui Yi et.al. | 2406.07078v1 | link |
2024-06-11 | Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study | Yichi Zhang et.al. | 2406.07057v1 | null |
2024-06-11 | Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey | Ping Liu et.al. | 2406.06965v1 | null |
2024-06-11 | Missingness-resilient Video-enhanced Multimodal Disfluency Detection | Payal Mohapatra et.al. | 2406.06964v1 | null |
2024-06-11 | Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems | Mohammed Elhenawy et.al. | 2406.06865v1 | null |
2024-06-10 | FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors | Jason Wu et.al. | 2406.06796v1 | link |
2024-06-10 | BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification | June-Woo Kim et.al. | 2406.06786v1 | null |
2024-06-10 | MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension | Khiem Le et.al. | 2406.06777v1 | null |
2024-06-10 | Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Louis Blankemeier et.al. | 2406.06512v1 | null |
2024-06-10 | AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction | Zhen Xing et.al. | 2406.06465v1 | null |
2024-06-10 | VCR: Visual Caption Restoration | Tianyu Zhang et.al. | 2406.06462v1 | link |
2024-06-10 | Margin-aware Preference Optimization for Aligning Diffusion Models without Reference | Jiwoo Hong et.al. | 2406.06424v1 | null |
2024-06-10 | STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics | Jiawen Chen et.al. | 2406.06393v1 | link |
2024-06-10 | Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization | Yi Gu et.al. | 2406.06382v1 | link |
2024-06-10 | ASTRA: Aligning Speech and Text Representations for Asr without Sampling | Neeraj Gaur et.al. | 2406.06664v1 | null |
2024-06-10 | MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing | Yu-Fen Huang et.al. | 2406.06375v1 | link |
2024-06-10 | A Guide to Stochastic Optimisation for Large-Scale Inverse Problems | Matthias J. Ehrhardt et.al. | 2406.06342v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction | Bekir Z. Demiray et.al. | 2406.07484v1 | null |
2024-06-11 | Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis | Qining Zhang et.al. | 2406.07455v1 | null |
2024-06-11 | Textual Similarity as a Key Metric in Machine Translation Quality Estimation | Kun Sun et.al. | 2406.07440v1 | null |
2024-06-11 | DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting | Yuxuan Shu et.al. | 2406.07438v1 | null |
2024-06-11 | Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration | Shashank Agnihotri et.al. | 2406.07435v1 | null |
2024-06-11 | Making 'syscall' a Privilege not a Right | Fangfei Yang et.al. | 2406.07429v1 | null |
2024-06-11 | GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning | Tonghan Wang et.al. | 2406.07428v1 | null |
2024-06-11 | Entropy, slicing problem and functional Mahler's conjecture | Matthieu Fradelizi et.al. | 2406.07406v1 | null |
2024-06-11 | Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy | Xiaohan Huang et.al. | 2406.07404v1 | null |
2024-06-11 | A Survey on Recent Random Walk-based Methods for Embedding Knowledge Graphs | Elika Bozorgi et.al. | 2406.07402v1 | null |
2024-06-11 | Fast and accurate evaluation of Biot-Savart integrals over spatial curves | Juan Ignacio Polanco et.al. | 2406.07366v1 | null |
2024-06-11 | Chebyshev Approximated Variational Coupled Cluster for Quantum Computing | Luca Erhart et.al. | 2406.07364v1 | null |
2024-06-11 | Deep Implicit Optimization for Robust and Flexible Image Registration | Rohit Jena et.al. | 2406.07361v1 | null |
2024-06-11 | Global-Regularized Neighborhood Regression for Efficient Zero-Shot Texture Anomaly Detection | Haiming Yao et.al. | 2406.07333v1 | null |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332v1 | null |
2024-06-11 | Instruct Large Language Models to Drive like Humans | Ruijun Zhang et.al. | 2406.07296v1 | link |
2024-06-11 | | Masatoshi Kitagawa et.al. | 2406.07279v1 | null |
2024-06-11 | Are Protein Language Models Compute Optimal? | Yaiza Serrano et.al. | 2406.07249v1 | null |
2024-06-11 | Dynamical Mean-Field Theory of Self-Attention Neural Networks | Ángel Poc-López et.al. | 2406.07247v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | ReduceFormer: Attention with Tensor Reduction by Summation | John Yang et.al. | 2406.07488v1 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction | Bekir Z. Demiray et.al. | 2406.07484v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis | Qining Zhang et.al. | 2406.07455v1 | null |
2024-06-11 | Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization | Weiliang Zhang et.al. | 2406.07418v1 | null |
2024-06-11 | Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy | Xiaohan Huang et.al. | 2406.07404v1 | null |
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | World Models with Hints of Large Language Models for Goal Achieving | Zeyuan Liu et.al. | 2406.07381v1 | null |
2024-06-11 | EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning | Yijun Hao et.al. | 2406.07342v1 | null |
2024-06-11 | Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling | Constantin Waubert de Puiseau et.al. | 2406.07325v1 | null |
2024-06-11 | Multi-objective Reinforcement learning from AI Feedback | Marcus Williams et.al. | 2406.07295v2 | null |
2024-06-11 | Hybrid Reinforcement Learning from Offline Observation Alone | Yuda Song et.al. | 2406.07253v1 | null |
2024-06-11 | A generic and robust quantum agent inspired by deep meta-reinforcement learning | Zibo Miao et.al. | 2406.07225v1 | null |
2024-06-11 | Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning | Zhiyu Shao et.al. | 2406.07213v1 | link |
2024-06-11 | Machine learning potential for the Cu-W system | Manura Liyanage et.al. | 2406.07157v1 | null |
2024-06-11 | Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models | Som Sagar et.al. | 2406.07145v1 | null |
2024-06-11 | CHARME: A chain-based reinforcement learning approach for the minor embedding problem | Hoang M. Ngo et.al. | 2406.07124v1 | null |
2024-06-11 | Augmenting Offline RL with Unlabeled Data | Zhao Wang et.al. | 2406.07117v1 | null |
2024-06-11 | Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning | Xuezhi Niu et.al. | 2406.07069v1 | null |
2024-06-11 | Integrating Domain Knowledge for handling Limited Data in Offline RL | Briti Gangopadhyay et.al. | 2406.07041v1 | null |
2024-06-11 | Entropy-Reinforced Planning with Large Language Models for Drug Discovery | Xuefeng Liu et.al. | 2406.07025v1 | null |
2024-06-11 | Delving into ChatGPT usage in academic writing through excess vocabulary | Dmitry Kobak et.al. | 2406.07016v1 | null |
2024-06-11 | DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach | Zhang Liu et.al. | 2406.06986v1 | null |
2024-06-11 | Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback | Chenliang Li et.al. | 2406.06874v1 | null |
2024-06-11 | Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning | Adhyyan Narang et.al. | 2406.06856v1 | null |
2024-06-10 | Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness | Dingrong Wang et.al. | 2406.06792v1 | link |
2024-06-10 | Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation | Michelle Pan et.al. | 2406.06714v1 | null |
2024-06-10 | Verification-Guided Shielding for Deep Reinforcement Learning | Davide Corsi et.al. | 2406.06507v1 | null |
2024-06-10 | Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation | Mohidul Haque Mridul et.al. | 2406.06500v1 | null |
2024-06-10 | Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity | Calarina Muslimani et.al. | 2406.06495v1 | null |
2024-06-10 | Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots | Bahador Beigomi et.al. | 2406.06460v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding | Ming Hu et.al. | 2406.07471v2 | null |
2024-06-11 | Visual Representation Learning with Stochastic Frame Prediction | Huiwon Jang et.al. | 2406.07398v1 | null |
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | Improving the realism of robotic surgery simulation through injection of learning-based estimated errors | Juan Antonio Barragan et.al. | 2406.07375v1 | null |
2024-06-11 | iMESA: Incremental Distributed Optimization for Collaborative Simultaneous Localization and Mapping | Daniel McGann et.al. | 2406.07371v1 | null |
2024-06-11 | Realistic Data Generation for 6D Pose Estimation of Surgical Instruments | Juan Antonio Barragan et.al. | 2406.07328v1 | null |
2024-06-11 | Should XAI Nudge Human Decisions with Explanation Biasing? | Yosuke Fukuchi et.al. | 2406.07323v1 | null |
2024-06-11 | Experimental Modeling of Chiral Active Robots and a Minimal Model of Non-Gaussian Displacements | Yuxuan Zhou et.al. | 2406.07313v1 | null |
2024-06-11 | Instruct Large Language Models to Drive like Humans | Ruijun Zhang et.al. | 2406.07296v1 | link |
2024-06-11 | OTO Planner: An Efficient Only Travelling Once Exploration Planner for Complex and Unknown Environments | Bo Zhou et.al. | 2406.07294v1 | null |
2024-06-11 | 3D Voxel Maps to 2D Occupancy Maps for Efficient Path Planning for Aerial and Ground Robots | Scott Fredriksson et.al. | 2406.07270v1 | null |
2024-06-11 | Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph | Sergey Linok et.al. | 2406.07113v1 | null |
2024-06-11 | A Neck Orthosis with Multi-Directional Variable Stiffness for Persons with Dropped Head Syndrome | Santiago Price Torrendell et.al. | 2406.07074v1 | null |
2024-06-11 | Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning | Xuezhi Niu et.al. | 2406.07069v1 | null |
2024-06-11 | Optimal Gait Design for a Soft Quadruped Robot via Multi-fidelity Bayesian Optimization | Kaige Tan et.al. | 2406.07065v1 | null |
2024-06-11 | GPU-Accelerated Optimization-Based Collision Avoidance | Zeming Wu et.al. | 2406.07048v1 | null |
2024-06-11 | Neural Visibility Field for Uncertainty-Driven Active Mapping | Shangjie Xue et.al. | 2406.06948v1 | null |
2024-06-11 | CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only | Junhee Cho et.al. | 2406.06947v1 | link |
2024-06-11 | Person Transfer in the Field: Examining Real World Sequential Human-Robot Interaction Between Two Robots | Xiang Zhi Tan et.al. | 2406.06904v1 | null |
2024-06-11 | Developing, Analyzing, and Evaluating Vehicular Lane Keeping Algorithms Under Dynamic Lighting and Weather Conditions Using Electric Vehicles | Michael Khalfin et.al. | 2406.06899v1 | null |
2024-06-11 | Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback | Chenliang Li et.al. | 2406.06874v1 | null |
2024-06-10 | HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction | Jikai Wang et.al. | 2406.06843v1 | null |
2024-06-10 | FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors | Jason Wu et.al. | 2406.06796v1 | link |
2024-06-10 | Starling Formation-Flying Optical Experiment: Initial Operations and Flight Results | Justin Kruger et.al. | 2406.06748v1 | null |
2024-06-10 | Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents | Federico Rossi et.al. | 2406.06724v1 | null |
2024-06-10 | Verification-Guided Shielding for Deep Reinforcement Learning | Davide Corsi et.al. | 2406.06507v1 | null |
2024-06-10 | Demonstrating HumanTHOR: A Simulation Platform and Benchmark for Human-Robot Collaboration in a Shared Workspace | Chenxu Wang et.al. | 2406.06498v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? | Ioannis D. Gialamas et.al. | 2406.07533v1 | null |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity | Ozgur Akarsu et.al. | 2406.07526v1 | null |
2024-06-11 | Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions | Haibo Wang et.al. | 2406.07525v1 | null |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics | Reuben R. W. Wang et.al. | 2406.07519v1 | null |
2024-06-11 | The canonical trace of Cohen-Macaulay algebras of codimension 2 | Antonino Ficarra et.al. | 2406.07517v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid | Naser Souri et.al. | 2406.07503v1 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-10 | Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) | Gyubeom Im et.al. | 2406.06427v1 | null |
2024-06-10 | Notes on Various Errors and Jacobian Derivations for SLAM | Gyubeom Im et.al. | 2406.06422v1 | null |
2024-06-10 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374v1 | link |
2024-06-10 | Visual-Inertial SLAM as Simple as A, B, VINS | Nathaniel Merrill et.al. | 2406.05969v1 | null |
2024-06-09 | MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps | Jianhao Zheng et.al. | 2406.05849v1 | null |
2024-06-06 | Open Problem: Active Representation Learning | Nikola Milosevic et.al. | 2406.03845v1 | null |
2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906v1 | link |
2024-06-03 | Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry | Takayuki Kanai et.al. | 2406.00929v1 | null |
2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885v1 | link |
2024-05-30 | Structure Gaussian SLAM with Manhattan World Hypothesis | Shuhong Liu et.al. | 2405.20031v1 | null |
2024-05-30 | Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar | Wouter Jansen et.al. | 2405.19869v1 | null |
2024-05-30 | SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization | Jiang Wang et.al. | 2405.19813v1 | link |
2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614v1 | null |
2024-05-27 | CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy | Richard Elvira et.al. | 2405.16932v1 | null |
2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544v1 | link |
2024-05-24 | NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes | Lizhi Bai et.al. | 2405.15151v1 | null |
2024-05-23 | ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization | Han Song et.al. | 2405.15082v2 | null |
2024-05-23 | Synergistic Global-space Camera and Human Reconstruction from Videos | Yizhou Zhao et.al. | 2405.14855v1 | null |
2024-05-23 | CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments | Yang Zhou et.al. | 2405.14731v1 | link |
2024-05-23 | Efficient Robot Learning for Perception and Mapping | Niclas Vödisch et.al. | 2405.14688v1 | null |
2024-05-22 | Monocular Gaussian SLAM with Language Extended Loop Closure | Tian Lan et.al. | 2405.13748v1 | null |
2024-05-21 | NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments | Dongha Chung et.al. | 2405.12563v2 | link |
2024-05-18 | Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation | Hyungtae Lim et.al. | 2405.11176v3 | null |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129v2 | link |
2024-05-17 | CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion | Gang Wang et.al. | 2405.10793v2 | null |
2024-05-17 | Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map | Liang Zhao et.al. | 2405.10743v1 | null |
2024-05-14 | IPC: Incremental Probabilistic Consensus-based Consistent Set Maximization for SLAM Backends | Emilio Olivastri et.al. | 2405.08503v1 | link |
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966v1 | link |
2024-05-13 | SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling | Yijun Yuan et.al. | 2405.07847v1 | null |
2024-05-12 | NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU | Yuhao Zhang et.al. | 2405.07392v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field | Chao Wang et.al. | 2406.07329v1 | null |
2024-06-11 | Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs | Kamil Jeziorek et.al. | 2406.07318v1 | null |
2024-06-11 | Let Go of Your Labels with Unsupervised Transfer | Artyom Gadetsky et.al. | 2406.07236v1 | link |
2024-06-11 | RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker | Yunfeng Li et.al. | 2406.07189v1 | link |
2024-06-11 | Increased accuracy and signal-to-noise ratio through recent improvements in Infra-Red Video Bolometer fabrication and calibration | Fabio Federici et.al. | 2406.07139v1 | null |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037v1 | null |
2024-06-11 | MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results | Xin Jin et.al. | 2406.07006v1 | null |
2024-06-11 | Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion | Xin Yuan et.al. | 2406.06972v1 | null |
2024-06-11 | Neural Visibility Field for Uncertainty-Driven Active Mapping | Shangjie Xue et.al. | 2406.06948v1 | null |
2024-06-11 | High-velocity blue-shifted Fe XXV He$α$ line during a superflare of the RS CVn-type star IM Peg | Shun Inoue et.al. | 2406.06940v1 | null |
2024-06-10 | The PAU Survey: Photometric Calibration of Narrow Band Images | F. J. Castander et.al. | 2406.06850v1 | null |
2024-06-10 | HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction | Jikai Wang et.al. | 2406.06843v1 | null |
2024-06-10 | Starling Formation-Flying Optical Experiment: Initial Operations and Flight Results | Justin Kruger et.al. | 2406.06748v1 | null |
2024-06-10 | PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction | Danpeng Chen et.al. | 2406.06521v1 | null |
2024-06-10 | SYM3D: Learning Symmetric Triplanes for Better 3D-Awareness of GANs | Jing Yang et.al. | 2406.06432v1 | null |
2024-06-10 | Notes on Various Errors and Jacobian Derivations for SLAM | Gyubeom Im et.al. | 2406.06422v1 | null |
2024-06-10 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374v1 | link |
2024-06-10 | Relativistic and wide-angle corrections to galaxy power spectra | Sheean Jolicoeur et.al. | 2406.06274v1 | null |
2024-06-10 | DualAD: Disentangling the Dynamic and Static World for End-to-End Driving | Simon Doll et.al. | 2406.06264v1 | null |
2024-06-10 | Vript: A Video Is Worth Thousands of Words | Dongjie Yang et.al. | 2406.06040v1 | link |
2024-06-09 | SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05800v1 | null |
2024-06-09 | Region of Interest Loss for Anonymizing Learned Image Compression | Christoph Liebender et.al. | 2406.05726v1 | null |
2024-06-09 | Enhancing the light yield of He:CF$_4$ based gaseous detector | F. D. Amaro et.al. | 2406.05713v1 | null |
2024-06-09 | MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation | Yan Ma et.al. | 2406.05690v1 | link |
2024-06-08 | The PLATO Mission | Heike Rauer et.al. | 2406.05447v1 | null |
2024-06-08 | MotionClone: Training-Free Motion Cloning for Controllable Video Generation | Pengyang Ling et.al. | 2406.05338v2 | null |
2024-06-07 | Lessons from the Cruise Robotaxi Pedestrian Dragging Mishap | Philip Koopman et.al. | 2406.05281v1 | null |
2024-06-07 | A Tensor Decomposition Perspective on Second-order RNNs | Maude Lizaire et.al. | 2406.05045v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Change of numeraire for weak martingale transport | Mathias Beiglböck et.al. | 2406.07523v1 | null |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics | Reuben R. W. Wang et.al. | 2406.07519v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Hybrid Machine Learning Approach for Cyberattack Mitigation of Parallel Converters in a DC Microgrid | Naser Souri et.al. | 2406.07503v1 | null |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | A model-independent test of pre-recombination New Physics: Machine Learning based estimate of the Sound Horizon from Gravitational Wave Standard Sirens and the Baryon Acoustic Oscillation Angular Scale | William Giarè et.al. | 2406.07493v1 | null |
2024-06-11 | Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction | Bekir Z. Demiray et.al. | 2406.07484v1 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | | Shashank Bhatnagar et.al. | 2406.07508v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention | Mingshuai Liu et.al. | 2406.07498v1 | null |
2024-06-11 | A pilot protocol and cohort for the investigation of non-pathological variability in speech | Nicholas Cummins et.al. | 2406.07497v1 | null |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Novel Optimized Designs of Modulo | Bhaskar Gaur et.al. | 2406.07486v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Interpolating between Hausdorff and box dimension | Amlan Banaji et.al. | 2406.07527v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | | Shashank Bhatnagar et.al. | 2406.07508v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention | Mingshuai Liu et.al. | 2406.07498v1 | null |
2024-06-11 | A pilot protocol and cohort for the investigation of non-pathological variability in speech | Nicholas Cummins et.al. | 2406.07497v1 | null |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487v1 | null |
2024-06-11 | Novel Optimized Designs of Modulo | Bhaskar Gaur et.al. | 2406.07486v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions | Haibo Wang et.al. | 2406.07525v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks | Ted Edward Holmberg et.al. | 2406.07473v1 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472v1 | null |
2024-06-11 | Multimodal Belief Prediction | John Murzaku et.al. | 2406.07466v1 | null |
2024-06-11 | Resummation of Multi-Stress Tensors in Higher Dimensions | Kuo-Wei Huang et.al. | 2406.07458v1 | null |
2024-06-11 | An Optimism-based Approach to Online Evaluation of Generative Models | Xiaoyan Hu et.al. | 2406.07451v1 | null |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450v1 | link |
2024-06-11 | Graph-based multi-Feature fusion method for speech emotion recognition | Xueyu Liu et.al. | 2406.07437v1 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning | Tonghan Wang et.al. | 2406.07428v1 | null |
2024-06-11 | DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses | Abdurrahim Yilmaz et.al. | 2406.07426v1 | null |
2024-06-11 | Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation | Hanzhao Li et.al. | 2406.07422v1 | null |
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383v1 | null |
2024-06-11 | World Models with Hints of Large Language Models for Goal Achieving | Zeyuan Liu et.al. | 2406.07381v1 | null |
2024-06-11 | Improving the realism of robotic surgery simulation through injection of learning-based estimated errors | Juan Antonio Barragan et.al. | 2406.07375v1 | null |
2024-06-11 | iMESA: Incremental Distributed Optimization for Collaborative Simultaneous Localization and Mapping | Daniel McGann et.al. | 2406.07371v1 | null |
2024-06-11 | BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction | Yinhao Bai et.al. | 2406.07365v1 | link |
2024-06-11 | AI.vs.Clinician: Unveiling Intricate Interactions Between AI and Clinicians through an Open-Access Database | Wanling Gao et.al. | 2406.07362v1 | null |
2024-06-11 | Deep Implicit Optimization for Robust and Flexible Image Registration | Rohit Jena et.al. | 2406.07361v1 | null |
2024-06-11 | GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews | Maxime Darrin et.al. | 2406.07359v1 | null |
2024-06-11 | Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities | Delfina Sol Martinez Pandiani et.al. | 2406.07353v1 | null |
2024-06-11 | DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering | Zijian Hei et.al. | 2406.07348v2 | null |
2024-06-11 | Few-Body Quantum Chaos, Localization, and Multi-Photon Entanglement in Optical Synthetic Frequency Dimension | Junlin Wang et.al. | 2406.07346v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention | Mingshuai Liu et.al. | 2406.07498v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480v1 | null |
2024-06-11 | Lower bounds for sphere packing in arbitrary norms | Carl Schildkraut et.al. | 2406.07479v1 | null |
2024-06-11 | Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks | Ted Edward Holmberg et.al. | 2406.07473v1 | null |
2024-06-11 | Microbiomes Through The Looking Glass | Jacopo Pasqualini et.al. | 2406.07465v1 | null |
2024-06-11 | Reconfigurable Intelligent Surfaces in Dynamic Rich Scattering Environments: BiLSTM-Based Optimization for Accurate User Localization | Anum Umer et.al. | 2406.07463v1 | null |
2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456v1 | link |
2024-06-11 | HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms | Josse Van Delm et.al. | 2406.07453v1 | null |
2024-06-11 | Boosted Conformal Prediction Intervals | Ran Xie et.al. | 2406.07449v1 | null |
2024-06-11 | Metastability in networks of nonlinear stochastic integrate-and-fire neurons | Siddharth Paliwal et.al. | 2406.07445v1 | null |
2024-06-11 | DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting | Yuxuan Shu et.al. | 2406.07438v1 | null |
2024-06-11 | Graph-based multi-Feature fusion method for speech emotion recognition | Xueyu Liu et.al. | 2406.07437v1 | null |
2024-06-11 | Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration | Shashank Agnihotri et.al. | 2406.07435v1 | null |
2024-06-11 | Matryoshka Representation Learning for Recommendation | Riwei Lai et.al. | 2406.07432v1 | link |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431v1 | null |
2024-06-11 | GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning | Tonghan Wang et.al. | 2406.07428v1 | null |
2024-06-11 | Graph Reasoning for Explainable Cold Start Recommendation | Jibril Frej et.al. | 2406.07420v1 | null |
2024-06-11 | Average-exact mixed anomalies and compatible phases | Yichen Xu et.al. | 2406.07417v1 | null |
2024-06-11 | Heat operators and isometry groups of Cuntz-Krieger algebras | Dimitris Michail Gerontogiannis et.al. | 2406.07416v1 | null |
2024-06-11 | Holistic Memory Diversification for Incremental Learning in Growing Graphs | Ziyue Qiao et.al. | 2406.07413v1 | null |
2024-06-11 | Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy | Xiaohan Huang et.al. | 2406.07404v1 | null |
2024-06-11 | A Survey on Recent Random Walk-based Methods for Embedding Knowledge Graphs | Elika Bozorgi et.al. | 2406.07402v1 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? | Ioannis D. Gialamas et.al. | 2406.07533v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Cosmological constraints on $Λ_{\rm s}$CDM scenario in a type II minimally modified gravity | Ozgur Akarsu et.al. | 2406.07526v1 | null |
2024-06-11 | Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions | Haibo Wang et.al. | 2406.07525v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Physics-guided weak-form discovery of reduced-order models for trapped ultracold hydrodynamics | Reuben R. W. Wang et.al. | 2406.07519v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results III. Implications for cosmic molecular gas content at "Cosmic Half-past Eleven" | D. T. Chung et.al. | 2406.07512v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | Uniqueness on average of large isoperimetric sets in noncompact manifolds with nonnegative Ricci curvature | Gioacchino Antonelli et.al. | 2406.07509v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Interacting-bath dynamical embedding for capturing non-local electron correlation in solids | Jiachen Li et.al. | 2406.07531v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report | KBTG Labs et.al. | 2406.07505v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei et.al. | 2406.07492v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Interacting-bath dynamical embedding for capturing non-local electron correlation in solids | Jiachen Li et.al. | 2406.07531v1 | null |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System | SBND Collaboration et.al. | 2406.07514v1 | null |
2024-06-11 | Accurate Current Sharing in a DC Microgrid Using Modified Droop Control Algorithm | Naser Souri et.al. | 2406.07513v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report | KBTG Labs et.al. | 2406.07505v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Interacting-bath dynamical embedding for capturing non-local electron correlation in solids | Jiachen Li et.al. | 2406.07531v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report | KBTG Labs et.al. | 2406.07505v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei et.al. | 2406.07492v1 | null |
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? | Ioannis D. Gialamas et.al. | 2406.07533v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Interacting-bath dynamical embedding for capturing non-local electron correlation in solids | Jiachen Li et.al. | 2406.07531v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Cosmological constraints on $\Lambda_{\rm s}$CDM scenario in a type II minimally modified gravity | Ozgur Akarsu et.al. | 2406.07526v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System | SBND Collaboration et.al. | 2406.07514v1 | null |
2024-06-11 | Accurate Current Sharing in a DC Microgrid Using Modified Droop Control Algorithm | Naser Souri et.al. | 2406.07513v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511v1 | null |
2024-06-11 | COMAP Pathfinder -- Season 2 results I. Improved data selection and processing | J. G. S. Lunde et.al. | 2406.07510v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Interacting-bath dynamical embedding for capturing non-local electron correlation in solids | Jiachen Li et.al. | 2406.07531v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516v1 | null |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515v1 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507v1 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506v1 | link |
2024-06-11 | THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report | KBTG Labs et.al. | 2406.07505v1 | null |
2024-06-11 | Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | Atli Sigurgeirsson et.al. | 2406.07504v1 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502v1 | link |
2024-06-11 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500v2 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499v1 | null |
2024-06-11 | TextGrad: Automatic "Differentiation" via Text | Mert Yuksekgonul et.al. | 2406.07496v1 | link |
2024-06-11 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494v2 | null |
2024-06-11 | Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei et.al. | 2406.07492v1 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551v1 | link |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540v1 | null |
2024-06-11 | BAKU: An Efficient Transformer for Multi-Task Policy Learning | Siddhant Haldar et.al. | 2406.07539v1 | null |
2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538v1 | null |
2024-06-11 | Autoregressive Pretraining with Mamba in Vision | Sucheng Ren et.al. | 2406.07537v1 | null |
2024-06-11 | Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection | Wenxiao Wang et.al. | 2406.07536v1 | null |
2024-06-11 | Dynamics of the non-radial energy-critical inhomogeneous NLS | Carlos M. Guzmán et.al. | 2406.07535v1 | null |
2024-06-11 | On the potential of probing the neutron star composition in accreting X-ray binaries | Kaiser Arf et.al. | 2406.07534v1 | null |
2024-06-11 | Interpreting DESI 2024 BAO: late-time dynamical dark energy or a local effect? | Ioannis D. Gialamas et.al. | 2406.07533v1 | null |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532v1 | link |
2024-06-11 | Interacting-bath dynamical embedding for capturing non-local electron correlation in solids | Jiachen Li et.al. | 2406.07531v1 | null |
2024-06-11 | Coherent Three-Photon Excitation of the Strontium Clock Transition | Junyu He et.al. | 2406.07530v1 | null |
2024-06-11 | MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li et.al. | 2406.07529v1 | null |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528v1 | link |
2024-06-11 | Cosmological constraints on $\Lambda_{\rm s}$CDM scenario in a type II minimally modified gravity | Ozgur Akarsu et.al. | 2406.07526v1 | null |
2024-06-11 | Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions | Haibo Wang et.al. | 2406.07525v1 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524v1 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522v1 | null |
2024-06-11 | Faster Spectral Density Estimation and Sparsification in the Nuclear Norm | Yujia Jin et.al. | 2406.07521v1 | null |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520v1 | null |