Search: "input-dependent" (入力依存) - ai.jp.net

Research Paper #Vision Transformers, Compositionality, Wavelet Transforms 🔬 Research • Analyzed: Jan 3, 2026 09:28

Compositionality in Vision Transformers Explored with Wavelets

Published: Dec 30, 2025 19:43 • 1 min read • ArXiv

Analysis

This paper investigates compositionality in Vision Transformers (ViTs), using the Discrete Wavelet Transform (DWT) to build input-dependent primitives. It adapts a framework from language tasks to analyze how ViT encoders structure information. The results suggest that primitives from a one-level DWT decomposition yield encoder representations that approximately compose in latent space.

Key Takeaways

• Applies a compositionality analysis framework, previously used for language models, to Vision Transformers.
• Uses Discrete Wavelet Transforms (DWTs) to generate image primitives.
• Finds evidence of compositional behavior in ViT latent space using DWT-based primitives.
• Offers a new perspective on how ViTs structure visual information.

Reference

“Primitives from a one-level DWT decomposition produce encoder representations that approximately compose in latent space.”
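To make the idea of DWT-based primitives concrete, here is a minimal sketch, not the paper's implementation. It assumes a Haar wavelet, builds one primitive per subband by inverting that subband alone, and assumes that "composing" means summing encoder outputs. The names `haar_dwt2`, `haar_idwt2`, `dwt_primitives`, and `composition_error` are hypothetical, and `encoder` stands in for any function mapping an image to a vector.

```python
import numpy as np

def haar_dwt2(img):
    """One-level 2D Haar DWT of an array with even height and width."""
    a, b = img[0::2, 0::2], img[0::2, 1::2]
    c, d = img[1::2, 0::2], img[1::2, 1::2]
    ll = (a + b + c + d) / 2.0   # approximation subband
    lh = (a + b - c - d) / 2.0   # detail subbands (naming conventions vary)
    hl = (a - b + c - d) / 2.0
    hh = (a - b - c + d) / 2.0
    return ll, lh, hl, hh

def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2."""
    h, w = ll.shape
    out = np.empty((2 * h, 2 * w), dtype=float)
    out[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    out[0::2, 1::2] = (ll + lh - hl - hh) / 2.0
    out[1::2, 0::2] = (ll - lh + hl - hh) / 2.0
    out[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return out

def dwt_primitives(img):
    """Four image-sized primitives, one per subband, that sum to img."""
    bands = haar_dwt2(img)
    primitives = []
    for i, band in enumerate(bands):
        kept = [np.zeros_like(b) for b in bands]
        kept[i] = band           # keep one subband, zero the rest
        primitives.append(haar_idwt2(*kept))
    return primitives

def composition_error(encoder, img):
    """Relative gap between encoder(img) and the sum of encoded primitives.

    Additive composition is an assumption made for this sketch.
    """
    whole = encoder(img)
    composed = sum(encoder(p) for p in dwt_primitives(img))
    return np.linalg.norm(whole - composed) / np.linalg.norm(whole)
```

Because the inverse transform is linear, the four primitives sum back to the input, so a linear encoder gives an error of zero up to rounding. For a nonlinear encoder, the value measures how far it departs from additive composition under the assumptions above.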


Research Paper #Robotics, Control Systems, UAVs 🔬 Research • Analyzed: Jan 3, 2026 15:43

HBO-PID for UAV Trajectory Tracking

Published: Dec 30, 2025 14:21 • 1 min read • ArXiv

Analysis

This paper introduces HBO-PID, a control algorithm for UAV trajectory tracking that integrates Heteroscedastic Bayesian Optimization (HBO) with a PID controller. Modeling input-dependent noise is intended to improve accuracy and robustness, and a two-stage optimization strategy makes parameter tuning more efficient. The work targets the underactuated, nonlinear dynamics that make UAV control difficult, and reports position-accuracy gains of 24.7% to 42.9% and angular-accuracy gains of 40.9% to 78.4% over state-of-the-art methods.

Key Takeaways

• HBO-PID integrates Heteroscedastic Bayesian Optimization with a PID controller.
• The method addresses challenges of UAV control, such as underactuated and nonlinear dynamics.
• A two-stage optimization strategy is used for efficient parameter tuning.
• Significant performance improvements are demonstrated over SOTA methods in both simulation and real-world scenarios.

Reference

“The proposed method significantly outperforms state-of-the-art (SOTA) methods. Compared to SOTA methods, it improves the position accuracy by 24.7% to 42.9%, and the angular accuracy by 40.9% to 78.4%.”
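As a rough illustration of what tuning a PID controller involves, the sketch below shows a discrete PID loop and a tracking cost that an optimizer could minimize over the gains. It is not the paper's method: the Bayesian optimization step and the heteroscedastic noise model are omitted, and the first-order plant `x' = -x + u` is a toy assumption, not UAV dynamics. The names `PID` and `tracking_cost` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class PID:
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: Optional[float] = None

    def step(self, error: float, dt: float) -> float:
        """Return the control output for the current tracking error."""
        self.integral += error * dt
        # No derivative term on the first call, to avoid a startup kick.
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

def tracking_cost(kp, ki, kd, setpoint=1.0, dt=0.01, steps=2000):
    """Mean squared tracking error on the toy plant x' = -x + u (Euler steps)."""
    pid = PID(kp, ki, kd)
    x = 0.0
    total = 0.0
    for _ in range(steps):
        error = setpoint - x
        u = pid.step(error, dt)
        x += dt * (-x + u)       # toy first-order plant, assumed for illustration
        total += error * error
    return total / steps
```

A gain tuner would treat `tracking_cost` as the objective to minimize. On this toy plant, a proportional-only controller with a small gain leaves a large steady-state error, so adding integral action lowers the cost noticeably.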


Research #Cognitive Maps 🔬 Research • Analyzed: Jan 10, 2026 14:22

MapFormer: Self-Supervised Learning Advances Cognitive Mapping

Published: Nov 24, 2025 16:29 • 1 min read • ArXiv

Analysis

MapFormer applies self-supervised learning to cognitive mapping, an important capability for embodied AI. Its key technical innovation is the use of input-dependent positional embeddings.

Key Takeaways

• MapFormer explores self-supervised learning for cognitive map creation.
• Input-dependent positional embeddings are a key technical component.
• This research has implications for embodied AI and robotics.

Reference

“MapFormer utilizes input-dependent positional embeddings.”
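The summary does not say how MapFormer computes its embeddings, so the following is only a generic sketch of what "input-dependent" can mean for a positional embedding. It assumes each token contributes a positive step computed from its features, that a token's position is the running sum of those steps, and that positions are then encoded sinusoidally. The names `input_dependent_positions` and `sinusoidal_embedding` and the projection vector `w` are hypothetical.

```python
import numpy as np

def input_dependent_positions(x, w):
    """Positions as a running sum of per-token steps computed from the input.

    x: (tokens, features) array; w: (features,) projection, assumed learned.
    """
    steps = np.log1p(np.exp(x @ w))   # softplus keeps every step positive
    return np.cumsum(steps)

def sinusoidal_embedding(positions, dim):
    """Standard sinusoidal encoding applied to real-valued positions (dim even)."""
    freqs = 1.0 / (10000.0 ** (np.arange(0, dim, 2) / dim))
    angles = positions[:, None] * freqs[None, :]
    emb = np.empty((len(positions), dim))
    emb[:, 0::2] = np.sin(angles)
    emb[:, 1::2] = np.cos(angles)
    return emb
```

With `w` set to zero every step equals `log(2)`, so the scheme reduces to evenly spaced positions. With a nonzero `w`, the spacing between tokens varies with their content.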

