Search:
Match:
3 results

Analysis

This paper investigates the compositionality of Vision Transformers (ViTs) by using Discrete Wavelet Transforms (DWTs) to create input-dependent primitives. It adapts a framework from language tasks to analyze how ViT encoders structure information. The use of DWTs provides a novel approach to understanding ViT representations, suggesting that ViTs may exhibit compositional behavior in their latent space.
Reference

Primitives from a one-level DWT decomposition produce encoder representations that approximately compose in latent space.

HBO-PID for UAV Trajectory Tracking

Published:Dec 30, 2025 14:21
1 min read
ArXiv

Analysis

This paper introduces a novel control algorithm, HBO-PID, for UAV trajectory tracking. The core innovation lies in integrating Heteroscedastic Bayesian Optimization (HBO) with a PID controller. This approach aims to improve accuracy and robustness by modeling input-dependent noise. The two-stage optimization strategy is also a key aspect for efficient parameter tuning. The paper's significance lies in addressing the challenges of UAV control, particularly the underactuated and nonlinear dynamics, and demonstrating superior performance compared to existing methods.
Reference

The proposed method significantly outperforms state-of-the-art (SOTA) methods. Compared to SOTA methods, it improves the position accuracy by 24.7% to 42.9%, and the angular accuracy by 40.9% to 78.4%.

Research#Cognitive Maps🔬 ResearchAnalyzed: Jan 10, 2026 14:22

MapFormer: Self-Supervised Learning Advances Cognitive Mapping

Published:Nov 24, 2025 16:29
1 min read
ArXiv

Analysis

The research, focusing on MapFormer, demonstrates progress in self-supervised learning for cognitive mapping, a crucial area for embodied AI. The use of input-dependent positional embeddings is a key technical innovation within this work.
Reference

MapFormer utilizes input-dependent positional embeddings.