
RepetitionCurse: DoS Attacks on MoE LLMs

Published: Dec 30, 2025 05:24
1 min read
ArXiv

Analysis

This paper highlights a critical vulnerability in Mixture-of-Experts (MoE) large language models (LLMs). It demonstrates how adversarial inputs can exploit the routing mechanism, causing severe load imbalance and denial-of-service (DoS) conditions. The work matters because it reveals a practical attack vector that can degrade the performance and availability of deployed MoE models, threatening service-level agreements. The proposed RepetitionCurse method triggers the vulnerability with a simple, black-box approach, which makes it a credible real-world threat.
Reference

Out-of-distribution prompts can manipulate the routing strategy such that all tokens are consistently routed to the same set of top-$k$ experts, which creates computational bottlenecks.
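The quoted failure mode can be illustrated with a toy top-$k$ router. Everything below (the dot-product gating scheme, the dimensions, the random expert vectors) is an illustrative assumption, not the paper's actual attack or any real MoE implementation:

```python
from collections import Counter
import random

random.seed(0)

NUM_EXPERTS, TOP_K, DIM = 8, 2, 16

# Toy router: each expert has a random gating vector; a token is routed
# to the top-k experts by dot-product score.
gates = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]

def route(token_vec):
    scores = [sum(g * t for g, t in zip(gate, token_vec)) for gate in gates]
    return sorted(range(NUM_EXPERTS), key=lambda e: scores[e], reverse=True)[:TOP_K]

def expert_load(tokens):
    """Count how many tokens each expert must process."""
    load = Counter()
    for tok in tokens:
        load.update(route(tok))
    return load

# Varied prompt: 256 distinct token embeddings spread load across experts.
varied = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(256)]
# Repetitive prompt: the same token 256 times always hits the identical
# top-k experts, concentrating all compute on k of the 8 experts.
repeated = [varied[0]] * 256

print(sorted(expert_load(varied).values(), reverse=True))
print(sorted(expert_load(repeated).values(), reverse=True))  # [256, 256]
```

The second print shows the bottleneck: only `TOP_K` experts receive any work at all, while the rest sit idle, which is the load imbalance the paper exploits.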

Analysis

This paper addresses critical challenges of Large Language Models (LLMs) such as hallucinations and high inference costs. It proposes a framework for learning with multi-expert deferral, where uncertain inputs are routed to more capable experts and simpler queries to smaller models. This approach aims to improve reliability and efficiency. The paper provides theoretical guarantees and introduces new algorithms with empirical validation on benchmark datasets.
Reference

The paper introduces new surrogate losses and proves strong non-asymptotic, hypothesis set-specific consistency guarantees, resolving existing open questions.
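The deferral idea can be sketched as a simple confidence-threshold cascade. The model names, confidence values, and fixed threshold below are toy assumptions; the paper's actual approach learns when to defer via surrogate losses rather than a hand-set rule:

```python
def cascade(query, experts, threshold=0.8):
    """experts: list of (name, predict_fn) ordered cheap -> capable.

    Each predict_fn returns (answer, confidence). Low-confidence answers
    are deferred to the next, more capable expert in the list.
    """
    for name, predict in experts[:-1]:
        answer, confidence = predict(query)
        if confidence >= threshold:
            return name, answer
    # Fall through to the most capable expert unconditionally.
    name, predict = experts[-1]
    return name, predict(query)[0]

# Toy experts: the small model is confident only on short queries.
small = ("small-llm", lambda q: ("short-answer", 0.9 if len(q) < 20 else 0.3))
large = ("large-llm", lambda q: ("long-answer", 0.95))

print(cascade("hi", [small, large]))      # ('small-llm', 'short-answer')
print(cascade("a" * 40, [small, large]))  # ('large-llm', 'long-answer')
```

The efficiency gain comes from the first branch: cheap models answer easy queries, and only uncertain inputs pay the cost of the capable expert.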

Analysis

This paper introduces TEXT, a novel model for Multi-modal Sentiment Analysis (MSA) that leverages explanations from Multi-modal Large Language Models (MLLMs) and incorporates temporal alignment. The key contributions are the use of explanations, a temporal alignment block (combining Mamba and temporal cross-attention), and a text-routed sparse mixture-of-experts with gate fusion. The paper claims state-of-the-art performance across multiple datasets, demonstrating the effectiveness of the proposed approach.
Reference

TEXT achieves the best performance across four datasets among all tested models, including three recently proposed approaches and three MLLMs.
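The text-routed sparse mixture-of-experts with gate fusion can be sketched roughly as follows. All shapes, the dot-product gate, and the scalar "experts" are illustrative assumptions, not TEXT's actual architecture:

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def text_routed_moe(text_feat, fused_feat, experts, top_k=2):
    """Gate logits come from the text features alone; the selected
    experts then transform the fused multi-modal features."""
    logits = [sum(t * r for t, r in zip(text_feat, e["route"])) for e in experts]
    chosen = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)[:top_k]
    weights = softmax([logits[i] for i in chosen])
    # Gate fusion: softmax-weighted sum of the chosen experts' outputs.
    out = [0.0] * len(fused_feat)
    for w, i in zip(weights, chosen):
        expert_out = [x * experts[i]["scale"] for x in fused_feat]  # toy expert
        out = [o + w * eo for o, eo in zip(out, expert_out)]
    return out, chosen

experts = [
    {"route": [1.0, 0.0], "scale": 2.0},
    {"route": [0.0, 1.0], "scale": -1.0},
    {"route": [-1.0, -1.0], "scale": 0.5},
]
out, chosen = text_routed_moe([3.0, 1.0], [1.0, 2.0], experts)
print(chosen, out)  # chosen == [0, 1]
```

The key design choice this sketch captures is that routing is decided by the text modality only, while the experts operate on the fused representation.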

Research #llm 🏛️ Official · Analyzed: Dec 25, 2025 23:50

Are the recent memory issues in ChatGPT related to re-routing?

Published: Dec 25, 2025 15:19
1 min read
r/OpenAI

Analysis

This post from the OpenAI subreddit highlights a user experiencing memory issues with ChatGPT, specifically after updates 5.1 and 5.2. The user notes that the problem seems to be exacerbated when using the 4o model, particularly during philosophical conversations. The AI appears to get "re-routed," leading to repetitive behavior and a loss of context within the conversation. The user suspects that the memory resets after these re-routes. This anecdotal evidence suggests a potential bug or unintended consequence of recent updates affecting the model's ability to maintain context and coherence over extended conversations. Further investigation and confirmation from OpenAI are needed to determine the root cause and potential solutions.

Reference

"It's as if the memory of the chat resets after the re-route."

Research #LLM 🔬 Research · Analyzed: Jan 10, 2026 13:03

RoBoN: Scaling LLMs at Test Time Through Routing

Published: Dec 5, 2025 08:55
1 min read
ArXiv

Analysis

This ArXiv paper introduces RoBoN, a method for efficiently scaling Large Language Models (LLMs) at test time. The technique routes each input to a selection of LLMs and chooses the best of their outputs, potentially improving both quality and efficiency over best-of-n sampling from a single model.
Reference

The paper presents a method called RoBoN (Routed Online Best-of-n).
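A routed best-of-n loop can be sketched as follows. The round-robin router, toy "models", and length-based scorer are placeholder assumptions, not RoBoN's learned components:

```python
def routed_best_of_n(prompt, models, router, scorer, n=4):
    """For each of the n draws, the router picks which model to sample;
    the scorer then selects the single best output."""
    samples = [router(prompt, models, i)(prompt) for i in range(n)]
    return max(samples, key=scorer)

# Toy components (assumptions): round-robin router, length scorer.
models = [lambda p: p + "!", lambda p: p + "!!", lambda p: p + "!!!"]
router = lambda p, ms, i: ms[i % len(ms)]
scorer = len

print(routed_best_of_n("hi", models, router, scorer, n=4))  # hi!!!
```

The point of the routing step is to spend the n-sample budget across multiple models rather than drawing all n candidates from one, so the final max can pick from a more diverse pool.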