Search: text-image - ai.jp.net

Research Paper #Diffusion Models, Generative AI, Preference Learning 🔬 ResearchAnalyzed: Jan 3, 2026 18:51

DDSPO: Enhancing Diffusion Models with Self-Supervised Preference Learning

Published:Dec 29, 2025 12:46

•

1 min read

•

ArXiv

Analysis

This paper introduces Direct Diffusion Score Preference Optimization (DDSPO), a novel method for improving diffusion models by aligning outputs with user intent and enhancing visual quality. The key innovation is the use of per-timestep supervision derived from contrasting outputs of a pretrained reference model conditioned on original and degraded prompts. This approach eliminates the need for costly human-labeled datasets and explicit reward modeling, making it more efficient and scalable than existing preference-based methods. The paper's significance lies in its potential to improve the performance of diffusion models with less supervision, leading to better text-to-image generation and other generative tasks.

Key Takeaways

•DDSPO is a novel method for preference-based training of diffusion models.
•It uses per-timestep supervision derived from contrasting outputs of a pretrained reference model.
•It eliminates the need for human-labeled data and explicit reward modeling.
•DDSPO improves text-image alignment and visual quality.
•It requires significantly less supervision compared to existing methods.

Reference

“DDSPO directly derives per-timestep supervision from winning and losing policies when such policies are available. In practice, we avoid reliance on labeled data by automatically generating preference signals using a pretrained reference model: we contrast its outputs when conditioned on original prompts versus semantically degraded variants.”

Permalink ArXiv

Paper #LLM 🔬 ResearchAnalyzed: Jan 3, 2026 19:08

REVEALER: Reinforcement-Guided Visual Reasoning for Text-Image Alignment Evaluation

Published:Dec 29, 2025 03:24

•

1 min read

•

ArXiv

Analysis

This paper addresses a crucial problem in text-to-image (T2I) models: evaluating the alignment between text prompts and generated images. Existing methods often lack fine-grained interpretability. REVEALER proposes a novel framework using reinforcement learning and visual reasoning to provide element-level alignment evaluation, offering improved performance and efficiency compared to existing approaches. The use of a structured 'grounding-reasoning-conclusion' paradigm and a composite reward function are key innovations.

Key Takeaways

Reference

“REVEALER achieves state-of-the-art performance across four benchmarks and demonstrates superior inference efficiency.”

Permalink ArXiv

DDSPO: Enhancing Diffusion Models with Self-Supervised Preference Learning

Analysis

Key Takeaways

REVEALER: Reinforcement-Guided Visual Reasoning for Text-Image Alignment Evaluation

Analysis

Key Takeaways

📬 Get AI News Delivered

Browse by Category

Trending Topics

📬 Get AI News Delivered

Browse by Category

Trending Topics