Search:
Match:
2 results

Analysis

This paper introduces Direct Diffusion Score Preference Optimization (DDSPO), a novel method for improving diffusion models by aligning outputs with user intent and enhancing visual quality. The key innovation is the use of per-timestep supervision derived from contrasting outputs of a pretrained reference model conditioned on original and degraded prompts. This approach eliminates the need for costly human-labeled datasets and explicit reward modeling, making it more efficient and scalable than existing preference-based methods. The paper's significance lies in its potential to improve the performance of diffusion models with less supervision, leading to better text-to-image generation and other generative tasks.
Reference

DDSPO directly derives per-timestep supervision from winning and losing policies when such policies are available. In practice, we avoid reliance on labeled data by automatically generating preference signals using a pretrained reference model: we contrast its outputs when conditioned on original prompts versus semantically degraded variants.

Research#llm📝 BlogAnalyzed: Dec 29, 2025 07:55

Expressive Deep Learning with Magenta DDSP w/ Jesse Engel - #452

Published:Feb 1, 2021 21:22
1 min read
Practical AI

Analysis

This article summarizes a podcast episode of Practical AI featuring Jesse Engel, a Staff Research Scientist at Google's Magenta Project. The discussion centers on creativity AI, specifically how Magenta utilizes machine learning and deep learning to foster creative expression. A key focus is the Differentiable Digital Signal Processing (DDSP) library, which combines traditional DSP elements with the flexibility of deep learning. The episode also touches upon other Magenta projects, including NLP and language modeling, and Engel's vision for the future of creative AI research.
Reference

“lets you combine the interpretable structure of classical DSP elements (such as filters, oscillators, reverberation, etc.) with the expressivity of deep learning.”