Empowering VLMs for Humorous Meme Generation
Analysis
Key Takeaways
- •Proposes HUMOR, a framework for meme generation using VLMs.
- •Employs a hierarchical Chain-of-Thought for diverse reasoning.
- •Utilizes a pairwise reward model for capturing subjective humor and aligning with human preferences.
- •Demonstrates superior reasoning diversity, preference alignment, and meme quality in experiments.
- •Presents a general training paradigm for human-aligned multimodal generation.
“HUMOR employs a hierarchical, multi-path Chain-of-Thought (CoT) to enhance reasoning diversity and a pairwise reward model for capturing subjective humor.”