Search:
Match:
2 results

Analysis

This paper addresses the limitations of Text-to-SQL systems by tackling the scarcity of high-quality training data and the reasoning challenges of existing models. It proposes a novel framework combining data synthesis and a new reinforcement learning approach. The data-centric approach focuses on creating high-quality, verified training data, while the model-centric approach introduces an agentic RL framework with a diversity-aware cold start and group relative policy optimization. The results show state-of-the-art performance, indicating a significant contribution to the field.
Reference

The synergistic approach achieves state-of-the-art performance among single-model methods.

Technology#Data Science📝 BlogAnalyzed: Dec 29, 2025 07:40

Assessing Data Quality at Shopify with Wendy Foster - #592

Published:Sep 19, 2022 16:48
1 min read
Practical AI

Analysis

This article from Practical AI discusses data quality at Shopify, focusing on the work of Wendy Foster, a director of engineering & data science. The conversation highlights the data-centric approach versus model-centric approaches, emphasizing the importance of data coverage and freshness. It also touches upon data taxonomy, challenges in large-scale ML model production, future use cases, and Shopify's new ML platform, Merlin. The article provides insights into how a major e-commerce platform like Shopify manages and leverages data for its merchants and product data.
Reference

We discuss how they address, maintain, and improve data quality, emphasizing the importance of coverage and “freshness” data when solving constantly evolving use cases.