AI Detectives on the Construction Site: VLMs See Workers' Actions & Emotions!
Analysis
Key Takeaways
- •VLMs are being used to analyze construction worker actions and emotions from images.
- •GPT-4o demonstrated superior performance in both action and emotion recognition compared to other models.
- •This research has the potential to significantly improve safety and productivity on construction sites.
“GPT-4o consistently achieved the highest scores across both tasks, with an average F1-score of 0.756 and accuracy of 0.799 in action recognition, and an F1-score of 0.712 and accuracy of 0.773 in emotion recognition.”