PDS Dataset Video-Tutorials

How AI-assisted workflows are unlocking California police records

An AI-powered database offers a model for extracting and structuring police records for public accessibility and ...

RoadSocial: A Diverse VideoQA Dataset and Benchmark for Road Event Understanding from Social Video Narratives

Abstract: We introduce RoadSocial, a large-scale, diverse VideoQA dataset tailored for generic road event understanding from social media narratives. Unlike existing datasets limited by regional bias, ...

IEEE

Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents

Abstract: Recent advancements in Large Language Models (LLMs) and Vision-Language Models (VLMs) have sparked significant interest in developing GUI visual agents. We introduce MONDAY (Mobile OS ...

GitHub

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

The dataset, code, model, and benchmark are currently under review. Please stay tuned. The quality and diversity of instruction-based image editing datasets are continuously increasing, yet ...

GitHub

Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets

This repository provides a PyTorch implementation of Unified World Model (UWM). UWM combines action diffusion and video diffusion to enable scalable pretraining on ...
