Abstract
Although action recognition for procedural tasks has received notable attention, it has a fundamental flaw in that no measure of success for actions is provided. This limits the applicability of such systems especially within the industrial domain, since the outcome of procedural actions is often significantly more important than the mere execution. To address this limitation, we define the novel task of procedure step recognition (PSR), focusing on recognizing the correct completion and order of procedural steps. Alongside the new task, we also present the multi-modal IndustReal dataset. Unlike currently available datasets, IndustReal contains procedural errors (such as omissions) as well as execution errors. A significant part of these errors are exclusively present in the validation and test sets, making IndustReal suitable to evaluate robustness of algorithms to new, unseen mistakes. Additionally, to encourage reproducibility and allow for scalable approaches trained on synthetic data, the 3D models of all parts are publicly available. Annotations and benchmark performance are provided for action recognition and assembly state detection, as well as the new PSR task. IndustReal, along with the code and model weights, is available at: https://github.com/TimSchoonbeek/IndustReal.
Original language | English |
---|---|
Title | 2024 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2024 |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 4353-4362 |
Number of pages | 10 |
Electronic ISBN | 979-8-3503-1892-0 |
DOIs | |
Status | Published - 9 Apr 2024 |
Event | 2024 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2024 - Waikoloa, United States. Duration: 3 Jan 2024 → 8 Jan 2024 |
Conference
Conference | 2024 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2024 |
---|---|
Short title | WACV 2024 |
Country/Region | United States |
City | Waikoloa |
Period | 3/01/24 → 8/01/24 |
Funding
The authors sincerely express their gratitude to Dr. Jacek Kustra, Goutham Balachandran, and all participants for their contributions. This work is partially executed at ASML Research and has received funding from ASML and the TKI research grant (project number TKI2112P07).
Funders | Funder number |
---|---|
ASML | TKI2112P07 |
Fingerprint
Dive into the research topics of 'IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting'. Together they form a unique fingerprint.
Datasets
- IndustReal Dataset of Egocentric Videos for Procedure Understanding
Schoonbeek, T. J. (Creator), Houben, T. (Creator), Onvlee, H. (Creator), de With, P. H. N. (Contributor) & van der Sommen, F. (Contributor), 4TU.Centre for Research Data, 23 Aug 2024
DOI: 10.4121/b008dd74-020d-4ea4-a8ba-7bb60769d224.v2, https://data.4tu.nl/datasets/b008dd74-020d-4ea4-a8ba-7bb60769d224/2, https://data.4tu.nl/datasets/b008dd74-020d-4ea4-a8ba-7bb60769d224
Dataset