Computer vision; Deep learning; SoccerNet-v3; Human tracking; Player tracking; Soccer; Sports; Sport; Football
Abstract :
[en] Tracking objects in soccer videos is extremely important to gather both player and team statistics, whether it is to estimate the total distance run, the ball possession or the team formation. Video processing can help automating the extraction of those information, without the need of any invasive sensor, hence applicable to any team on any stadium. Yet, the availability of datasets to train learnable models and benchmarks to evaluate methods on a common testbed is very limited. In this work, we propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each, representative of challenging soccer scenarios, and a complete 45-minutes half-time for long-term tracking. The dataset is fully annotated with bounding boxes and tracklet IDs, enabling the training of MOT baselines in the soccer domain and a full benchmarking of those methods on our segregated challenge sets. Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved, with several improvement required in case of fast motion or in scenarios of severe occlusion.
Research Center/Unit :
Montefiore Institute - Montefiore Institute of Electrical Engineering and Computer Science - ULiège TELIM
Disciplines :
Electrical & electronics engineering
Author, co-author :
Cioppa, Anthony ✱; Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
Giancola, Silvio ✱
Deliège, Adrien ✱; Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Télécommunications
Kang, Le ✱
Zhou, Xin ✱
Cheng, Zhiyu
Ghanem, Bernard
Van Droogenbroeck, Marc ; Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
✱ These authors have contributed equally to this work.
Language :
English
Title :
SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos
Publication date :
June 2022
Event name :
International Workshop on Computer Vision in Sports (CVsports)
Event organizer :
IEEE
Event place :
New Orleans, United States - Louisiana
Event date :
du 19 juin 2022 au 20 juin 2022
Audience :
International
Main work title :
IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Publisher :
IEEE
Pages :
3490-3501
Peer reviewed :
Peer reviewed
Name of the research project :
Applications et Recherche pour une Intelligence Artificielle de Confiance (ARIAC)
Funders :
SPW - Public Service of Wallonia
Funding number :
2010235
Funding text :
This work was supported by the Service Public de Wallonie (SPW) Recherche, under Grant No. 2010235 – ARIAC by DigitalWallonia4.ai
Mykhaylo Andriluka, Stefan Roth, and Bernt Schiele. Monocular 3D pose estimation and tracking by detection. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 623-630, San Francisco, CA, USA, June 2010. 2
Samreen Anjum and Danna Gurari. CTMC: Cell tracking with mitosis detection dataset challenge. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 982-983, Seattle, WA, USA, June 2020. 2, 4
Adrià Arbués Sangüesa, Adriàn Martín, Javier Fernández, Coloma Ballester, and Gloria Haro. Using player's bodyorientation to model pass feasibility in soccer. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), pages 3875-3884, Seattle, WA, USA, June 2020. 3
PaddlePaddle Authors. PaddleDetection, object detection and instance segmentation toolkit based on PaddlePaddle. https://github.com/PaddlePaddle/ PaddleDetection, 2019. 6
Philipp Bergmann, Tim Meinhardt, and Laura Leal-Taixé. Tracking without bells and whistles. In IEEE Int. Conf. Comput. Vis. (ICCV), pages 941-951, Seoul, South Korea, Oct.-Nov. 2019. 2
Keni Bernardin and Rainer Stiefelhagen. Evaluating multiple object tracking performance: the CLEAR MOT metrics. EURASIP J. Image and Video Process., 2008:1-10, May 2008. 6
Marco Bertini, Alberto Del Bimbo, and Walter Nunziati. Player identification in soccer videos. In ACM SIGMM Int. workshop on Multimedia inf. retrieval, pages 25-32, Nov. 2005. 3
Alex Bewley, Zongyuan Ge, Lionel Ott, Fabio Ramos, and Ben Upcroft. Simple online and realtime tracking. In IEEE Int. Conf. Image Process. (ICIP), pages 3464-3468, Phoenix, AZ, USA, Sept. 2016. Inst. Elect. and Electron. Engineers (IEEE). 2, 6
Erik Bochinski, Tobias Senst, and Thomas Sikora. Extending IOU based multi-object tracking by visual information. In IEEE Int. Conf. Adv. Video and Signal Based Surveillance (AVSS), pages 1-6, Auckland, New Zealand, Nov. 2018. 2
Anthony Cioppa, Adrien Deliège, Silvio Giancola, Bernard Ghanem, and Marc Van Droogenbroeck. SoccerNet-v3: Scaling up SoccerNet with multi-view spatial localization and re-identification. Submitted to Scientific Data, 2022. 3
Anthony Cioppa, Adrien Deliège, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck, Rikke Gade, and Thomas B Moeslund. A context-aware loss function for action spotting in soccer videos. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 13123-13133, Seattle, WA, USA, June 2020. 3
Anthony Cioppa, Adrien Deliège, Silvio Giancola, Floriane Magera, Olivier Barnich, Bernard Ghanem, and Marc Van Droogenbroeck. Camera calibration and player localization in SoccerNet-v2 and investigation of their representations for action spotting. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), CVsports, pages 4537-4546, Nashville, TN, USA, June 2021. 3
Anthony Cioppa, Adrien Deliège, Maxime Istasse, Christophe De Vleeschouwer, and Marc Van Droogenbroeck. ARTHuS: Adaptive real-time human segmentation in sports through online distillation. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), CVsports, pages 2505-2514, Long Beach, CA, USA, June 2019. 3
Anthony Cioppa, Adrien Deliège, Noor Ul Huda, Rikke Gade, Marc Van Droogenbroeck, and Thomas B. Moeslund. Multimodal and multiview distillation for real-time player detection on a football field. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), CVsports, pages 3846-3855, Seattle, WA, USA, June 2020. 3
Achal Dave, Tarasha Khurana, Pavel Tokmakov, Cordelia Schmid, and Deva Ramanan. TAO: A large-scale benchmark for tracking any object. In Eur. Conf. Comput. Vis. (ECCV), volume 12350 of Lect. Notes Comput. Sci., pages 436-454. Springer, 2020. 2, 4
DeepSORT in PaddleDetection. https://github. com/PaddlePaddle/PaddleDetection/tree/release/2.3/configs/mot/deepsort. 2, 6
Adrien Deliège, Anthony Cioppa, Silvio Giancola, Meisam J. Seikavandi, Jacob V. Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B. Moeslund, and Marc Van Droogenbroeck. SoccerNet-v2: A dataset and benchmarks for holistic understanding of broadcast soccer videos. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), CVsports, pages 4508-4519, Nashville, TN, USA, June 2021. 2, 3, 4
Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, and Laura Leal-Taixé. MOT20: A benchmark for multi object tracking in crowded scenes. arXiv, abs/2003.09003, 2020. 2, 4, 6, 7
Patrick Dendorfer and Heon Song. Motchallenge evaluation kit. https://github.com/dendorferpatrick/ MOTChallengeEvalKit, 2020. 6
Piotr Dollar, Christian Wojek, Bernt Schiele, and Pietro Perona. Pedestrian detection: A benchmark. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 304-311, Miami, FL, USA, June 2009. Inst. Elect. and Electron. Engineers (IEEE). 6
Andreas Ess, Bastian Leibe, Konrad Schindler, and Luc Van Gool. A mobile vision system for robust multi-person tracking. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 1-8, Anchorage, AK, USA, June 2008. Inst. Elect. and Electron. Engineers (IEEE). 6
FairMOT in PaddleDetection. https://github com/PaddlePaddle/PaddleDetection/tree/release/2.3/configs/mot/fairmot. 6
Baba Fakhar, Hamidreza Rashidy Kanan, and Alireza Behrad. Event detection in soccer videos using unsupervised learning of spatio-temporal features based on pooled spatial pyramid model. Multimedia Tools and Applicat., 78(12):16995-17025, June 2019. 2
Na Feng, Zikai Song, Junqing Yu, Yi-Ping Phoebe Chen, Yizhu Zhao, Yunfeng He, and Tao Guan. SSET: a dataset for shot segmentation, event detection, player tracking in soccer videos. Multimedia Tools and Applicat., 79(39):28971-28992, Oct. 2020. 3, 4, 5
James Ferryman and Ali Shahrokni. PETS2009: Dataset and challenge. In IEEE Int. Work. Perform. Evaluation Track. Surveill. (PETS), pages 1-6, Snowbird, UT, USA, Dec. 2009. 2
Pascual Figueroa, Neucimar Leite, Ricardo ML Barros, Isaac Cohen, and Gerard Medioni. Tracking soccer players using the graph representation. In IEEE Int. Conf. Pattern Recogn. (ICPR), volume 4, pages 787-790, Cambridge, UK, Aug. 2004. IEEE. 3
Xin Gao, Xusheng Liu, Taotao Yang, Guilin Deng, Hao Peng, Qiaosong Zhang, Hai Li, and Junhui Liu. Automatic key moment extraction and highlights generation based on comprehensive soccer video understanding. In IEEE Int. Conf. Multimedia and Expo Work. (ICMEW), pages 1-6, London, UK, July 2020. 2
Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, and Jian Sun. YOLOX: Exceeding YOLO series in 2021. arXiv, abs/2107.08430, 2021. 6
Andreas Geiger, Philip Lenz, and Raquel Urtasun. Are we ready for autonomous driving? The KITTI vision benchmark suite. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 3354-3361, Providence, RI, USA, June 2012. 2, 4
Sebastian Gerke, Antje Linnemann, and Karsten Müller. Soccer player recognition using spatial constellation features and jersey number recognition. Comp. Vis. and Image Underst., 159:105-115, June 2017. 3
Silvio Giancola, Mohieddine Amine, Tarek Dghaily, and Bernard Ghanem. SoccerNet: A scalable dataset for action spotting in soccer videos. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn.Work. (CVPRW), pages 1711-1721, Salt Lake City, UT, USA, June 2018. 2, 3
Samuel Hurault, Coloma Ballester, and Gloria Haro. Selfsupervised small soccer player detection and tracking. In Int. ACM Work. Multimedia Content Anal. Sports (MMSports), pages 9-18, Seattle, WA, USA, Oct. 2020. 3
Maxime Istasse, Julien Moreau, and Christophe De Vleeschouwer. Associative embedding for team discrimination. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), CVsports, pages 2477-2486, Long Beach, CA, USA, June 2019. 3
Sachiko Iwase and Hideo Saito. Parallel tracking of all soccer players by integrating detected positions in multiple view images. In IEEE Int. Conf. Pattern Recogn. (ICPR), volume 4, pages 751-754, Cambridge, UK, Aug. 2004. 3
Haohao Jiang, Yao Lu, and Jing Xue. Automatic soccer video event detection based on a deep neural network combined CNN and RNN. In IEEE Int. Conf. Tools with Artif. Intell. (ICTAI), pages 490-494, San Jose, CA, USA, Nov. 2016. 2
Yudong Jiang, Kaixu Cui, Leilei Chen, Canjin Wang, and Changliang Xu. SoccerDB: A large-scale database for comprehensive video understanding. In Int. ACM Work. Multimedia Content Anal. Sports (MMSports), pages 1-8, 2020. 2
Rudolph Emil Kalman. A new approach to linear filtering and prediction problems. J. Basic Eng., 82(1):35-45, 1960. 6
Muhammad Zeeshan Khan, Summra Saleem, Muhammad A. Hassan, and Muhammad Usman Ghanni Khan. Learning deep C3D features for soccer video event detection. In Int. Conf. Emerging Technol. (ICET), pages 1-6, Islamabad, Pakistan, Nov. 2018. 2
Maria Koshkina, Hemanth Pidaparthy, and James H. Elder. Contrastive learning for sports video: Unsupervised player classification. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), pages 4528-4536, Nashville, TN, USA, June 2021. 3
Laura Leal-Taixé, Anton Milan, Ian Reid, Stefan Roth, and Konrad Schindler. MOTchallenge 2015: Towards a benchmark for multi-target tracking. arXiv, abs/1504.01942, 2015. 2
Gen Li, Shikun Xu, Xiang Liu, Lei Li, and Changhu Wang. Jersey number recognition with semi-supervised spatial transformer network. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), pages 1783-1790, Salt Lake City, UT, USA, June 2018. 3
Hengyue Liu and Bir Bhanu. Pose-guided R-CNN for jersey number recognition in sports. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn.Work. (CVPRW), pages 2457-2466, Long Beach, CA, USA, 2019. 3
Tingxi Liu, Yao Lu, Xiaoyu Lei, Lijing Zhang, HaoyuWang, Wei Huang, and Zijian Wang. Soccer video event detection using 3D convolutional networks and shot boundary detection via deep feature distance. In Int. Conf. Neural Inf. Process., volume 10635 of Lect. Notes Comput. Sci., pages 440-449. Springer, 2017. 2
Yang Liu, Luiz G. Hafemann, Michael Jamieson, and Mehrsan Javan. Detecting and matching related objects with one proposal multiple predictions. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 4520-4527, Nashville, TN, USA, June 2021. 3
Wei-Lwun Lu, Jo-Anne Ting, James J. Little, and Kevin P. Murphy. Learning to track and identify players from broadcast sports videos. IEEE Trans. Pattern Anal. Mach. Intell., 35(7):1704-1716, July 2013. 3
Jonathon Luiten and Arne Hoffhues. Trackeval. https:// github.com/JonathonLuiten/TrackEval, 2020. 6
Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixé, and Bastian Leibe. HOTA: A higher order metric for evaluating multi-object tracking. Int. J. Comp. Vis., 129(2):548-578, Oct. 2021. 6
Wenhan Luo, Junliang Xing, Anton Milan, Xiaoqin Zhang, Wei Liu, and Tae-Kyun Kim. Multiple object tracking: A literature review. Artif. Intell., 293, Apr. 2021. 2
Siwei Lyu, Ming-Ching Chang, Dawei Du, Wenbo Li, Yi Wei, Marco Del Coco, Pierluigi Carcagnì, Arne Schumann, Bharti Munjal, Doo-Hyun Choi, et al. UA-DETRAC 2018: Report of AVSS2018 & IWT4S challenge on advanced traffic monitoring. In IEEE Int. Conf. Adv. Video and Signal Based Surveillance (AVSS), pages 1-6, Auckland, New Zealand, Nov. 2018. 2
Mehrtash Manafifard, Hamid Ebadi, and Hamid Abrishami Moghaddam. A survey on player tracking in soccer videos. Comp. Vis. and Image Underst., 159:19-46, June 2017. 3
Anton Milan, Laura Leal-Taixé, Ian Reid, Stefan Roth, and Konrad Schindler. MOT16: A benchmark for multi-object tracking. arXiv, abs/1603.00831, 2016. 2, 4, 6
Mahdiar Nekoui, Fidel Omar Tito Cruz, and Li Cheng. FALCONS: FAast Learner-grader for CONtorted poses in Sports. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), pages 900-901, Seattle, WA, USA, June 2020. 3
Peter Nillius, Josephine Sullivan, and Stefan Carlsson. Multi-target tracking-linking identities using Bayesian network inference. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), volume 2, pages 2187-2194, New York City, NY, USA, June 2006. 3
Malte Pedersen, Joakim Bruslund Haurum, Stefan Hein Bengtson, and Thomas B. Moeslund. 3D-ZeF: A 3D zebrafish tracking benchmark dataset. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 2426-2436, Seattle, WA, USA, June 2020. 2, 4
Svein Arne Pettersen, Dag Johansen, Håvard Johansen, Vegard Berg-Johansen, Vamsidhar Reddy Gaddam, Asgeir Mortensen, Ragnar Langseth, Carsten Griwodz, Håkon Kvale Stensland, and Pål Halvorsen. Soccer video and player position dataset. In ACM Multimedia Syst. Conf., Singapore, Singapore, Mar. 2014. Assoc. for Comput. Machinery. 3
Upendra M. Rao and Umesh C. Pati. A novel algorithm for detection of soccer ball and player. In Int. Conf. Commun. and Signal Process. (ICCSP), pages 344-348, Melmaruvathur, India, Apr. 2015. 3
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell., 39(6):1137-1149, June 2017. 3
Francisco Romero-Ferrero, Mattia G. Bergomi, Robert C. Hinz, Francisco J. H. Heras, and Gonzalo G. de Polavieja. idtracker. ai: tracking all individuals in small or large collectives of unmarked animals. Nat. methods, 16(2):179-182, Jan. 2019. 2
Melike Şah and Cem Direkoǧlu. Evaluation of image representations for player detection in field sports using convolutional neural networks. In Int. Conf. Theory Appl. Fuzzy Syst. Soft Comput., volume 896 of Adv. in Intell. Syst. and Comput., pages 107-115. Springer, 2019. 3
Ryan Sanford, Siavash Gorji, Luiz G. Hafemann, Bahareh Pourbabaee, and Mehrsan Javan. Group activity detection from trajectory and video data in soccer. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. Work. (CVPRW), pages 898-899, Seattle, WA, USA, June 2020. 3
Himangi Saraogi, Rahul Anand Sharma, and Vijay Kumar. Event recognition in broadcast soccer videos. In Indian Conf. Comput. Vision, Graph. Image Process., pages 1-7, Dec. 2016. 2
Shuai Shao, Zijian Zhao, Boxun Li, Tete Xiao, Gang Yu, Xiangyu Zhang, and Jian Sun. CrowdHuman: A benchmark for detecting human in a crowd. arXiv, abs/1805.00123, 2018. 7
Mohamad-Hoseyn Sigari, Hamid Soltanian-Zadeh, and Hamid-Reza Pourreza. A framework for dynamic restructuring of semantic video analysis systems based on learning attention control. Image and Vis. Comp., 53:20-34, 2016. Event-based Media Processing and Analysis. 2
Zikai Song, ZhiwenWan, Wei Yuan, Ying Tang, Junqing Yu, and Yi-Ping Phoebe Chen. Distractor-aware tracker with a domain-special optimized benchmark for soccer player tracking. In Int. Conf. Multimedia Retrieval, page 276-284, Taipei, Taiwan, Aug. 2021. Assoc. for Comput. Machinery. 3
Josephine Sullivan and Stefan Carlsson. Tracking and labelling of interacting multiple targets. In Eur. Conf. Comput. Vis. (ECCV), volume 3953 of Lect. Notes Comput. Sci., pages 619-632. Springer, 2006. 3
ShiJie Sun, Naveed Akhtar, HuanSheng Song, Ajmal Mian, and Mubarak Shah. Deep affinity network for multiple object tracking. IEEE Trans. Pattern Anal. Mach. Intell., 43(1):104-119, Jan. 2019. 2
Ramana Sundararaman, Cedric De Almeida Braga, Eric Marchand, and Julien Pettre. Tracking pedestrian heads in dense crowd. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 3865-3875, Nashville, TN, USA, June 2021. 2, 4
Inc. SuperAnnotate AI. SuperAnnotate. https:// superannotate.com/. 4
Mostafa Tavassolipour, Mahmood Karimian, and Shohreh Kasaei. Event detection and summarization in soccer videos using Bayesian network and copula. IEEE Trans. Circuits and Syst. for Video Technol., 24(2):291-304, Feb. 2014. 2
Rajkumar Theagarajan, Federico Pala, Xiu Zhang, and Bir Bhanu. Soccer: Who has the ball? generating visual analytics and player statistics. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn.Work. (CVPRW), pages 1749-1757, Salt Lake City, UT, USA 2018. 3
Quang Tran, Bac Vo, Tien Dinh, and Duc Duong. Automatic player detection, tracking and mapping to field model for broadcast soccer videos. In Int. Conf. Adv. Mob. Comput. Multimedia (MoMM), pages 240-243. ACM Press, Dec. 2011. 3
Zhongdao Wang, Liang Zheng, Yixuan Liu, Yali Li, and Shengjin Wang. Towards real-time multi-object tracking. In Eur. Conf. Comput. Vis. (ECCV), volume 12356 of Lect. Notes Comput. Sci., pages 107-122, 2020. 6
Longyin Wen, Dawei Du, Zhaowei Cai, Zhen Lei, Ming-Ching Chang, Honggang Qi, Jongwoo Lim, Ming-Hsuan Yang, and Siwei Lyu. UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking. Comp. Vis. and Image Underst., 193, Apr. 2020. 2
Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, and Xiaogang Wang. Joint detection and identification feature learning for person search. arXiv, abs/1604.01850, 2016. 6
Junliang Xing, Haizhou Ai, Liwei Liu, and Shihong Lao. Multiple player tracking in sports video: A dual-mode twoway Bayesian inference approach with progressive observation modeling. IEEE Trans. Image Process., 20(6):1652-1667, June 2010. 3
Yukun Yang, Min Xu, Wanneng Wu, Ruiheng Zhang, and Yu Peng. 3D multiview basketball players detection and localization based on probabilistic occupancy. In Digit. Image Comp.: Tech. and Applicat., pages 1-8, Canberra, ACT, Australia, Dec. 2018. 3
Junqing Yu, Aiping Lei, Zikai Song, Tingting Wang, Hengyou Cai, and Na Feng. Comprehensive dataset of broadcast soccer videos. In IEEE Conf. Multimedia Inf. Process. Retr. (MIPR), pages 418-423, Miami, FL, USA, June 2018. 3
Shanshan Zhang, Rodrigo Benenson, and Bernt Schiele. CityPersons: A diverse dataset for pedestrian detection. In IEEE Int. Conf. Comput. Vis. and Pattern Recogn. (CVPR), pages 4457-4465, Honolulu, HI, USA, July 2017. Inst. Elect. and Electron. Engineers (IEEE). 6
Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Zehuan Yuan, Ping Luo, Wenyu Liu, and Xinggang Wang. Byte-Track: Multi-object tracking by associating every detection box. arXiv, abs/2110.06864, 2021. 2, 6
Yifu Zhang, Chunyu Wang, Xinggang Wang, Wenjun Zeng, and Wenyu Liu. FairMOT: On the fairness of detection and re-identification in multiple object tracking. Int. J. Comp. Vis., 129:3069-3087, Sept. 2021. 2, 6
Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, and Qi Tian. Person re-identification in the wild. arXiv, abs/1604.02531, 2016. 6