[Computer Vision] Overview of image and video subtasks and datasets

Task types and related datasets (ordered by popularity)

Video Object Tracking
- Dataset
  - OTB(Object Tracking Benchmark)
Video Action Classification / Action Recognition
- Dataset
  - UCF101: 13,320 video clips labeled with 101 action categories, grouped into 5 types: 1) Human-Object Interaction, 2) Body-Motion Only, 3) Human-Human Interaction, 4) Playing Musical Instruments, 5) Sports
  - Kinetics: 500,000 high-quality video clips (around 10 seconds each) covering 600 human action classes
  - HMDB51: 6,849 video clips and 51 action categories
Video Object Segmentation / Semantic Segmentation / Panoptic Segmentation
- Dataset
  - DAVIS (Densely Annotated VIdeo Segmentation): 50 video sequences with densely annotated frames
  - Cityscapes: 5,000 finely annotated and 20,000 coarsely annotated images (X video), with pixel-level annotations for 30 classes
  - NYUv2 (NYU-Depth V2): a variety of indoor scenes recorded with both RGB and depth cameras
Video Understanding
Video Classification (not action classification)
- : task of producing a label that is relevant to the video given its frames
Video Prediction
Video Super-Resolution
Video Compression
Human Pose Estimation
- Dataset
  - Human3.6M: a motion-capture dataset with accurate 3D joint positions (3.6 million human poses) and high-resolution video
Video Depth Estimation
- : task of estimating depth
- Dataset
  - NYUv2(NYU-Depth V2)
