Datasets for Computer Vision (4)

Dec 09, 2024, 07:43 PM

*Memos:

  • My post explains MNIST, EMNIST, QMNIST, ETLCDB, Kuzushiji and Moving MNIST.
  • My post explains Fashion-MNIST, Caltech 101, Caltech 256, CelebA, CIFAR-10 and CIFAR-100.
  • My post explains Oxford-IIIT Pet, Oxford 102 Flower, Stanford Cars, Places365, Flickr8k and Flickr30k.

(1) ImageNet(2009):

  • has 1,331,167 object images (1,281,167 for training and 50,000 for validation), each labeled with one of 1,000 classes: *Memos:
    • Each class has one or more names that refer to the same thing.
    • You can download ILSVRC2012_devkit_t12.tar.gz, ILSVRC2012_img_train.tar and ILSVRC2012_img_val.tar.
  • is ImageNet() in PyTorch.
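
As a minimal sketch of how the three archives and `ImageNet()` fit together: the helper below checks which of the files listed above are still missing from a folder, then builds the dataset through torchvision. The helper names (`missing_archives`, `load_imagenet`) and the `data/imagenet` folder are hypothetical, and the loader assumes torchvision is installed and the archives were downloaded manually.

```python
from pathlib import Path

# Archive names taken from the list above; they must be downloaded manually.
REQUIRED_ARCHIVES = (
    "ILSVRC2012_devkit_t12.tar.gz",
    "ILSVRC2012_img_train.tar",
    "ILSVRC2012_img_val.tar",
)


def missing_archives(root):
    """Return the names in REQUIRED_ARCHIVES that are not present in `root`."""
    root = Path(root)
    return [name for name in REQUIRED_ARCHIVES if not (root / name).is_file()]


def load_imagenet(root, split="train"):
    """Build torchvision's ImageNet dataset; `split` is "train" or "val"."""
    # Imported lazily so that missing_archives() works without torchvision.
    from torchvision.datasets import ImageNet

    return ImageNet(root=root, split=split)


if __name__ == "__main__":
    data_root = "data/imagenet"  # hypothetical folder
    missing = missing_archives(data_root)
    if missing:
        print("Download these first:", ", ".join(missing))
    else:
        train_data = load_imagenet(data_root, split="train")
        print(len(train_data))        # number of training images
        print(train_data.classes[0])  # tuple of names for the first class
```

Each entry of `classes` is a tuple of names, which matches the note above that one class can have several names.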

(2) LSUN(Large-scale Scene Understanding)(2015):

  • has scene images, split into 10 datasets: Bedroom, Bridge, Church Outdoor, Classroom, Conference Room, Dining Room, Kitchen, Living Room, Restaurant and Tower:
    • Bedroom has 3,033,342 bedroom images (3,033,042 for training and 300 for validation).
    • Bridge has 818,987 bridge images (818,687 for training and 300 for validation).
    • Church Outdoor has 126,527 church outdoor images (126,227 for training and 300 for validation).
    • Classroom has 168,403 classroom images (168,103 for training and 300 for validation).
    • Conference Room has 229,369 conference room images (229,069 for training and 300 for validation).
    • Dining Room has 657,871 dining room images (657,571 for training and 300 for validation).
    • Kitchen has 2,212,577 kitchen images (2,212,277 for training and 300 for validation).
    • Living Room has 1,316,102 living room images (1,315,802 for training and 300 for validation).
    • Restaurant has 626,631 restaurant images (626,331 for training and 300 for validation).
    • Tower has 708,564 tower images (708,264 for training and 300 for validation).
  • is LSUN() in PyTorch, but it has a bug.
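
A rough sketch of how the ten scene categories map onto the `classes` argument of torchvision's `LSUN()`, which expects strings such as `bedroom_train`. The helper names (`lsun_class_names`, `load_lsun`) are hypothetical, the loader assumes torchvision and the `lmdb` package are installed and the data is already on disk, and it does not attempt to work around the bug mentioned above.

```python
# Display names as listed in the text above.
LSUN_CATEGORIES = (
    "Bedroom", "Bridge", "Church Outdoor", "Classroom", "Conference Room",
    "Dining Room", "Kitchen", "Living Room", "Restaurant", "Tower",
)


def lsun_class_names(categories=LSUN_CATEGORIES, split="train"):
    """Turn display names such as "Church Outdoor" into "church_outdoor_train"."""
    if split not in ("train", "val"):
        raise ValueError(f"unknown split: {split!r}")
    unknown = [c for c in categories if c not in LSUN_CATEGORIES]
    if unknown:
        raise ValueError(f"unknown categories: {unknown}")
    return [f"{c.lower().replace(' ', '_')}_{split}" for c in categories]


def load_lsun(root, categories=LSUN_CATEGORIES, split="train"):
    """Build torchvision's LSUN dataset for the chosen categories and split."""
    # Imported lazily so that lsun_class_names() works without torchvision.
    from torchvision.datasets import LSUN

    return LSUN(root=root, classes=lsun_class_names(categories, split))


if __name__ == "__main__":
    print(lsun_class_names(["Church Outdoor", "Tower"], split="val"))
```

Selecting only a few categories keeps the download and disk footprint manageable, since Bedroom alone has over three million training images.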

(3) MS COCO(Microsoft Common Objects in Context)(2014):

  • has object images with annotations, split into 16 datasets: 2014 Train images and 2014 Val images with 2014 Train/Val annotations; 2014 Test images with 2014 Testing Image info; 2015 Test images with 2015 Testing Image info; 2017 Train images and 2017 Val images with 2017 Train/Val annotations, 2017 Stuff Train/Val annotations or 2017 Panoptic Train/Val annotations; 2017 Test images with 2017 Testing Image info; and 2017 Unlabeled images with 2017 Unlabeled Image info: *Memos:
    • 2014 Train images has 82,783 images.
    • 2014 Val images has 40,504 images.
    • 2014 Train/Val annotations has annotations for 123,287 images (82,783 for training and 40,504 for validation) in 2014 Train images and 2014 Val images.
    • 2014 Test images has 40,775 images.
    • 2014 Testing Image info has image info for the 40,775 images in 2014 Test images.
    • 2015 Test images has 81,434 images.
    • 2015 Testing Image info has image info for the 81,434 images in 2015 Test images.
    • 2017 Train images has 118,287 images.
    • 2017 Val images has 5,000 images.
    • 2017 Train/Val annotations has annotations for 123,287 images (118,287 for training and 5,000 for validation) in 2017 Train images and 2017 Val images.
    • 2017 Stuff Train/Val annotations has annotations for 123,287 images (118,287 for training and 5,000 for validation) in 2017 Train images and 2017 Val images.
    • 2017 Panoptic Train/Val annotations has annotations for 123,287 images (118,287 for training and 5,000 for validation) in 2017 Train images and 2017 Val images.
    • 2017 Test images has 40,670 images.
    • 2017 Testing Image info has image info for the 40,670 images in 2017 Test images.
    • 2017 Unlabeled images has 123,403 images.
    • 2017 Unlabeled Image info has image info for the 123,403 images in 2017 Unlabeled images.
  • is also called simply COCO.
  • is CocoDetection() or CocoCaptions() in PyTorch.
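
A minimal sketch of how an image set is paired with its annotation file when building `CocoDetection()` or `CocoCaptions()`. The folder layout (`train2017/`, `annotations/instances_train2017.json` and so on) is an assumption based on the names the official archives unpack to, the helper names (`coco_paths`, `load_coco`) and the `data/coco` folder are hypothetical, and the loader assumes torchvision and `pycocotools` are installed.

```python
from pathlib import Path


def coco_paths(root, year="2017", split="train", kind="instances"):
    """Return (image_dir, annotation_file) under the assumed layout.

    Assumed layout (not stated in the text above):
      <root>/<split><year>/                         e.g. train2017/
      <root>/annotations/<kind>_<split><year>.json  e.g. instances_train2017.json
    """
    if split not in ("train", "val"):
        raise ValueError(f"unknown split: {split!r}")
    if kind not in ("instances", "captions"):
        raise ValueError(f"unknown annotation kind: {kind!r}")
    root = Path(root)
    name = f"{split}{year}"
    return root / name, root / "annotations" / f"{kind}_{name}.json"


def load_coco(root, year="2017", split="train", captions=False):
    """Build CocoCaptions if `captions` is True, otherwise CocoDetection."""
    # Imported lazily so that coco_paths() works without torchvision.
    from torchvision.datasets import CocoCaptions, CocoDetection

    kind = "captions" if captions else "instances"
    image_dir, ann_file = coco_paths(root, year, split, kind)
    dataset_cls = CocoCaptions if captions else CocoDetection
    return dataset_cls(root=str(image_dir), annFile=str(ann_file))


if __name__ == "__main__":
    image_dir, ann_file = coco_paths("data/coco", "2014", "val", "captions")
    print(image_dir)
    print(ann_file)
```

Both classes take an image folder plus one annotation file, so the same images can be loaded for detection or for captioning by switching only the annotation file.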
