Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00001428.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pair of headphones on a desk.

Visual question: He said like, you know, kind of try to avoid the dog, or whatever.

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. phone

Reasons why answers differ:

Image captions:

  1. A beats by dre speaker sits on display
  2. A pair of speakers is right on the display case.
  3. A picture of a large black speaker for sale for $199.
  4. A speaker that costs $200 is on display.
  5. some sort of stereo system that is black and red

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 2: VizWiz_train_00017943.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a book on the floor.

Visual question: I can't.

Answers:

  1. box labeled card
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. card
  9. card
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a capsules used as medication for diabetes in a packet
  2. A package of A card brand medication on a table.
  3. A package of medication is on top of a table.
  4. A white and purple box of allergy medication opened on the couch.
  5. For medicine with name of the product card is shown bu here.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_val_00003195.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a restaurant.

Visual question: what is in this bag?

Answers:

  1. laundry detergent
  2. unanswerable
  3. unanswerable
  4. unsuitable
  5. unsuitable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a warning label from a product printed in several languages
  2. An orange back of a label that has warnings in English, French, and Spanish
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Warning information for something important one must use on a regular basis.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 4: VizWiz_train_00014401.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A painting of an apple on a table.

Visual question: What is this?

Answers:

  1. leaf
  2. leaf
  3. leaf ant nut
  4. leaf
  5. leaf
  6. acorn oak leaf
  7. leaf
  8. leaf
  9. left
  10. leaf

Reasons why answers differ:

Image captions:

  1. A brown leaf with a fruit or nut laid on top of it
  2. a green birch leaf with a small round fruit on top of it
  3. A leaf from a tree sitting on a table, and on top of the leaf is a yellow object which is probably a citrus fruit
  4. A small artificial leaf with a fake fruit attached.
  5. I see a leaf laying on the table with a ball

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 5: VizWiz_train_00009117.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup that is sitting in the sand.

Visual question: What is this?

Answers:

  1. hair gel
  2. shower gel
  3. shower gel
  4. shower gel soap
  5. gel
  6. gel
  7. shower gel
  8. unsuitable
  9. shower gel
  10. shower gel

Reasons why answers differ:

Image captions:

  1. A clear bottle of gel is on the floor, the gel has its description and title written on it in text.
  2. A clear bottle of shower gel with a squeeze top.
  3. A clear bottle with brown lettering contains shower gel.
  4. A white bottle of shower gel with orange letters on a marble like surface.
  5. clear shower gel in white container, burnt orange writing

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_train_00003207.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cat is laying on a dark floor.

Visual question: What does this can contain?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. image very blurry
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. peas
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 7: VizWiz_train_00001218.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a train on a table.

Visual question: What is on that box?

Answers:

  1. capn crunch oops all berries
  2. captain crunch
  3. oops all berries
  4. capn crunch
  5. captain crunch cereal
  6. capn crunch
  7. capn crunch
  8. capn crunch
  9. berry capn crunch
  10. serial

Reasons why answers differ:

Image captions:

  1. A Cap'N Crunch Oops! All Berries cereal box.
  2. Blue box of cereal with Cap'N Crunch animated man.
  3. Blue cardboard cereal box with a cartoon character and photo of the cereal.
  4. Box of Captain Crunch Oops! a Berry cereal.
  5. CAP'N CRUNCH'S Oops! All Berries sweetened corn and oat cereal.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00020847.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A finger is pressing a control panel of a dishwasher.
  2. A person is touching the speed button on a food processor appliance.
  3. A person using the setting buttons on a kitchen machine.
  4. a woman's finger is touching a speed button on some type of appliance made of metal
  5. Several buttons are on the black electronic device.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 9: VizWiz_train_00023390.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A man in a large witch hat with a long white beard.
  2. A man is wearing a cone fabric large sun hat and has a serious look on his face and a long beard, long hair, and a gray shirt.
  3. A photo of a man with a large pointed hat and a long white beard.
  4. a picture a character in a TV series
  5. A thin man with a long white beard and long grey locks flowing from underneath a large black rimmed hat.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 10: VizWiz_train_00017839.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bunch of books sitting on a shelf.

Visual question: What Harry Potter book is this?

Answers:

  1. goblet fire
  2. goblet fire
  3. goblet fire
  4. goblet fire
  5. harry potter goblet fire
  6. goblet fire
  7. harry potter goblet fire
  8. goblet fire
  9. goblet fire
  10. goblet fire

Reasons why answers differ:

Image captions:

  1. a bookshelf with games and a Harry Potter book of DVD package in the foreground
  2. A box for a Harry Potter book on tape is standing on a shelf with other tapes and boxes.
  3. a Harry Potter book with other books on a table with a box under the table
  4. A stack of wrestling DVDs, Harry Potter books and novelty toys on a shelf.
  5. Harry Potter and the goblet of fire audiobook by JK Rowling and voiced by Jim Dale sits on a shelf

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 11: VizWiz_val_00005004.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer screen is displaying a window with a picture
  2. Here is a picture of something very bright
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Very colorful but I don't know what I'm looking at.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 12: VizWiz_train_00014118.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A red and white picture of a stop sign.

Visual question: Could you please tell me what this says?

Answers:

  1. video images
  2. video imagessales service rentals integration installation
  3. video images sales serviced rentals integration installation
  4. video images
  5. video images sales service rentals integration installation
  6. video images sales service rentals integration installation
  7. sales service rentals integration installation
  8. video images
  9. video images
  10. i video images

Reasons why answers differ:

Image captions:

  1. A close up that save video images on it.
  2. A red and white sign for video images
  3. A red front cover with the white words video inside of it
  4. A red label for video images, offering sales, service, rentals, integration, and installation.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 13: VizWiz_train_00004247.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a wall in a bathroom.

Visual question: What is this?

Answers:

  1. smoke alarm on ceiling
  2. smoke alarm
  3. ceiling
  4. smoke detector
  5. bulb holder picture
  6. camera
  7. ceiling
  8. alarm
  9. ceiling
  10. ceiling

Reasons why answers differ:

Image captions:

  1. A picture of three cupcakes and the words Cherry Cakes.
  2. an image of a fire notifier/alarm in a house
  3. On the white ceiling is a rounded out square fire alarm.
  4. Quality issues are too severe to recognize visual content.
  5. The flat surface is a ceiling and has a shower on it for fires.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 14: VizWiz_train_00020355.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. 2 pound Package of frozen Chicken Breast Strips
  2. A bag of chicken breast strips that are frozen.
  3. A close up shot of a label on a bag of chicken breast strips.
  4. A pack of chicken that has not been cooked yet.
  5. Clear plastic packaging with chicken breast strips inside.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 15: VizWiz_val_00006167.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A silver and purple controller with a purple dial.
  2. A white person holding a Revlon electric that is pink and has a lock and unlock.
  3. An electronic device with the words Revlon and a toggle.
  4. Someone is holding a Revlon personal care product.
  5. The electronics is a charger that is purple and silver and has a long cord.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 16: VizWiz_train_00014364.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a microwave on a table.

Visual question: What does this say?

Answers:

  1. whiskers paws
  2. whiskers paws
  3. unsuitable
  4. whiskers paws
  5. unanswerable
  6. unsuitable
  7. whiskers paws
  8. whiskers paws
  9. whiskers paws
  10. whiskers paws

Reasons why answers differ:

Image captions:

  1. A pet calendar is sitting on a person's lap.
  2. A small calendar about cats and dogs is sitting on your lap.
  3. A very wonderful view and worth seeing at all times, my friend
  4. A whiskers and paws flip book sitting on top of a pair of legs in black shorts.
  5. Some type of packaging sitting on a person's lap.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 17: VizWiz_train_00000395.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A table with a vase of flowers in it.

Visual question: What is this?

Answers:

  1. flowers
  2. flower
  3. purple flowers
  4. irises
  5. flowers
  6. flower centerpiece
  7. flowers
  8. flower
  9. floral centerpiece
  10. flowers

Reasons why answers differ:

Image captions:

  1. A man sitting behind a white table with purple and yellow flowers in a clear vase with green marbles.
  2. A person in a black shirt sitting at a white table with purple and green flowers on it.
  3. a vase on a table full of yellow and purple flowers
  4. Beautiful table centerpiece of purple lilies in full bloom.
  5. Purple flowers in a glass vase for a table centerpiece.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 18: VizWiz_train_00015916.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a wall.

Visual question: What is the expiration date?

Answers:

  1. oct 14 2011
  2. oct 14 2011
  3. oct 14 2011
  4. oct 4th 2011
  5. 2011
  6. oct 14 2011
  7. october 14 2011
  8. october 24 2011
  9. oct 14 2011
  10. oct 2011

Reasons why answers differ:

Image captions:

  1. A box of Clover organic Unsalted Butter.
  2. A packet of organic, unsalted butter, net weight 16 oz or 454 g, from Clover Organic Farms, with a sell by date of October 14, 2011, laying on a dark colored surface.
  3. A pound of organic farms butter from Clover Hill organic farms.
  4. Clover Organic Farms organic unsalted butter, turned on the side with the sell by date
  5. The end of a butter container is displayed against a black background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_train_00012564.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a black and white toothbrush.

Visual question: What is this?

Answers:

  1. fan
  2. fan
  3. fan
  4. fan
  5. heater
  6. air purifier
  7. machine
  8. fan
  9. dont know
  10. fan

Reasons why answers differ:

Image captions:

  1. A air conditioning machine that is placed on a window sill.
  2. a light grey standing fan in a windowsill
  3. A small portable air conditioning fan is seen in a white hallway entrance.
  4. A tower fan is sitting on a window seat, beside brown patterned curtains.
  5. Tall black oscillating fan with cord wrapped around stand

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 20: VizWiz_train_00014269.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is sitting on a table.

Visual question: What is this product?

Answers:

  1. medicine
  2. advil
  3. unanswerable
  4. milk
  5. usda recommended servings label
  6. medicine
  7. unanswerable
  8. medicine
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A box that is white with drug facts listed.
  2. a manual with written instructions on how to operate some equipment
  3. Label of medication, but image is too blurry to read words.
  4. Quality issues are too severe to recognize visual content.
  5. The drug facts for a menthol-based drug sitting on some sort of fabric

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 21: VizWiz_train_00014230.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a table.

Visual question: What is this?

Answers:

  1. moist towelettes
  2. moist towelettes
  3. moist towelettes
  4. towelettes
  5. towelettes
  6. antibacterial moist towelettes
  7. home 360 antibacterial moist towelettes
  8. antibacterial moist towelettes
  9. antibacterial moist towelettes
  10. moist towelettes

Reasons why answers differ:

Image captions:

  1. A black and yellow package of antibacterial wipes.
  2. A black bottle with a white top and an oval home360 label
  3. A container of citrus scented, home 360 branded, anti-bacterial moist towelettes
  4. A container of Home 360 Antibacterial hand wipes
  5. A plastic container of antibacterial moist towelettes with a black and yellow label.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 22: VizWiz_val_00000739.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is this?

Answers:

  1. computer screen
  2. unsuitable image
  3. unsuitable image
  4. unsuitable image
  5. unsuitable image
  6. unsuitable image
  7. unsuitable image
  8. unsuitable image
  9. unsuitable image
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. image quality is poor to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 23: VizWiz_val_00003834.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person is holding a bottle of beer.

Visual question: What is this a model of?

Answers:

  1. soda
  2. sprite
  3. soda bottle
  4. sprite
  5. unanswerable
  6. sprite soda
  7. sprite
  8. sprite bottle
  9. unanswerable
  10. sprite

Reasons why answers differ:

Image captions:

  1. A hand holding a 10 fluid ounce bottle of Sprite in a bedroom.
  2. A hand holding a very small Sprite bottle with black cat
  3. A person is holding a small bottle of Sprite.
  4. A small green bottle of soda has a green label
  5. Image shown a bottle in a finger tip.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 24: VizWiz_val_00006625.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black leather pad is on a table surface.
  2. black half circle with black stitching around the edge, with a white surface behind it
  3. black hat or cloth I don't know actually with other white something
  4. Quality issues are too severe to recognize visual content.
  5. Round black leather stool with stitching around the edge

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 25: VizWiz_train_00007229.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bathroom with a tile floor sitting on top of it.

Visual question: What is this?

Answers:

  1. bathroom scale
  2. scale
  3. weigh scale
  4. scale
  5. bathroom scale
  6. scale
  7. scale
  8. scale
  9. this weight machine
  10. scale

Reasons why answers differ:

Image captions:

  1. A bathroom with beige tiles with a blue and white checkered scale and throw rugs on the floor.
  2. A blue and white checkered scale in the bathroom.
  3. A floor scale on a tile floor, probably a bathroom.
  4. Pale bathroom with two rugs and a square scale on a tiled floor.
  5. White tiled bathroom with two mats and a blue scale.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 26: VizWiz_train_00003408.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a table.

Visual question: What are these?

Answers:

  1. cd roms
  2. blank cds
  3. unsuitable
  4. music cd r
  5. music cds
  6. music cd r
  7. cd
  8. dvds
  9. memorex cd r
  10. music cd r

Reasons why answers differ:

Image captions:

  1. A blurry photo of a pack of CDs that is upside down.
  2. A box contains multiple CDs for use in burning
  3. A CD-R music disc with 700 mb capacity in its case
  4. A music cd of 700 mb on top of a counter.
  5. A package of eighty Memorex recordable compact discs

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 27: VizWiz_train_00000921.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: What is this?

Answers:

  1. panadol cold flu
  2. medication
  3. gum
  4. medicine
  5. cold flu medicine
  6. cold flu tablets
  7. panadol cold flu
  8. panadol
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A container of cold and flu medicine pills
  2. A hand holding a green and blue box of cold & flu medicine.
  3. A person is holding medicine in a green box for the cold and flu.
  4. A person is holding up a box of cold and flu pills.
  5. Someone is holding a box of cold and flu medication.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_val_00002321.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person sitting next to a bottle of water.

Visual question: What is this product?

Answers:

  1. unanswerable
  2. tequila
  3. sauna tequila
  4. sauna tequila
  5. alcohol
  6. sauna gold
  7. alcohol
  8. i dont know
  9. alcohol
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a bottle of yellow liqueur laying on a carpeted floor
  2. A clear glass bottle of alcohol laying front down on a light grey shaggy carpet.
  3. A glass bottle of Sauza Gold alcoholic beverage.
  4. A glass bottle with liquid laying on a tan carpet near a person.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 29: VizWiz_val_00001887.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a cell phone in their hand.

Visual question: Fuck this

Answers:

  1. unanswerable
  2. nope
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. phone
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A person holding a small black cell phone
  2. A person is holding a cellphone in his hand.
  3. A person's hand holds an old-school cell phone over their floral-pattern couch.
  4. MEN Hands cell phone front camera keypad torch light
  5. Someone holding a cell phone with push buttons.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 30: VizWiz_train_00015533.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of an airplane on a table.

Visual question: What's that?

Answers:

  1. marker
  2. dry board marker
  3. dry erase marker
  4. marker
  5. marker
  6. dry erase marker
  7. marker
  8. unanswerable
  9. marker
  10. marker

Reasons why answers differ:

Image captions:

  1. a dark green marker that ask to be recapped after use
  2. a large white marker with a green cap
  3. Large dry-erase or regular marker in teal color
  4. Some kind of marker, with a green cap.
  5. white tube with a green lid and black and yellow writing

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 31: VizWiz_val_00002182.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a stop sign on a table.

Visual question: What is this?

Answers:

  1. coaster
  2. coaster
  3. coaster
  4. estrella barcelona
  5. estrella
  6. coaster
  7. coaster
  8. estrella barcelona
  9. says barcelona estrella
  10. estrella

Reasons why answers differ:

Image captions:

  1. a red circular object reading "estrella damm Barcelona", probably a drink coaster
  2. A red fabric ornament with Barcelona written on it fancy is sitting on a wooden brown table.
  3. A round red coaster that is a little worn out and that says Barcelona.
  4. A round red placeholder that says Bar CEO ONA is on the table by the money.
  5. imagine how you would describe this image on the phone to a friend.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 32: VizWiz_val_00001159.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Watch this!

Answers:

  1. computer
  2. ok
  3. unanswerable
  4. unanswerable
  5. unsuitable image
  6. unanswerable
  7. unanswerable
  8. unsuitable image
  9. unanswerable
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer screen is on its home screen and in a dark room
  2. A computer screen that is on and showing someone's Desktop
  3. A laptop is lit up with the Windows desktop icons.
  4. I see a lot up computer screen with a cord hanging over it.
  5. Laptop screen showing various desktop icons in a dark room.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 33: VizWiz_train_00001836.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bench on a table.

Visual question: What item is this?

Answers:

  1. label instructions
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. potato chips
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A black bag of chips is unopened facing away from the camera.
  2. A crushed metal shaped item that has information
  3. A package of food is on top of a table.
  4. A ready to cook meal is seen in the picture.
  5. The backside of a packaging bag with red gold and black.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 34: VizWiz_train_00017875.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white toilet on a wall.

Visual question: Tell me please the color of this shirt.

Answers:

  1. white
  2. white
  3. white
  4. white
  5. white
  6. white
  7. white
  8. white
  9. white
  10. white

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A piece of white fabric with a number of long vertical creases in it; similar to a tissue.
  2. A white shower curtain is hanging from a rod.
  3. it looks like white cotton cloth they used for screen or stitching purpose.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 35: VizWiz_val_00005494.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bed against a blank wall next to a box or stand.
  2. A bed with a blue comforter and a black pillow and blanket, on a wooden floor and a white side table and white wall.
  3. A bed with a blue sheet and a pillow on it.
  4. A bed with black pillow cases and blue sheet.
  5. disordered bed, a person seems to be lying down.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 36: VizWiz_train_00008791.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of food on a table.

Visual question: What's in this packet please?

Answers:

  1. curried sausages recipe sauce
  2. curry flavor
  3. curried sausages
  4. curried sausages
  5. curried sausage
  6. curried sausages
  7. curried sausages
  8. curried sausages
  9. curried sausages recipe base
  10. curried sausages

Reasons why answers differ:

Image captions:

  1. A bag of curried sausages from the brand Masterfoods.
  2. A bag of Masterfoods curried sausages recipe base.
  3. A package of curried sausages in a beige package.
  4. a package of Masterfoods curried sausages recipe base
  5. image quality is high to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 37: VizWiz_val_00002808.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person is holding a plastic bag of food.

Visual question: What kind of muffin is this?

Answers:

  1. blueberry
  2. blueberry
  3. wild blueberry
  4. wild blueberry
  5. wild blueberry
  6. wild blue berry
  7. wild blueberry
  8. wild blueberry
  9. blueberry
  10. wild blueberry

Reasons why answers differ:

Image captions:

  1. A hand holding a blueberry muffin in a clear plastic wrap
  2. A hand holding a wild blueberry muffin in a cellophane wrapper next to a computer keyboard.
  3. a single wrapped Wild Blueberry muffin from Clover Hill
  4. Cloverhill wild blueberry muffin blue and clear packaging
  5. Very nicely taken the picture and focus point of the object is nice.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_train_00010696.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pizza on a table.

Visual question: Okay let's try this again, what is the brand name for the lasagna?

Answers:

  1. unsuitable
  2. unsuitable
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A blue shrink wrapped package of meat lasagna.
  2. A frozen meat and lasagna with 4 cheese meal
  3. A package of frozen lasagna is sitting on a counter.
  4. A package of pre-made meat and cheese lasagna with a blue and red label showing an image of the product on the front.
  5. Shows a blue package of a lasagna TV dinner

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00003035.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A street sign sitting on top of a building.

Visual question: What's the name of the CD?

Answers:

  1. rih
  2. unanswerable
  3. unanswerable
  4. rihanna
  5. rihanna
  6. ria
  7. unanswerable
  8. rihanna
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a cd case or book cover with rih
  2. A dark book with a photo of someone's hands and arms and blue text across it.
  3. A novel lays face-up on a light wooden surface.
  4. The left part of a book or some type of media is visible on a wood surface.
  5. The upper left quadrant of a CD case is in black with light blue lettering.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 40: VizWiz_train_00017690.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding an umbrella.

Visual question: What is this item?

Answers:

  1. french roast
  2. french roast coffee packet
  3. french roast single cup bag
  4. coffee bag
  5. coffee
  6. french roast coffee
  7. coffee beans
  8. coffee
  9. coffee
  10. french roast coffee

Reasons why answers differ:

Image captions:

  1. A person with a pack of French roast coffee.
  2. a small white plastic packet of coffee with brown graphics and black lettering.
  3. A white bag of French Roast coffee beans.
  4. French roast coffee sitting on someone's red shorts.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00008630.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A piece of books sitting on a table.

Visual question: Can you please tell me what this is. Thanks

Answers:

  1. candy
  2. blueberry nut
  3. unsuitable
  4. blueberry nut
  5. blueberry nut
  6. blueberry nut
  7. blueberry nut
  8. blueberry nuts
  9. target blueberry nut
  10. blueberry nut

Reasons why answers differ:

Image captions:

  1. a ash color theme packed with a yellow label
  2. a black bag of blueberry nuts nutrition facts
  3. A silver bag containing Blueberry nut is displayed on a countertop.
  4. Back of a blueberry nut flavored product that is showing the nutritional facts.
  5. The back side of a package of a food item.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_train_00004119.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a man in a mirror.

Visual question: Who is the guy?

Answers:

  1. chiang kai shek
  2. mexican
  3. unanswerable
  4. ho chi mein
  5. unanswerable
  6. unknown
  7. historical figure
  8. unanswerable
  9. unanswerable
  10. picture

Reasons why answers differ:

Image captions:

  1. A picture of a man in a blue button up shirt.
  2. A picture of a man wearing a blue button up shirt
  3. An image of the Chinese historical figure Sun Yat-sen
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 43: VizWiz_train_00019367.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a clock on a table.

Visual question: Did a pack of chips?

Answers:

  1. unsuitable
  2. unanswerable
  3. yes
  4. yes
  5. yes
  6. yes
  7. yes
  8. yes
  9. yes
  10. yes

Reasons why answers differ:

Image captions:

  1. A bag of snacks sits in a cupboard.
  2. A package of deli style lime and black pepper potato chips.
  3. A potato chip bag sitting on a table.
  4. An airplane meal tray containing a bag of potato chips
  5. an unopened bag of flavored potato chips on a table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 44: VizWiz_val_00000343.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is on this page?

Answers:

  1. dollar amounts
  2. account statement
  3. numbers
  4. i dont know
  5. numbers
  6. unanswerable
  7. numbers
  8. ledger
  9. bank statment
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A paper of a spreadsheet of figures is on a wooden surface.
  2. a piece of white paper with lines with number on the lines like a monthly bill
  3. A table of numbers or part of an expense sheet.
  4. A textbook contains mostly numbers and account information
  5. Stack of papers that look like receipts on a wood table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 45: VizWiz_val_00000447.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Hello there, could you give me an idea of what's on the screen? Please be as specific as possible. Thank you.

Answers:

  1. error message
  2. windows recovery system error message
  3. error message
  4. error message
  5. open window saying problem
  6. unanswerable
  7. unanswerable
  8. error message
  9. unsuitable image
  10. error message

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor is turned on and has a dialogue box.
  2. Computer screen with a blue light and small data window.
  3. Part of the screen of a computer monitor indicating that a username or password is incorrect.
  4. Quality issues are too severe to recognize visual content.
  5. someones laptop with some kind of error messages popped up

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_train_00001995.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A lamp sitting on top of a table.

Visual question: Is my light on?

Answers:

  1. yes
  2. yes
  3. yes
  4. yes
  5. yes
  6. yes
  7. yes
  8. yes
  9. yes
  10. yes

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A lamp is atop of a beige table inside of a room.
  2. a lit lamp that is sitting on an end table
  3. a white lamp turned on in front of a blue wall
  4. An end table with a lit lamp, a couple of bottles, and other miscellaneous items on it.
  5. I see a lamp near a water bottle on the table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 47: VizWiz_train_00006790.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a laptop computer on a table.

Visual question: What is the expiration date?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unclear
  5. no
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A packaged product is on top of a wooden surface, the product is has its text description written on it.
  2. Cooking directions on the box of a microwaveable food item.
  3. Could be a package with instructions for installation of the content.
  4. Quality issues are too severe to recognize visual content.
  5. The back of a package of ready to cook foods has the cooking instructions on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00017182.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person ' s hand holding a stop sign.

Visual question: What is this food, please?

Answers:

  1. 20 beef meatballs
  2. beef meatballs
  3. meatballs
  4. beef meatballs
  5. 20 beef meatballs
  6. meatballs
  7. meatballs
  8. beaf meat food
  9. 20 beef meatballs
  10. beef meatballs

Reasons why answers differ:

Image captions:

  1. a hand holding a container of beef meatballs
  2. clear plastic packaging with a blue label and white writing
  3. hand holding package of 20 beef meatballs 100% British beef
  4. Person's hand holding unopened container of uncooked meatballs
  5. Someone is holding a frozen package of 20 beef meatballs in their hand.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00007614.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a bottle of water.

Visual question: This can.

Answers:

  1. chunk light tuna
  2. chuck light tuna
  3. tuna
  4. chunk light tuna
  5. chunk light tuna
  6. tuna
  7. unanswerable
  8. chunk light tuna
  9. chunk light tuna
  10. tuna

Reasons why answers differ:

Image captions:

  1. A can of chunk light tuna being held by someone's left hand.
  2. A can of chunk tuna being held by a man above a round table.
  3. A person is holding a can of tuna in their hand.
  4. A person is standing next to a table, holding a can of tuna.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 50: VizWiz_val_00000639.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is that a picture of?

Answers:

  1. side open mouth
  2. face mouth
  3. mouth
  4. mouth
  5. mouth
  6. mouth
  7. mouth
  8. mouth
  9. mouth
  10. mouth

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a close up of half of a person's mouth
  2. A close up of the corner of a person's open mouth.
  3. A half opened mouth showing teeth and rosy lips
  4. A mouth is really close to your personal space!
  5. A zoomed in image of someone's open mouth and lips

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Showing images 0 - 0 out of 0 matching images.