Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00007022.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A piece of pizza that is on a table.

Visual question: What is this box?

Answers:

  1. garlic 3 cheese texas toast
  2. 3 cheese garlic bread
  3. garlic bread
  4. bread
  5. texas toast
  6. garlic bread
  7. garlic bread
  8. garlic 3 cheese texas toast
  9. fg
  10. garlic toast

Reasons why answers differ:

Image captions:

  1. A box of garlic 3 cheese texas toast sits on a white cabinet next to a roll of paper towels.
  2. A box of garlic bread is sitting on a counter next to a single piece of sliced pepperoni and some pieces of cheese.
  3. A package of sliced baked breads is left on a table
  4. a photo of garlic 3 cheese bread texas toast.
  5. Breakfast from a box sits on the counter top.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 2: VizWiz_train_00009967.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cell phone sitting on top of a counter.

Visual question: Can you tell me what this is, please?

Answers:

  1. buckyballs
  2. magnetic balls
  3. buckyballs
  4. buckyballs
  5. magnetic desktop
  6. unsuitable
  7. magnetic desktop
  8. magnetic toy
  9. led lights
  10. buckyballs

Reasons why answers differ:

Image captions:

  1. A box containing a toy called Buckyballs that says, "The amazing magnetic desktoy you can't put down!"
  2. A package of Buckyballs magnetic balls laying on a tan shag carpet.
  3. A white and orange photo of packaging for an item called Buckyballs.
  4. Box of a desk toy called Buckyballs, with their slogan saying you won't be able to put it down
  5. Buckyballs magnetic desktop toy in box with plastic covering.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_train_00003268.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wall in a room.

Visual question: Are the lights on or off?

Answers:

  1. on
  2. on
  3. on
  4. on
  5. unsuitable
  6. on
  7. on
  8. on
  9. off
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A window built into the blue wall in the room.
  2. A window that has the blinds closed blocking an outside view.
  3. One curtain rod holder on left of window with closed louvered shade.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 4: VizWiz_train_00014361.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue and white tie.

Visual question: Thank you.

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. youre welcome
  6. welcome
  7. unanswerable
  8. unanswerable
  9. 3 musketeers
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. 3 Musketeers chocolate candy bar wrapper on a surface
  2. 3 MUSKETEERS IS Humorous, vain, slave to fashion, good-hearted; comical and jaunty in his sword fighting.
  3. A 3 Musketeers candy bar laying on a counter.
  4. a 3 musketeers candy bar that says now richer chocolate
  5. A package containing a 3 Musketeers candy bar.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_val_00002766.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a vase with flowers.

Visual question: Is this stripes or flowers?

Answers:

  1. flowers
  2. flowers
  3. flowers
  4. flowers
  5. flowers
  6. flowers
  7. flowers
  8. flowers
  9. flowers
  10. flowers

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue fabric has white and green flowers on it.
  2. a blue green and white cloth with design on it
  3. A green and blue floral pattern on a type of fabric.
  4. Bright blue fabric with green and white Hawaiian floral pattern.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 6: VizWiz_val_00001385.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person walking down a path near a river.

Visual question: Do you see the river from there, from this perspective?

Answers:

  1. yes
  2. yes
  3. yes
  4. yes
  5. yes
  6. yes
  7. yes
  8. yes
  9. yes
  10. yes

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A dark lake with sparse Fall trees and lush grass on its banks
  2. A outdoors area with a river or lake and cars on the other side.
  3. Creek bed that appears raised due to the rain
  4. green grass next to a lake with dead trees and blue sky
  5. Its cloudy Outside, there is a muddy lake surrounded with leafless trees.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 7: VizWiz_train_00023912.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: is it not?

Answers:

  1. unsuitable image
  2. unanswerable
  3. unanswerable
  4. unsuitable image
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unsuitable image
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 8: VizWiz_train_00013701.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of water.

Visual question: What is this product?

Answers:

  1. plastic bottle
  2. unsuitable
  3. water bottle
  4. unanswerable
  5. water
  6. water
  7. bottle
  8. water bottle
  9. water bottle
  10. water

Reasons why answers differ:

Image captions:

  1. a clear plastic bottle with an opaque plastic lid on it
  2. A water bottle is sitting right on top of the table.
  3. Quality issues are too severe to recognize visual content.
  4. Soda is in the plastic bottle sitting on the table.
  5. The top of a bottle of water in a person's kitchen.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 9: VizWiz_train_00003352.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dark picture of a black sky at night.

Visual question: Is this pajamas?

Answers:

  1. unsuitable
  2. no
  3. unsuitable
  4. no
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. An image that shows absolutely nothing, there was an error in the photograph.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 10: VizWiz_train_00020166.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black table with glass case holds a television set and a wireless house thermostat on top.
  2. A black television with a grey DVD player on top.
  3. A photo of an entertainment center that is reflecting a person's foot.
  4. An old black box TV sits on the ground unplugged with some other electronic devices on top of it
  5. the corner of a black media stand, sitting on carpet

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 11: VizWiz_val_00001210.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a table with a sign.

Visual question: What is this?

Answers:

  1. yogurt
  2. unanswerable
  3. yogurt
  4. yogurt
  5. yogurt
  6. yogurt
  7. yogurt
  8. yogurt
  9. yogurt
  10. yogurt

Reasons why answers differ:

Image captions:

  1. A 650g can or tub of yogurt with high calcium content.
  2. container of yogurt sets on a black swirled counter top
  3. Container or yogurt sitting on a gray surface.
  4. The top side of a white food container on a counter
  5. Top of a 650 gram container of 2% yogurt.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 12: VizWiz_train_00008457.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of beer.

Visual question: Can you see the writing on this bottle and if so what does it say?

Answers:

  1. milliliter measurements
  2. ergerg
  3. no
  4. no
  5. unsuitable
  6. i cannot back up
  7. unsuitable
  8. only writing shows countenance bottle
  9. unanswerable
  10. no

Reasons why answers differ:

Image captions:

  1. a container with liquid inside of it, red in color, on a desktop
  2. A red water bottle half filled with milliliter markings on the side
  3. Drink Container with Trees on it half full with the lid open
  4. Orange plastic water bottle filled with approximately 650 milliliters of water sitting on a wood counter.
  5. Translucent orange reusable water bottles decorated with small outlines of trees, filled halfway with water, on top of a wooden table with blue chairs in the background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 13: VizWiz_val_00004702.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up of the temperature dial of a white colored oven and it is set to 375 and to its left is a digital timer.
  2. A close-up of a button showing oven temperature set at 375 degrees.
  3. A small dial on an oven that controls the oven temperature.
  4. An oven temperature knob is set to 375 degrees.
  5. the dial of the oven with a temperature set at 375 degrees.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 14: VizWiz_train_00007423.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food on a table.

Visual question: Would you tell me what this is?

Answers:

  1. hamburger helper double cheeseburger macaroni
  2. double cheeseburger macaroni
  3. hamburger helper classic double cheeseburger macaroni
  4. hamburger helper
  5. hamburger helper box
  6. hamburger helper
  7. hamburger helper double cheeseburger macaroni
  8. hamburger helper
  9. hamburger helper
  10. hamburger helper

Reasons why answers differ:

Image captions:

  1. A box of hamburger helper of classic cheeseburger macaroni.
  2. A cardboard box of a popular hamburger heat-and-eat dinner rests on a counter top.
  3. A picture of what appears to be some food.
  4. A well packed of hamburger is placed on the table.
  5. A yellow and red box of Hamburger Helper dried foods on top of a tan laminate surface.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 15: VizWiz_train_00009294.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a tree with trees.

Visual question: What color are those curtains.

Answers:

  1. black
  2. green whte
  3. white green
  4. white green leaf pattern
  5. green white
  6. green white
  7. pink green
  8. white green
  9. white floral pattern
  10. green white brown

Reasons why answers differ:

Image captions:

  1. A white piece of fabric containing images of trees and leaves.
  2. A white piece of paper has a drawing of green leaves
  3. it's a photo of a lake and some grass plants
  4. Quality issues are too severe to recognize visual content.
  5. The reflection of tree leaves over a rippling pond.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 16: VizWiz_train_00011653.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a drink.

Visual question: Hi and what kind of safe is this? Thank you.

Answers:

  1. unanswerable
  2. peach
  3. peach pie filling
  4. peach pie topping
  5. can peach pie filling
  6. unanswerable
  7. unanswerable
  8. peach topping
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A tin can of peaches being held by someone.
  2. IMAGE WAS UNCLEAR BUT IT IS NOT ITEM
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 17: VizWiz_train_00020737.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Can all nature tertiary processed food mix vegetable soup.
  2. Cans of food are shown sitting on top of a microwave
  3. Image is of two cans of Campbell's Select Harvest soup, Southwestern Style Vegetable and Minestrone with Whole Grain Pasta.
  4. Three cans of soup sitting on a white object with grids.
  5. THREE SMALL CANS OF FOOD ON A MICROWAVE

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 18: VizWiz_val_00005792.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A green package or box of passion herbal tea.
  2. A green plate with letter written in white with a round label
  3. lemon blueberry tea circular on green item with a red banner and brand in the middle
  4. Part of the label of a package of Timothy's brand herbal tea appears, showing the words "lemon" and "passion tea".
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 19: VizWiz_train_00011895.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a picture of a book.

Visual question: I am told that the flavor of this food is invisible. I do not understand. How does invisible kool aid not have a specific flavor?

Answers:

  1. unanswerable
  2. sugar flavor
  3. unsuitable
  4. idont know
  5. yes
  6. unanswerable
  7. unanswerable
  8. unsuitable
  9. no question yes invisible meaning no color not flavor that not mentioned
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A Kool aid invisible flavor packet on a person's blue jeaned lap.
  2. A very blurry photo of a packet of kool aid mix.
  3. candy with attractive rapper to eat and enjoy
  4. Kool Aid drink packet on someone's leg who is sitting down
  5. Packets of Kool Aid like this will keep you bouncing off the walls for weeks.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 20: VizWiz_train_00023005.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black pair of pants and another clothing are on a bed sheet.
  2. A floral cloth and cargo shorts are folded on a bed.
  3. clothing laid out, some black pants with buttons on the pocket, a paisley fabric that has golds, blues and reds and white fabric in the background.
  4. Dark green cotton pants and a multi-color paisley print top are shown on a white fabric background.
  5. Two pieces of clothing, one sitting on top of the other, both sitting on top of cloth

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 21: VizWiz_val_00004132.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a car that is seen.

Visual question: What setting is this?

Answers:

  1. fn
  2. fn
  3. f n
  4. unsuitable
  5. fn
  6. function
  7. unanswerable
  8. function button
  9. fn
  10. function

Reasons why answers differ:

Image captions:

  1. Bottom left corner key on a keyboard which represents the function key.
  2. Close up picture of the function key on a keyboard.
  3. someone's keyboard with the letters FN on it
  4. The "fn" or "function" button on a computer keyboard.
  5. The key on the keyboard appears on the left side and has the letters ln on it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 22: VizWiz_train_00001491.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of food is on a counter.

Visual question: Hey can you tell me how much this weighs? Thanks.

Answers:

  1. unanswerable
  2. unsuitable
  3. we
  4. unanswerable
  5. unanswerable
  6. 2 oz
  7. no
  8. unanswerable
  9. 12 oz
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A picture of a pack of Kroger home sense facial tissues.
  2. a small package of Kroger home sense facial tissues
  3. a small package of napkins that you can use for cleaning
  4. A small plastic package of 2-Ply tissues is on the marble table.
  5. Some facial tissues are in a package on a counter.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_train_00015754.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A chair that is sitting in a room.

Visual question: What is that?

Answers:

  1. chair
  2. chair
  3. chair
  4. chair
  5. chair
  6. office chair
  7. chair
  8. office chair
  9. desk chair
  10. office chair

Reasons why answers differ:

Image captions:

  1. A black computer chair with no handles and adjustable height
  2. A computer chair that has no arms and is black
  3. An office chair with lumbar support and no armrests.
  4. Black mid sized computer chair with no arm rests sitting in the middle of a room.
  5. the corner of a persons room a computer chair in front of a messy shelf

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 24: VizWiz_train_00010205.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wall on a table.

Visual question: What is this?

Answers:

  1. tarot card
  2. unanswerable
  3. taro card
  4. unsuitable
  5. unanswerable
  6. card
  7. card
  8. book
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a gold cover of a book with blue and yellow writing on it
  2. A Moon tarot card XVIII against a tan background.
  3. A playing card with the roman numerals xviii written on the left side of it.
  4. Quality issues are too severe to recognize visual content.
  5. Tarot cards in this box have such interesting designs.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 25: VizWiz_val_00000300.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Please can you tell me what this writing says on the picture that I've taken a photo of, thank you?

Answers:

  1. burglar alarm
  2. unanswerable
  3. unsuitable image
  4. unsuitable image
  5. no
  6. beat burglar tips
  7. tips
  8. beat burglar
  9. beat burglar
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A billboard has a lot of papers attached to it.
  2. A sheet is on the wall with basic tips about beating a burglar.
  3. A typed notification is hanging on a wall.
  4. A white sheet with many black words on it near a colorful picture.
  5. Some text talks about what to do if there is a burglar.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 26: VizWiz_train_00000590.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a book with a dog on it.

Visual question: Please tell me what is in that pocket?

Answers:

  1. pepperoni cheese
  2. cookies
  3. cookies
  4. cookies
  5. cookies
  6. cookies
  7. picture cut off cant tell
  8. unanswerable
  9. unanswerable
  10. cookie

Reasons why answers differ:

Image captions:

  1. A box of cookies with a red and green label that are gluten free.
  2. A box of gluten free snack cakes sits on a kitchen counter.
  3. A box with two main color that is red and green is placed on a high place.
  4. A gluten-free, egg free tart box resting on the counter
  5. A green and red box of gluten, wheat, and egg free cookies.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 27: VizWiz_train_00008397.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A pair of scissors sitting on top of a table.

Visual question: What's in this box?

Answers:

  1. cereal
  2. cereal
  3. cereal
  4. cereal
  5. cereal
  6. breakfast cereal
  7. cereal
  8. cereal
  9. dry cereal
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box of cereal with a bearded man wearing a crown and holding a spoon on the front
  2. A box of cereal with a cartoon character on the front.
  3. A cartoon drawing of a crowned, white-bearded King, brandishing an oversized spoon.
  4. A cartoon image of a king with a long white beard holding a giant spoon.
  5. The front of a cereal box with a picture of a king holding a spoon

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_train_00012432.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A glass of wine sitting on top of a wooden table.

Visual question: What is this?

Answers:

  1. glass water
  2. glass water
  3. glass water
  4. glass water
  5. water glass
  6. glass water
  7. glass water
  8. glass water
  9. glass
  10. cup

Reasons why answers differ:

Image captions:

  1. A glass of water on top of a coaster with a flower drawing.
  2. A very wonderful view and worth seeing at all times, my friend
  3. An small glass sitting on a table with a coaster underneath it.
  4. Clear glass cup filled halfway with water on a coaster with a white flower and red coloring.
  5. glass of water on top of a coaster on a red wooden counter top

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 29: VizWiz_train_00000125.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: What is the expiration date on this milk?

Answers:

  1. unanswerable
  2. november
  3. unanswerable
  4. unsuitable
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unavailable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A carton of vanilla almond milk with information on it.
  2. A close up of a label on vanilla almond milk.
  3. A label of Almond milk vanilla flavor is showing.
  4. A photo of a box of organic almond milk vanilla that has writing on it.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 30: VizWiz_train_00012654.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a remote control in their hand.

Visual question: What does this say?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unanswerable
  6. unsuitable
  7. not clear
  8. unsuitable
  9. unsuitable
  10. cant see too small to read

Reasons why answers differ:

Image captions:

  1. A barefoot person holding a small white bottle.
  2. A hand is holding a pill bottle showing part of the dosage information label.
  3. A person holding a bottle of medication in their left hand.
  4. the back of a plastic bottle of medication with directions
  5. The instructions label on the back of a bottle of pills.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 31: VizWiz_train_00006716.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a clock on a table.

Visual question: What time is it?

Answers:

  1. cabel box
  2. 11:05
  3. unanswerable
  4. 11:03
  5. 11:03
  6. 11:05
  7. unanswerable
  8. 11:03
  9. unanswerable
  10. 11:03

Reasons why answers differ:

Image captions:

  1. A cable box with a coaxial cable laying on top of it.
  2. A coax cable cord is looped atop a Verizon tuner.
  3. a large black TV box with the time 11:03 on it
  4. A stereo component is powered up on a shelf.
  5. A Verizon brand digital video recorder and the cable wire connected to it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 32: VizWiz_train_00014054.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is sitting on a street.

Visual question: Can you tell me the size of this please?

Answers:

  1. no
  2. unsuitable
  3. unanswerable
  4. no
  5. cant read lable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a blue article of clothing with a white colored tag
  2. A dark colored v neck shirt with a white tag on it.
  3. A plain white label inside the collar of a dark colored v-neck shirt.
  4. A shirt has a tag on the back and is a v-neck.
  5. The top part of a shirt with the label showing but can't see what's on the label.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 33: VizWiz_train_00016725.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: Which type of hamburger helper is this?

Answers:

  1. cheesy pasta
  2. cheesy pasta
  3. cheesy pasta
  4. cheesy pasta
  5. cheesy pasta
  6. cheesy pasta
  7. cheesy pasta
  8. cheesy pasta
  9. cheesy pasta
  10. cheesy pasta

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A box of cheesy noodles picturing what its contents will look like when prepared.
  2. A close up of an orange package of macaroni and cheese.
  3. A frozen food box of with pasta on the front.
  4. A very wonderful view and worth seeing at all times, my friend
  5. I see a upside down picture of cheesy pasta dish.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 34: VizWiz_train_00014143.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a red surface.

Visual question: What color is this floor question mark

Answers:

  1. red
  2. red
  3. maroon
  4. brown
  5. unanswerable
  6. candy apple red
  7. blurry
  8. red
  9. red
  10. red

Reasons why answers differ:

Image captions:

  1. A red background with a white light streak
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 35: VizWiz_train_00009321.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cell phone sitting on top of a table.

Visual question: Is this?

Answers:

  1. coffee package
  2. maxwell house coffee k cups
  3. coffee box
  4. what this
  5. coffee
  6. maxwell house blend coffee
  7. maxwell house coffee
  8. coffee
  9. coffee
  10. coffee

Reasons why answers differ:

Image captions:

  1. A bag of Maxwell House blend coffee that sits on the edge of a hotel sink
  2. A bag of Maxwell House - House Blend coffee.
  3. A brown bag of coffee with a blue and white logo is leaning on a white wall
  4. a brown package with blue writing on it for Maxwell House coffee
  5. a package of Maxwell House coffee pods on a table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 36: VizWiz_val_00006108.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a close up of a white fabric blanket
  2. A close up picture of some sort of grey material with stitching running down the middle.
  3. A greyish white blanket with small squares on it.
  4. A quilt is laid in front of a camera and has a square pattern on it.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 37: VizWiz_train_00000380.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A laptop is in front of a window.

Visual question: is this Iphone

Answers:

  1. unsuitable
  2. no
  3. no
  4. ipad
  5. no
  6. no
  7. no
  8. no
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A laptop computer screen is sitting in front of a gold curtain, and a wooden bookshelf is directly to the left of it.
  2. A laptop or an iPad computer screen on a counter with nothing on the screen.
  3. Appears to be a picture of a bathroom with tablet in it
  4. Here is a photo of a brown curtain and shelving near it, and in the front is a screen to a tablet perhaps.
  5. White apple tablet with nothing displayed on the screen.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 38: VizWiz_val_00002735.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bright orange background.

Visual question: What color is this?

Answers:

  1. orange
  2. orange
  3. orange
  4. orange
  5. orange
  6. orange
  7. red orange
  8. orange
  9. red
  10. orange

Reasons why answers differ:

Image captions:

  1. IT TOO BLUE AND IT'S HARD TO FIND
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 39: VizWiz_train_00006552.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: What is this item?

Answers:

  1. ginger
  2. ginger
  3. ginger
  4. ginger
  5. ginger
  6. ginger
  7. ginger
  8. ginger
  9. ginger
  10. ginger

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A container of ginger is on top of a table.
  2. A container of ginger with a White, Red and Yellow label
  3. a container of ground ginger and its barcode
  4. A spice container with a label that reads "Ginger".
  5. a vegetable grow under the earth and used in preparing dishes

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 40: VizWiz_train_00003080.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a building with a window in the background.

Visual question: what type of package is this please

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unanswerable
  8. bdgsh
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a light that is on a ceiling that is white
  2. bright big ceiling rectangle light that is on
  3. Image is a overhead lighting for viewing.
  4. The overhead light is very bright and is charged with vertical bars.
  5. three bulb fluorescent light on the ceiling, nothing else visible

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 41: VizWiz_train_00013033.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What flavor is this?

Answers:

  1. beef
  2. beef
  3. beef
  4. beef
  5. beef
  6. beef
  7. beef
  8. beef
  9. beef
  10. beef

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A package of ramen noodle soup beef flavor is on pink table.
  2. A package of Ramen noodle soup laying on a table with a clock behind it.
  3. A red and yellow pouch of noodles on a white and red placemat.
  4. A soft unopened plastic package of Ramen beef-flavored noodle soup sitting on a place-mat next to a dinner plate.
  5. Beef ramen noodles in the package resting on a pink rimmed white plate

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_train_00007804.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A television sitting on a table next to a window.

Visual question: What is this?

Answers:

  1. speaker
  2. radio
  3. portable speaker
  4. speakerbox
  5. wireless speaker
  6. speaker
  7. speaker
  8. speaker
  9. speaker
  10. speaker

Reasons why answers differ:

Image captions:

  1. A small green and silver speaker in front of cabinet doors.
  2. A speaker sits on a desktop with cabinet doors in the background.
  3. An audio speaker with two cabinet doors in the background
  4. On a shiny surface is a small speaker.
  5. Silver speaker radio placed on wooden floor in front of dresser.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_train_00009741.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cat that is sitting in a room.

Visual question: What kind of dog is this?

Answers:

  1. west highland white terrier
  2. white dog
  3. schnauzer
  4. cairn terrier
  5. yorkie
  6. white dog
  7. unanswerable
  8. unanswerable
  9. westie
  10. terrier

Reasons why answers differ:

Image captions:

  1. A small dog with perky ears in front of a corner.
  2. A small white dog, behind him there is some type of shelf unit
  3. A white dog in a bedroom with a gate behind it that's green
  4. A white dog that has really long hair in front of its face.
  5. A white shaggy dog stands on a wooden floor in front of a door.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 44: VizWiz_train_00021176.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A red wall has a white ribbed surface along its lower half.
  2. Dark pink painted wall has a ribbed white radiator about 5 inches from the dark floor.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. White molding is on the peach colored wall in the room.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 45: VizWiz_val_00001084.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What's in this tent?

Answers:

  1. unsuitable image
  2. unsuitable image
  3. unanswerable
  4. people
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unsuitable image
  9. unanswerable
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A cushioned seat with dark green patterned fabric
  2. A person's elbow in front of a couch
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 46: VizWiz_train_00001158.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person ' s hand on a skateboard.

Visual question: What is this item?

Answers:

  1. organic black beans
  2. black beans
  3. canned beans
  4. can black beans
  5. beans
  6. beans
  7. black beans
  8. can black beans
  9. black beans
  10. black beans

Reasons why answers differ:

Image captions:

  1. A can of Natural Value organic black beans is being held in the hand of someone.
  2. A person holds a can of natural value beans with a picture of beans on the cover
  3. A red can containing some black beans is being held up by someone
  4. A red labeled tin can of black beans.
  5. an ingredient which is used for preparing dishes which is black in color and it is a bean

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_train_00003760.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a black and white wall.

Visual question: What does the display say?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unsuitable
  5. nothing
  6. nothing
  7. 9
  8. no display
  9. 0
  10. 0

Reasons why answers differ:

Image captions:

  1. A black LED instrument panel with green light indicators.
  2. a dark doorway in a very white, shiny wall.
  3. A digital display unit on the top of the oven shows a timer.
  4. A silver digital clock with its display in the middle.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 48: VizWiz_train_00015103.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pair of scissors.

Visual question: Hopefully this comes out better this time, if you can tell me what this logo is, what the name brand is.

Answers:

  1. adidas
  2. unanswerable
  3. dont say
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. i cant tell
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a black pouch with the zipper half way open
  2. A zipper pouch on a black fabric fanny pack
  3. Beautiful view from behind the walls hidden under dark mist
  4. Black fabric back with a zipper and velcro on top of a tan background.
  5. Pictured is what appears to be a black canvas bag/luggage with a red, white and blue pin.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 49: VizWiz_train_00000647.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee sitting on a table.

Visual question: What kind of soda is this?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. lemon lime
  5. unsuitable
  6. unanswerable
  7. sprinte
  8. unanswerable
  9. unanswerable
  10. lemon lime

Reasons why answers differ:

Image captions:

  1. A can of carbonated beverage sitting on a flat horizontal surface with a bowl of food in the background.
  2. A container / package that contains various goods / edible / liquid items.
  3. A soft drink can is sitting next to a bowl on the table.
  4. An unopened can of soda with nutrition facts in front of a half-eaten bowl of soup
  5. Green beverage can and a bowl of liquid food.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 50: VizWiz_train_00019249.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a sign.

Visual question: What does this say?

Answers:

  1. fashion tote
  2. preferred for stylish prote fashion tote b
  3. fashion
  4. fashion tote
  5. fashion tote bag
  6. preferred stylish prote fashion tote
  7. fashion tote
  8. fashion tote
  9. fashion tote bag
  10. fashion tote

Reasons why answers differ:

Image captions:

  1. A hand holding a clothing tag or advertisement card.
  2. A hand holding an upside down label for an item.
  3. a picture of someone's hand holding a tag saying preferred for stylish tote
  4. paper tag colored green brown and purple with text print on it
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Showing images 0 - 0 out of 0 matching images.