Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00010140.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person in the water.

Visual question: Can you see what is on the computer screen?

Answers:

  1. no
  2. no
  3. no
  4. no
  5. reflection
  6. no
  7. phone
  8. no
  9. dirt
  10. dust

Reasons why answers differ:

Image captions:

  1. A computer or television made by Dell that's filthy.
  2. A Dell brand laptop screen with fingerprints on the screen.
  3. A really dirty laptop computer covered in pet hairs and dirt.
  4. An old, decrepit Dell computer displays someone hold a computer mouse on the screen.
  5. The reflection of an Apple iPhone in a ruined laptop screen

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 2: VizWiz_train_00022909.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor showing an open document on the screen.
  2. A computer screen with a group of websites and controls on screen.
  3. A picture of a laptop or desktop screen
  4. a screen on a computer with a tab pulled up
  5. In this picture is a image of a screen

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_train_00020569.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A cell phone with a broken screen and a zebra print screen protector is sitting on the edge of a table.
  2. A Motorola cell phone with a zebra phone case and a broken screen.
  3. A Motorola smartphone with a cracked screen and zebra-print case.
  4. A smartphone with a cracked screen in a zebra phone case.
  5. broken smartphone with a black and white zebra case

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 4: VizWiz_train_00017611.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of an oven in a room.

Visual question: What is the brand of this receiver?

Answers:

  1. unsuitable
  2. musicer
  3. pioneer
  4. unsuitable
  5. unsuitable
  6. pioneer
  7. unanswerable
  8. unsuitable
  9. unsuitable
  10. pioneer

Reasons why answers differ:

Image captions:

  1. A black console with a wooden box on top sitting on a wood shelf.
  2. A black stereo system with a brown box on top.
  3. a DVR/VCR sitting in front of an electric plug
  4. Black and orange stove top with wires and digital screen
  5. Not a good image and image has drawbacks in quality.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 5: VizWiz_train_00011216.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food on a table.

Visual question: Whats the name of this box?

Answers:

  1. anne
  2. unanswerable
  3. luzianne ice tea
  4. luzianne
  5. luzianne
  6. tea
  7. unsuitable
  8. luzianne
  9. luzianne
  10. iced tea

Reasons why answers differ:

Image captions:

  1. A box of Luzianne brand iced tea pouches.
  2. A box of Luzianne tea containing multiple pouches for brewing.
  3. A box of Luzianne's new iced tea pouches.
  4. Red Luzianne drink pouch placed on a white speckled counter top.
  5. The front label of a box of instant iced tea mix.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_train_00005454.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book that is sitting on the floor.

Visual question: What is this product?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. mrs callers
  6. mrs cubbinsons
  7. unanswerable
  8. unsuitable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. Box of food on the lap of a person
  2. Image quality is low to recognize visual content.
  3. Packaged food label showing cooking instructions and recipe ideas
  4. Quality issues are too severe to recognize visual content.
  5. The back of a package of a Mrs Cubbison's food item is shown.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_train_00000306.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a banana on a table.

Visual question: What flavor dressing is this please?

Answers:

  1. mango chipotle
  2. mango chipotle
  3. mango chipotle
  4. mango chipotle
  5. mango chipotle
  6. chipotle
  7. mango chipotle
  8. mango chipotle
  9. mango chipotle
  10. mango

Reasons why answers differ:

Image captions:

  1. A bottle of Kraft mango chipotle vinaigrette in a plastic container with orange liquid.
  2. a full bottle of Mango Chipotle flavored salad dressing
  3. A hand holding a jar of salad dressing which is orange in color and has picture of mango and some words on it.
  4. bottle of orange Kraft mango chipotle salad dressing
  5. KRAFT Mango Chipotle in orange bottle holding by someone.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00022071.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black computer screen showing the F8 command
  2. A black computer screen with white writing giving the option to press F8
  3. a computer monitor showing a DOS screen and choices including one for f8
  4. a computer screen with a black background displaying some instructions in white text.
  5. A monitor with a black screen that says 'for this choice, press F8'.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 9: VizWiz_train_00021196.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. appears to be a picture of a box of crakcers
  2. Back of a box of cheese it's showing the assorted flavors they offer.
  3. rectangle box containing small starchy snack that has cheese dusting.
  4. The back of a box of Cheez It's from the side.
  5. the back of a red box of cheez it snack crackers

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_train_00003836.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding up a bottle of beer.

Visual question: What kind of soup is this?

Answers:

  1. cream celery
  2. cream celery
  3. cream celery
  4. cream celery
  5. cream celery
  6. cream celery
  7. celery
  8. cream celery
  9. cream celery
  10. cream celery

Reasons why answers differ:

Image captions:

  1. A can of Campbell's Cream of Celery soup held so that the label shows the contents of the can prepared.
  2. A person is holding a can of Campbell's condensed soup.
  3. A white appliance and a person holding a red, white and yellow can of soup.
  4. Beautiful view from behind the walls hidden under dark mist
  5. someone holding a can of Campbell's cream of celery soup

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 11: VizWiz_train_00019795.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a silver shirt.

Visual question: what color are these pants?

Answers:

  1. grey
  2. grey
  3. grey
  4. grey
  5. grey
  6. brown
  7. grey
  8. sofa
  9. grey
  10. grey

Reasons why answers differ:

Image captions:

  1. Dark slate gray textured and patterned cloth surface.
  2. It looks like the grey fabric that is textured may be a seat of some kind.
  3. Quality issues are too severe to recognize visual content.
  4. Someone has photographed a length of grey, herringbone textured fabric.
  5. Something which is black in color and it looks like a cloth.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 12: VizWiz_train_00016571.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blurry light in a room.

Visual question: What is this item?

Answers:

  1. unanswerable
  2. light bulb
  3. light
  4. unsuitable
  5. unsuitable
  6. mirror
  7. unsuitable
  8. mirror
  9. mirror on door
  10. light

Reasons why answers differ:

Image captions:

  1. a flash in a mirror with a wide wooden frame
  2. Front view into an oven that appears to have something in it.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 13: VizWiz_train_00006274.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pillow on a table.

Visual question: What is this?

Answers:

  1. leg
  2. forearm
  3. insufficient image quality
  4. unsuitable
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A package of food lies on a rug, obscured by a large cloth.
  2. An arm is in front of a package of food.
  3. I see food product packaging behind someone's arm.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 14: VizWiz_train_00023737.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is the membership number of this card?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unsuitable image
  5. unanswerable
  6. unanswerable
  7. sams club
  8. unanswerable
  9. unanswerable
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 15: VizWiz_train_00017946.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a book on a table.

Visual question: What page is this and can you tell me if it is upside down or right side up? Thank you.

Answers:

  1. 330 upside down
  2. 330 upside own
  3. music
  4. 300 upside down
  5. ee
  6. 380 upside down
  7. upside down
  8. upside down
  9. 330 upside down
  10. upside down i think page 330 little blurry

Reasons why answers differ:

Image captions:

  1. A paper with study guides for a music class on a desk.
  2. A piece of paper has musical notes on it.
  3. A piece of paper that is on top of a desk.
  4. Page out of a book about music or music theory.
  5. textbook of a musical note on upside down manner

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 16: VizWiz_val_00002612.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cloudy sky with a gray and white clouds.

Visual question: What do the big gray clouds look like in the sky?

Answers:

  1. big grey clouds
  2. storm clouds
  3. clouds
  4. slightly slightly gloomy
  5. unanswerable
  6. cotton candy
  7. cotton ball
  8. clouds
  9. cotton
  10. full all encompassing

Reasons why answers differ:

Image captions:

  1. A photograph of an evening sky with clouds, sunset, trees and electric poles and wires.
  2. Dark blue clouds in the sky with a few street lights and trees in the background.
  3. Gray, wispy clouds in the sky with a bit of blue shining in between.
  4. someone is outside looking at the clear blue sky with clouds
  5. Storm clouds rolling over a neighborhood with a streetlight in the foreground

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 17: VizWiz_train_00004280.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A plate of food that is on a table.

Visual question: What kind of food product is this?

Answers:

  1. corn chips
  2. chips
  3. corn chips
  4. potatoes
  5. fritos scoops corn chips
  6. corn chips
  7. chips
  8. scoops corn chips
  9. corn chips
  10. scoops corn chips

Reasons why answers differ:

Image captions:

  1. A bag of Scoops! branded corn chips in a dark room.
  2. A blue, red, and yellow bag of Scoops brand corn chips.
  3. a sachet a snacks which is named scoops
  4. some sort of chips that are in a blue bag
  5. The bottom of a bag of Scoops corn chips with the label showing prominently.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 18: VizWiz_val_00001304.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on top of a table.

Visual question: What is this a model of?

Answers:

  1. vinegar
  2. bottle
  3. vinegar
  4. room
  5. bottle
  6. bottle
  7. oil bottle
  8. glass bottle
  9. unanswerable
  10. ship

Reasons why answers differ:

Image captions:

  1. A bottle of a brown liquid on a white countertop.
  2. A bottle of liquid that is on the table.
  3. A bottle of some sort of liquid, possibly a cooking oil.
  4. An open bottle of olive oil at a dining table
  5. room with the French door and window has a table a bottle on it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 19: VizWiz_train_00018551.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a large white cage.

Visual question: What is this?

Answers:

  1. fan
  2. fan
  3. fan
  4. oscillating desk fan
  5. table fan
  6. fan
  7. fan
  8. fan
  9. fan
  10. fan

Reasons why answers differ:

Image captions:

  1. A small white fan is on top of the floor.
  2. A white rotating fan that has 3 settings.
  3. appears to be a picture of a fan
  4. The fan is white in color and has three small blades.
  5. White Atlantic Breeze fan with four buttons on a carpeted floor.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 20: VizWiz_train_00018058.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a refrigerator with a cat on it.

Visual question: what flavor is this

Answers:

  1. cherry
  2. cherry
  3. cherry
  4. cherry
  5. cherry
  6. cherry
  7. cherry
  8. cherry
  9. cherry
  10. cherry

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A package of Kool-Aid is on a counter.
  2. A red packet of powdered Kool Aid in cherry flavor.
  3. A small package of red kool-aid powder mix
  4. Hey Kool-Aid package thing flat on top of a kitchen counter
  5. Packet of cherry flavored Kool Aid brand drink mix.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 21: VizWiz_val_00007307.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content or a black screen
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 22: VizWiz_train_00020674.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A corner of a red keyboard with black and white keys
  2. A red piano keyboard with white and black keys says nordu Synthe
  3. Black and white keys of a piano keyboard.
  4. keyboard is seen with alternate white and black color
  5. Nord stage piano keyboard user manual for blind.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_train_00013008.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a refrigerator on a wall.

Visual question: Can you tell what this is?

Answers:

  1. aloe gel
  2. aloe vera gel
  3. aloe vera gel
  4. ever
  5. aloe vera gel
  6. aloe vera gel
  7. gel
  8. gel
  9. aloe vera gel
  10. aloe vera gel

Reasons why answers differ:

Image captions:

  1. A 12-oz bottle of aloe vera gel with a green and orange label
  2. A bottle of aloe vera gel with a green and orange label against a white background.
  3. A rectangular box containing some kind of drink.
  4. Aloe vera gel bottle right on the floor.
  5. The side of a bottle of soap with green aloe vera soap inside.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 24: VizWiz_train_00013938.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a refrigerator with a toilet.

Visual question: This is a washing machine setting knob. The bottom left hand corner contains a bump, a raised dot. I need to know what setting that is.

Answers:

  1. medium
  2. unanswerable
  3. medium
  4. medium
  5. unanswerable
  6. medium
  7. medium
  8. medium
  9. medium
  10. medium

Reasons why answers differ:

Image captions:

  1. A washing machine has a control knob allowing two different wash cycles.
  2. A washing machine is turned on and is against the wall.
  3. A white clothes washer with the knob turned to medium.
  4. A white washing machine has a dirty dial
  5. The control knob of a white washing machine.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 25: VizWiz_train_00013503.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a view of a white sky.

Visual question: What color are these shorts?

Answers:

  1. unsuitable
  2. ra
  3. unsuitable
  4. blue
  5. black
  6. black
  7. grey
  8. grey
  9. black
  10. no shorts

Reasons why answers differ:

Image captions:

  1. A dark colored bed sheet with a few wrinkles.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 26: VizWiz_train_00018040.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A red tree with a plant on it.

Visual question: What's this?

Answers:

  1. unanswerable
  2. valentine
  3. tattoo
  4. heart chain lock 2 pink roses 2 doves flying over
  5. red heart roses
  6. heart lock
  7. heart locket doves roses
  8. picture heart locked chain around flowers below doves above
  9. picture
  10. art

Reasons why answers differ:

Image captions:

  1. a card of some sort that has a heart locked up and doves flying above it.
  2. a clip art depiction of a red heart with a chain and lock around it with pink roses underneath the heart and the two white doves flying above the heart
  3. a heart with a lock on it and flowers with white birds above
  4. A romantic picture of a heart with a lock and key, two love birds and some pink roses
  5. I am looking at a picture of a heart with a gold chain with a lock wrapped around the heart.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 27: VizWiz_val_00002569.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a beer.

Visual question: Can you tell me what this food item is?

Answers:

  1. manwich
  2. manwich
  3. manwich
  4. can manwich
  5. manwich
  6. manwich
  7. manwich
  8. unanswerable
  9. no
  10. can

Reasons why answers differ:

Image captions:

  1. A can of Manwich is ready for dinner if you think you are man enough.
  2. A hand is holding a can Manwich sloppy joe sauce upside down.
  3. a person hand holding a red color little can
  4. A person is holding a can of soup.
  5. a vegetables which is ready to eat stored in a container

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_val_00006934.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a pink and blue watch on a wrist with the time three twenty six
  2. A pink and blue watch with a circular face
  3. A pink watch that shows the time as 3:26.
  4. a white female wearing a pink and blue digital watch
  5. Hot pink and blue watch on a wrist.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 29: VizWiz_val_00004889.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A white bottle with directions and a barcode on the label.
  2. A white metal spray can with a white cap being held by someone's hand.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. the back label of a bottle with a visible barcode and directions printed

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 30: VizWiz_val_00001782.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A statue of an elephant on a white surface.

Visual question: What is this object?

Answers:

  1. ashtray
  2. decorative bronze sculpture lioness
  3. statue
  4. no
  5. ashtray
  6. ash tray
  7. statue
  8. dog bowl
  9. ashtray
  10. plague tiger on top

Reasons why answers differ:

Image captions:

  1. a gold ashtray with a large cat on the top of it
  2. a golden color award with a symbol of a lion
  3. A photo of a cougar gold statue on a white flat surface.
  4. A sculpture is right on top of the table.
  5. Bronze statue of a feline species animal mounted on top of opening container.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 31: VizWiz_train_00011775.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wooden wall next to a knife.

Visual question: What is the color of this seen?

Answers:

  1. lite brown white
  2. brown
  3. beige burgundy
  4. white marble burgundy lines
  5. peach
  6. brown
  7. tan marble
  8. light brown
  9. red cream
  10. tan

Reasons why answers differ:

Image captions:

  1. A countertop or table, nothing resting on it.
  2. A light beige colored marble type surface, either a floor or a counter.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 32: VizWiz_train_00006978.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white bed in a room.

Visual question: Can you please tell me what kind of tube this is?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a small bottle of lotion laying on a white shirt with a gray cord laying across it.
  2. A white bottle with green text laying on a bed with a gray house next to it.
  3. A white t shirt that has a tag shown
  4. Quality issues are too severe to recognize visual content.
  5. White bed sheets with what looks like a tube of lotion on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 33: VizWiz_train_00012421.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person sitting on top of a white table.

Visual question: What is the picture on this piece of paper?

Answers:

  1. vines
  2. plant
  3. letter some branches
  4. letter vines leaves
  5. vines
  6. flowers
  7. some tree branches
  8. letter
  9. letter in tree
  10. black white card floral like design letter in upper right corner

Reasons why answers differ:

Image captions:

  1. A coloring book page on a person's lap.
  2. A coloring book page with black outlines and the letter A in a box.
  3. A coloring page that has the letter A with a design next to it.
  4. A uncolored coloring page resting on a person's lap.
  5. Someone is sitting with a book meant for coloring.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 34: VizWiz_train_00022047.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A big box of pizza style hot pockets
  2. A box for hot pockets and it's nutritional information.
  3. A box of hot pockets is opened and has nutritional info on the back.
  4. A white package of Hot pockets turned on the Nutritional facts side.
  5. The back of a box of hot pockets which shows the nutritional facts

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 35: VizWiz_train_00022504.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a blurred picture of looks like the sky with grass on the ground
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 36: VizWiz_train_00017183.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of coffee.

Visual question: Hopefully this is a better angle, and this may hopefully show which one this is. Either the chocolate or tapioca. Thanks.

Answers:

  1. chocolate
  2. chocolate
  3. chocolate
  4. chocolate
  5. chocolate
  6. chocolate
  7. chocolate
  8. chocolate
  9. chocolate
  10. chocolate

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A container of 90 calorie each chocolate pudding packs.
  2. A person is holding a package of chocolate pudding.
  3. A picture of the food is on the packaging.
  4. image shows a chocolate packet and two chocolate glass in a hand.
  5. The label of a package of chocolate pudding cups.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 37: VizWiz_train_00010288.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What is this? Thank you.

Answers:

  1. unanswerable
  2. unanswerable
  3. tv dinner
  4. chicken mashed potatoes chives
  5. meat cheddar cheese mashed potatoes
  6. frozen meal
  7. chicken mashed potatoes
  8. this panner rice
  9. frozen meal
  10. chicken bacon mashpotatoes

Reasons why answers differ:

Image captions:

  1. A box chicken and potatoes with onions in them.
  2. A box of microwavable food that contains bacon, cheddar cheese with mashed potatoes.
  3. A freezer meal box, with what appears to be barbecue chicken with bacon and cheese, served with mashed potatoes.
  4. A red packaging with a picture of mashed potato and chicken on the cover
  5. Appears to be a picture of a TV dinner

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_val_00001021.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Describe the picture

Answers:

  1. facebook page on computer
  2. laptop wooden desk back something behind
  3. computer screen
  4. computer screen
  5. computer screen
  6. kid
  7. boy
  8. computer screen
  9. boy
  10. screen shot computer screen

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor showing a page of Facebook.
  2. A computer screen showing a window with a boy in a video.
  3. A corner of a computer screen showing the picture of a small dark haired boy.
  4. A laptop monitor was pictured as it was laid back and with its brightness, the text can't be read.
  5. grey color laptop placed on brown wooden table

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00006853.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person laying on top of a bed.

Visual question: What does this shirt say?

Answers:

  1. unanswerable
  2. nothing
  3. unanswerable
  4. nothing
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. nothing
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A brown pillow case is on the white bedding
  2. A person's lower body next to a white and pink fabric surface.
  3. Brown fabric, possibly a garment, lying on white fabric, possibly a bed; the photographer's body and legs are visible in the shot.
  4. Person wearing blue jeans and blue shirt standing next to bed with brown colored shirt stretched out on bed.
  5. The end of a bed that's been made up very neatly.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 40: VizWiz_val_00004495.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 41: VizWiz_train_00008504.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person is holding a bottle of wine.

Visual question: What is this?

Answers:

  1. medicine
  2. medicine
  3. eyedrops
  4. small bottle
  5. eye drops
  6. eye drops
  7. eye drops
  8. eye drops
  9. saline drops
  10. eyedrops

Reasons why answers differ:

Image captions:

  1. A hand holding a small bottle of some kind of drops, showing the back with the ingredients list.
  2. A small plastic bottle of eye drops with the instructions showing.
  3. A small plastic eye dropper bottle is being held
  4. bottle of nasal spray held up in a living room
  5. The back of a small bottle of eye drops or contact drops.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 42: VizWiz_train_00001752.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on the ground.

Visual question: Does this say it's for cats or dogs?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unanswerable
  7. unanswerable
  8. unsuitable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A person holds up a gold credit card
  2. A wrinkled up piece of paper being held up.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 43: VizWiz_train_00012413.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a drink.

Visual question: What kind of can is this? Thank you

Answers:

  1. unanswerable
  2. unsuitable
  3. juice
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. tin
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A green labeled tin can of ready to eat preserved foods.
  2. A hand is holding a can of food with the back panel showing.
  3. A person holding up a green can with a barcode
  4. Persons hand holding a round tin can with black lettering on it.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 44: VizWiz_train_00000070.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a street sign in the dark.

Visual question: What does the display say?

Answers:

  1. frequency
  2. frequency audio
  3. frequency
  4. frequency
  5. frequency 87 audiomode: ste cut off
  6. unsuitable
  7. unclear
  8. frequency 8%
  9. frequency audiomode
  10. frequency

Reasons why answers differ:

Image captions:

  1. A bright display saying something about FREDQUENCY's and AUDIO MODE.
  2. A screen that is showing the frequency and audio mode that the device is set to.
  3. Black background with a Blue box with writing in box; Appears to be a from a dashboard in car but not sure
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 45: VizWiz_train_00006572.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of coffee on a table.

Visual question: What is the color of this?

Answers:

  1. red white
  2. red white
  3. can red
  4. red
  5. red
  6. red white
  7. red
  8. cream
  9. red white
  10. red white

Reasons why answers differ:

Image captions:

  1. A can of Coca-Cola is sitting in front of several other objects on a table.
  2. a coca cola can next to a phone on top of a white counter.
  3. Camera needs to be placed a little higher up to get the full picture and to see the full figure of soda can.
  4. Coca-Cola soda can next to a telephone on top of a desk
  5. the bottom of a 12 FL oz can of coca cola showing 140 calories per can

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_val_00007076.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor displaying the Windows login prompt.
  2. a computer screen with a windows XP box open on it
  3. A dated black plastic flat screen computer shows the Windows login prompt.
  4. A small computer monitor is on a table and a larger monitor is hanging on the wall above a table and chairs.
  5. Monitor screen showing a windows XP logo, inside a room.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 47: VizWiz_val_00003528.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A tv is sitting on a wooden surface.

Visual question: What is in this screen?

Answers:

  1. image
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unsuitable
  9. monitor
  10. wood shelf

Reasons why answers differ:

Image captions:

  1. A computer screen is displaying a window with a message
  2. Laptop, opened in front of a wooden wall or hutch
  3. Part of a laptop computer can be seen sitting on a light wooden surface.
  4. Quality issues are too severe to recognize visual content.
  5. The top right corner of a computer screen can be seen against a wooden background.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 48: VizWiz_train_00005958.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s hand.

Visual question: What do you see?

Answers:

  1. unanswerable
  2. unsuitable
  3. unanswerable
  4. finger pants
  5. clothing part finger
  6. unsuitable
  7. skin clothes
  8. unsuitable
  9. thumb
  10. skin

Reasons why answers differ:

Image captions:

  1. I see a finger and maybe a few blankets and spreads.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 49: VizWiz_train_00007189.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A kitchen with a bottle of food on it.

Visual question: what kind of spice is this, thank you?

Answers:

  1. lemon pepper
  2. lemon pepper
  3. lemmon pepper
  4. lemon pepper
  5. lemon pepper seasoning
  6. unanswerable
  7. lemon pepper
  8. lemon pepper
  9. lemon pepper
  10. lemon pepper

Reasons why answers differ:

Image captions:

  1. a jar of lemon pepper seasoning with a yellow lid and green label
  2. a jar of seasoning with a yellow lid and red and green label.
  3. a kitchen countertop that is very white with stuff on it
  4. a plastic shaker of McCormick lemon pepper with a yellow lid sitting on a counter top
  5. A small container of lemon pepper seasoning sitting on a kitchen counter in front of a utensil container, and next to the stove.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 50: VizWiz_val_00005129.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A white pill bottle with purple and orange labeling.
  2. An label bottle with an nomenclature of ingredients in orange and blue.
  3. Quality issues are too severe to recognize visual content.
  4. The package contains information about the enclosed medication
  5. the photo is of a bottle of vitamin supplements.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Showing images 0 - 0 out of 0 matching images.