Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00007616.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of food.

Visual question: What is this?

Answers:

  1. crunchy peanut butter
  2. peanut butter
  3. peanut butter
  4. crunchy peanut butter
  5. crunchy peanut butter
  6. peanut butter
  7. peanut butter
  8. crunchy peanut butter
  9. crunchy peanut butter
  10. crunchy peanut butter

Reasons why answers differ:

Image captions:

  1. A container of crunchy peanut butter with a blue lid.
  2. a jar of crunchy peanut butter with nutrition facts on it
  3. A jar of peanut butter is sitting on a white surface, against a brown wall.
  4. appears to be a picture of peanut butter
  5. The side of a crunchy peanut butter container is shown as it sits on a counter.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 2: VizWiz_train_00015757.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a glass on a table.

Visual question: is there any pattern on this coat?

Answers:

  1. mug fantasy train picture
  2. sd
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. train
  7. no coat
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A coffee cup depicting a picture of a train sitting on a table
  2. A coffee mug printed with an orange train-type vehicle flying above a metropolitan city.
  3. a coffee mug with a handle and an image of a train on it.
  4. A mug with a colorful train on the glass table
  5. A mug with a picture of a flying orange vehicle over a city.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 3: VizWiz_train_00017897.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sunset with a blurry background.

Visual question: What is the name of this product?

Answers:

  1. towel
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. chair
  6. unanswerable
  7. 0
  8. unusable image
  9. finger
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 4: VizWiz_train_00012356.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a book on a wall.

Visual question: What chocolates are these?

Answers:

  1. unanswerable
  2. unsuitable
  3. unanswerable
  4. assorted chocolate toffees
  5. unanswerable
  6. unanswerable
  7. big selection
  8. hershey
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A box of packaged chocolates with a red label.
  2. A large red box of chocolates and toffees.
  3. A red candy box is pictured here shown in landscape.
  4. A RED COLOR RECTANGULAR SHAPED CHOCOLATE BOX WAS HANDLED A PERSON
  5. a wine colored paper with BIG SELECTION written on it

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_train_00012960.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee sitting on a table.

Visual question: What type of drink is this?

Answers:

  1. diet soda
  2. diet coke
  3. diet coke
  4. diet coke
  5. diet coke
  6. cola
  7. diet coke
  8. diet coke
  9. diet coke
  10. diet coke

Reasons why answers differ:

Image captions:

  1. A silver aluminum can with Diet Coke labeling.
  2. A silver can of Diet Coke on top of a white surface in a room with a white wall and dark grey floor.
  3. A silver can with the words Diet Coke written on it in black and red.
  4. a single can of diet coke brand soda pop
  5. an aluminum can of diet coke with a pull tab

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_train_00004557.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What is this?

Answers:

  1. pancake mix
  2. buttermilk pancake mix
  3. pancake mix
  4. pancake mix
  5. buttermilk pancake mix
  6. pancake mix
  7. pancake mix
  8. pancake mix
  9. this cooking package
  10. pancake batter

Reasons why answers differ:

Image captions:

  1. 28 ounce bag of buttermilk pancake mix, just add milk
  2. a 28 ounce plastic package of buttermilk pancake mix
  3. A 28 oz plastic package of buttermilk pancake mixture
  4. A bag of pancake mix sitting on someone's knees with a white background.
  5. Buttermilk pancake mix in a white and yellow bag

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_train_00004248.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a chair that is seen.

Visual question: Read it.

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unsuitable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A bill with yellow, red, and blue print on it.
  2. A person holding a blue and yellow piece of paper.
  3. A piece of paper with elaborate drawings on it and folded corner.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 8: VizWiz_val_00005814.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A list of words commonly used in law enforcement
  2. A page from test or homework questions of a law class.
  3. A piece of paper with fill-in-the-blank questions labeled "Vocab" at the top.
  4. A white sheet of paper that says Vocab and has questions on it.
  5. The top portion of a written exam with blank spaces on it

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 9: VizWiz_train_00012497.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a wall.

Visual question: What is in the box?

Answers:

  1. krusteaz honey cornbread mix
  2. cornbread mix
  3. natural honey cornbread
  4. ho
  5. honey cornbread
  6. honey cornbread
  7. unanswerable
  8. cornbread
  9. honey cornbread
  10. honey cornbread

Reasons why answers differ:

Image captions:

  1. A box has honey cornbread in it and a image of cornbread
  2. A box of Krusteaz brand honey cornbread mix.
  3. A box of Krusteaz Natural Honey Cornbread with a picture of cornbread biscuits
  4. appears to be a picture of a box of food
  5. The front of a package of a cornbread mix.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_train_00002132.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue sky with a light.

Visual question: What color is my shirt?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. blue
  6. unsuitable
  7. blue
  8. unsuitable
  9. blue
  10. blue

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 11: VizWiz_val_00005619.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A screen picture of a hand coming through was seems to be glass with the time and date on it as well.
  2. A screenshot of someone's phone or iPad, picture is a hand reaching out and something like water squirting out sides of glove.
  3. An iPad screen displays the time of 1:01 and the date of Monday, February 11.
  4. Animated background with a hand breaking out of the glass for a iPhone lock screen.
  5. iPad unlock screen featuring a hand piercing through the glass

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 12: VizWiz_train_00004058.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a neon sign on top of it.

Visual question: YEAH WHAT FRECUENCY IS THIS?

Answers:

  1. 44.5
  2. 44.5
  3. unsuitable
  4. 44.5
  5. 44.5
  6. 44.5
  7. 44.58
  8. 44.5
  9. 44.58
  10. 44.5

Reasons why answers differ:

Image captions:

  1. A digital display that currently shows 44 point 50.
  2. a digital display with the number 44.5 up close.
  3. A digital readout says 44.5 on this device.
  4. Green colored digital display with partial numbers showing, 44 point 5.
  5. The numbers are bright green in color and have a decimal with them.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 13: VizWiz_train_00000283.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of beer.

Visual question: What is in this can please?

Answers:

  1. unsuitable
  2. unsuitable
  3. national
  4. unanswerable
  5. unanswerable
  6. tomato sauce
  7. unanswerable
  8. tomato sauce
  9. tomato sauce
  10. pasta sauce

Reasons why answers differ:

Image captions:

  1. A 15 ounce (425g) can of traditional style spaghetti sauce.
  2. A can with a yellow and red label sits on a wooden table.
  3. A canned food that has a yellow, green, and red label on a wooden surface.
  4. A photo of a yellow can of national tomato paste sitting on a counter.
  5. A small can with a picture of red pasta sauce on top of noodles on a table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 14: VizWiz_train_00008947.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: An open book with an advertisement on it.

Visual question: Can you read this to me?

Answers:

  1. blurry
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. no
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. no

Reasons why answers differ:

Image captions:

  1. A list of off-leash dog area rules on a wooden surface.
  2. a list of rules for dogs off leash in the park
  3. A list of rules for Off-Leash dog area
  4. List of rules for off leash dog areas on a wooden board.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 15: VizWiz_train_00023575.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Where is this?

Answers:

  1. unanswerable
  2. room
  3. bedroom
  4. hanging on wall
  5. bedroom
  6. bedroom
  7. bedroom
  8. bedroom
  9. bedroom
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 16: VizWiz_train_00023085.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A pack of Maruchan brand chicken-flavored ramen noodle soup.
  2. a package of Maruchan Ramen Noodle soup in chicken flavor on a wood surface
  3. A package of Maruchan Ramen Noodle Soup, Chicken flavor.
  4. a package of ramen noodle soup chicken flavor
  5. Package of ramen noodles sitting on a wood countertop.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_train_00015711.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A yellow banana sitting on top of a table.

Visual question: What vegetable is this

Answers:

  1. squash
  2. squash
  3. spaghetti squash
  4. spaghetti squash
  5. squash
  6. spaghetti squash
  7. spaghetti squash
  8. tsth
  9. squash
  10. squash

Reasons why answers differ:

Image captions:

  1. A large lime green melon is laying on the tiled floor in a home.
  2. A light green melon or squash sitting on the white tiled floor next to a brown area rug.
  3. giant oblong shaped greenish yellow melon next to rug
  4. Oval yellow shaped spaghetti squash whole on tiled floor.
  5. The squash is long and oval and is yellow in color with a hard end that is green.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 18: VizWiz_train_00003362.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book with an orange phone.

Visual question: What is this?

Answers:

  1. hot sauce
  2. pringles chips
  3. pringles
  4. unsuitable
  5. hot sauce
  6. hot sauce potato crisps
  7. hot sauce
  8. can original hot sauce crisps
  9. hot sauce flavored pringles
  10. pringles chips

Reasons why answers differ:

Image captions:

  1. A orange package for a container of potato crisps.
  2. A tall spherical foil and paper package of potato crisps.
  3. Container of spicy pringles, cannot see what flavor.
  4. Quality issues are too severe to recognize visual content.
  5. The can of chips are a very hot variety of Pringles.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_train_00001410.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A pair of red shoes in a room.

Visual question: What is this logo on this cap?

Answers:

  1. red sox
  2. unanswerable
  3. unanswerable
  4. boston red sox
  5. unsuitable
  6. unanswerable
  7. unanswerable
  8. unsuitable
  9. baseball team
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A baseball cap that is navy with a red decal.
  2. A baseball cap with the Boston Red Sox logo
  3. Beautiful view from behind the walls hidden under dark mist
  4. Bill of ball cap with embroidery and stitching.
  5. seam between a ball cap bill and topper with a red and white logo.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 20: VizWiz_train_00006134.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cake on a table.

Visual question: What is this?

Answers:

  1. cherry crumble
  2. pie
  3. pie
  4. fruit cobbler
  5. cobbler
  6. dessert
  7. foot item
  8. cobbler
  9. fruit cobbler
  10. cherry cobbler

Reasons why answers differ:

Image captions:

  1. A close up of a dessert that has been half eaten with a pie behind it.
  2. A granite like counter top displaying 2 deserts in baking dishes.
  3. a small take away with some food in it
  4. some type of dessert that is being eaten by someone
  5. two pies; one in a round baking dish uneaten and one in a rectangular baking dish with 1/3 portion gone

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 21: VizWiz_train_00002948.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cat sitting on top of a book.

Visual question: What are the baking instructions?

Answers:

  1. unanswerable
  2. unsuitable
  3. unanswerable
  4. unsuitable
  5. oven
  6. unsuitable
  7. dont know
  8. unsuitable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box of cookies are laying on a person's lap.
  2. a container of Pillsbury brand Christmas cookie dough
  3. a human more likely a woman holding a booklet setting in a chair
  4. A package of pre make Christmas cookies laying on someone's lap
  5. I see product packaging for Pillsbury frozen food that is resting on someone's lap.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 22: VizWiz_train_00022764.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A gray, white and black striped piece of material that is flat.
  2. A white fabric with silver, black, white and blue horizontal stripes.
  3. Silver Satin type fabric with white black and gray stripes.
  4. The corner of a grey pillow with black, white, and green stripes.
  5. The end of a necktie with diagonal stripes of silver, black, grey, dark blue

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 23: VizWiz_train_00018349.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a hotdog.

Visual question: What product is this made by?

Answers:

  1. amys california burgers
  2. amys
  3. amys
  4. veggies
  5. amys
  6. vegetables
  7. amys
  8. amys
  9. amys
  10. amys

Reasons why answers differ:

Image captions:

  1. a box of frozen dinner near am ac book computer
  2. A container of Amy's California Burger with a computer in the background.
  3. A hand holding a package for frozen "California burger" patties.
  4. A person holding up a frozen organic California burger patty made by Amy's.
  5. Box of frozen organic Amy's California meatless burger made with vegetables and grains

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 24: VizWiz_train_00019821.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee sitting on top of a table.

Visual question: What color is this ...?

Answers:

  1. nhsdghb
  2. red black grey white
  3. black white red grey
  4. red white blue
  5. striped red white black grey
  6. mug striped red white blue on green tabletop
  7. red white black green stripes
  8. red white blue
  9. green
  10. red black white grey thick striped

Reasons why answers differ:

Image captions:

  1. a coffee cup with horizontal stripes that are red white and black
  2. A green table with a glass mug on it and some cloth
  3. A mug on a table with black shakers and blue napkins, with various people sitting behind.
  4. A striped mug is on top of the table.
  5. mug with red, white, black and green stripes sitting on a green table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 25: VizWiz_train_00000404.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man that is standing in a bathroom.

Visual question: Hello. Can someone please tell me what this is please? Many thanks and any instructions for us on the bottle? Thank you.

Answers:

  1. cleaner
  2. cleaner
  3. bathroom toilet cleaner instructions on other side bottle
  4. bathroom toilet cleaner
  5. mr muscle bath toilet cleaner
  6. mr muscle bathroom toilet cleaner
  7. mr muscle bathroom toilet cleaner no instructions
  8. bathroom toilet cleaner
  9. cleaner
  10. bathroom cleaner

Reasons why answers differ:

Image captions:

  1. a container of Mr Muscle bathroom and toilet cleaner
  2. a person hand holding a blue pack of Mr muscle bathroom and toilet
  3. an orange bottle of Mr Muscle bathroom and toilet cleaner
  4. An orange plastic bottle of toilet cleaner with the branding Mr Muscle.
  5. some Mr muscle toilet cleaner for your bathroom

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 26: VizWiz_train_00020941.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a computer monitor with a webcam on top of it
  2. A image of a computer showing the screen.
  3. A large TV screen next to a table with medicine bottles and boxes on it
  4. A sideways picture of a screen sitting in a cluttered room.
  5. A television or computer monitor sits in a room with cabinets, boxes and several bottles in the background.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 27: VizWiz_train_00009381.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a white wall with a dirty tub.

Visual question: What is this?

Answers:

  1. rgrsfdg
  2. unsigned back check
  3. back check
  4. unsuitable
  5. back receipt
  6. unanswerable
  7. check
  8. back check
  9. paper
  10. paper

Reasons why answers differ:

Image captions:

  1. a countertop with a check on the surface
  2. Back of a check with a signature area not wrote on.
  3. Paper with Original Document written on it on top of a table
  4. Quality issues are too severe to recognize visual content.
  5. The back of a check is on top of a counter.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 28: VizWiz_train_00023300.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A can of progresso rich and hearty chicken and homestyle noodles soup
  2. a can of Progresso Rich and Hearty soup
  3. A can of Progresso soup sitting on a counter top.
  4. A can of soup sitting on a counter.
  5. Can of Progresso chicken noodle soup, can resting on a counter with kitchen appliances/tools on the counter also.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 29: VizWiz_train_00020783.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A white panel of light being held up by a hand in a very dark room.
  2. box like structure and is not clear to view and I can't find the object
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 30: VizWiz_train_00019005.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A reflection of a mirror in a bathroom.

Visual question: How much time is left on this timer?

Answers:

  1. 2:15
  2. 2:16
  3. 2 16
  4. 2:16 minutes
  5. 2 minutes 16 seconds
  6. 2:16
  7. 2:15
  8. unsuitable
  9. 2.16
  10. 2 16

Reasons why answers differ:

Image captions:

  1. A convection oven has a blue light for a clock and a knob in the bottom.
  2. A fancy microwave with many buttons is sitting on the counter.
  3. A microwave with a silver body and black panel showing the time.
  4. an electric cooking device with COOK MAGIC written on it
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 31: VizWiz_train_00020126.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A menu advertising Japanese food and beer is stood up on a table.
  2. A sushi menu with beer and various sushi on it
  3. An image of Sapporo brand beer on a menu with appetizers.
  4. appears to be a picture of a food screen
  5. Photo is of a place mat depicting several types of sushi, along with an advertisement for Sapporo beer.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 32: VizWiz_train_00008243.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue wall.

Visual question: what is the picture on this sweatshirt?

Answers:

  1. 0
  2. unanswerable
  3. unanswerable
  4. blank
  5. blank
  6. blue
  7. unsuitable
  8. unanswerable
  9. blue
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A blue knitted sweater with tiny hairs falling out of place.
  2. a blue piece of fabric possibly a towel or blanket
  3. A blue sweater is knitted with a blue yarn.
  4. Quality issues are too severe to recognize visual content.
  5. SNAPSHOT OF A BLUE TOWEL, VIEWED UP CLOSE

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 33: VizWiz_val_00001340.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a stop light on a wall.

Visual question: Hope my finger wasn't in the way but did it get the message this time?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. create text
  7. create text
  8. create text
  9. unsuitable
  10. this create txte

Reasons why answers differ:

Image captions:

  1. A close up of a screen that has the words Create Text.
  2. a screenshot of some sort of monitor or computer
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. text screen with text that reads create text with a checkbox

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 34: VizWiz_train_00016291.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a can of soda and a sign.

Visual question: How many cups of milk and water will I need to make this mix?

Answers:

  1. unanswerable
  2. 2 cups water
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. just water
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box of chicken flavored stuffing mix made by Stater Bros sits on a wood table.
  2. A red box of stuffing is sitting on a wood counter.
  3. A red box package of Stater Bros brand stuffing mix is in front of a drink thermos.
  4. box of stuffing mix with blue water jug behind it on a wood table
  5. The right side of the front of a box of stuffing mix.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 35: VizWiz_train_00007489.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A computer keyboard sitting on top of a wooden desk.

Visual question: what make is this computer?

Answers:

  1. unsuitable
  2. unanswerable
  3. hp
  4. unsuitable
  5. unanswerable
  6. i am unable to see make
  7. unanswerable
  8. dell
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a black keyboard with white and purple ear buds
  2. A laptop keyboard and touchpad with a pair of earbuds laying across it.
  3. BLACK COMPUTER KEYBOARD AND A PAIR OF HEADPHONES
  4. IMAGE WAS CLEAR BUT IT WAS NOT ITEM
  5. Laid out laptop with a set of earbuds plugged into it

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 36: VizWiz_train_00006647.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on a table.

Visual question: Okay, I need to know which of these jars is the almond butter. I could go ahead and taste it, but I'm allergic to peanut butter and I don't know which jar is which. Thank you.

Answers:

  1. right metal lid
  2. right jar
  3. right jar
  4. top right
  5. 1 on left
  6. unsuitable
  7. almost butter on right
  8. unsuitable
  9. kill
  10. top right metal lid almond

Reasons why answers differ:

Image captions:

  1. A jar of almond butter and another jar that is of similar type on a wooden table.
  2. A plastic jar of peanut butter on the left and a glass jar of almond butter on the right.
  3. Two containers sit on a wooden table with one label that reads no stir almond butter.
  4. Two jars of food sitting on a table, one of which is almond butter.
  5. Two jars of food sitting on a wood table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 37: VizWiz_train_00012661.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a drink.

Visual question: Please tell me what's in this can?

Answers:

  1. cream mushroom
  2. cream mushroom soup
  3. cream mushroom soup
  4. cream mushroom
  5. cream mushroom soup
  6. cream mushroom soup
  7. cream mushroom soup
  8. cream mushroom soup
  9. cream mushroom soup
  10. cream mushroom soup

Reasons why answers differ:

Image captions:

  1. a cream contains mushroom packed in a bottle
  2. A hand holding a can of cream of mushroom soup up to the camera
  3. a sealed and unopened can of cream of mushroom
  4. A tin can of cream of mushroom soup is held in someone's hand.
  5. Can of ready made cream of mushroom soup

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_train_00007478.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a building with a clock on it.

Visual question: What is the title of this book?

Answers:

  1. unsuitable
  2. unsuitable
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. artwork
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A red old fashioned school house painting with children in front.
  2. An illustration of a red school house with people.
  3. Painting of a red school house that says "Sterling School District #2"
  4. Painting of old fashioned school house with children in front of door.
  5. Someone has photographed a painting of a red school house with a teacher and children in front,

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00019819.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: What kind of eggnog is this?

Answers:

  1. unanswerable
  2. unanswerable
  3. i cant say
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. no idea
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A close up of a nutrition facts label for some kind of food.
  2. A food nutrition label with a green background.
  3. An label which contains the percentage of nutrients stuck behind a plastic container.
  4. Quality issues are too severe to recognize visual content.
  5. The nutritional information on the back of a container of food.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 40: VizWiz_train_00013002.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a sign with an orange book.

Visual question: How long should I cook this for in the oven?

Answers:

  1. 15 18 minutes
  2. unsuitable
  3. 15 18 mins
  4. 15 18 minutes
  5. 15 18 min
  6. 15 18 minutes
  7. i don no
  8. 15 18 minutes
  9. 15 18 minutes
  10. 15 to 18 minutes

Reasons why answers differ:

Image captions:

  1. A package of food has cooking directions on the back.
  2. A red package of food with the cooking instructions displayed
  3. An orange piece of paper giving instructions for cooking a chicken recipe is shown.
  4. Directions for chicken and sauce are shown on the back of this package.
  5. image shown a page that full of text.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_val_00000946.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What kind of system is this?

Answers:

  1. desktop
  2. unsuitable image
  3. windows
  4. desktop computer
  5. windows
  6. windows
  7. windows
  8. windows pc
  9. windows
  10. windows

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A desktop computer with package of ramen noodles on keyboard.
  2. A pack of dried noodles laying on a computer keyboard.
  3. A package of Ramen noodles sits on top of a computer keyboard on a desk, and a computer screen, computer speaker, and two people can be seen in the background.
  4. Desk with computer monitor and keyboard with windows logo.
  5. I see a desk with a computer and a keyboard with a package of food on the keyboard in an office with one person, at least, mostly hidden in the background.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 42: VizWiz_train_00000354.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A can of soda sitting on a table.

Visual question: What is this?

Answers:

  1. saxa
  2. salt
  3. saxa
  4. can
  5. saxa
  6. saxa
  7. saxa
  8. saxa
  9. saxa
  10. can

Reasons why answers differ:

Image captions:

  1. a can of Saxa table salt, another jar of seasoning and a jar of marmalade
  2. a container of something call saxa on a table
  3. A red can of seasoning is sitting on a blue table with two other seasoning containers behind it.
  4. A red can of something called Saxa with two other bottles behind it, all sitting on a table
  5. some type of table that is blue and has a container red

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_val_00007019.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 44: VizWiz_train_00011077.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of food next to a bowl of coffee.

Visual question: What is in this can?

Answers:

  1. vegetarian beans
  2. unanswerable
  3. baked beans
  4. unanswerable
  5. beans
  6. beans
  7. baked beans
  8. beans
  9. unsuitable
  10. beans

Reasons why answers differ:

Image captions:

  1. A can of food is on the left edge of the frame.
  2. A can of food laying on a surface showing only the right side of the can.
  3. a can of food or beans with the word Maid on it
  4. A picture of what appears to be a can of beans.
  5. An unopened can of baked beans laying on a table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 45: VizWiz_train_00000549.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a computer keyboard.

Visual question: Is this dayquil or nightquil?

Answers:

  1. unsuitable
  2. unsuitable
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A black and white document that is upside down and isn't readable.
  2. A label of a package is on top of a table.
  3. An excerpt from a sheet of text that is written on a white surface.
  4. The back of a food label with black font lettering
  5. The page or paper is full of text but is too blurry to identify what it says.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_val_00001396.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup on a table.

Visual question: What soda is this?

Answers:

  1. diet pepsi max
  2. diet pepsi max
  3. pepsi max
  4. diet pepsi max
  5. diet pepsi max
  6. diet pepsi
  7. diet pepsi max
  8. pepsi max
  9. diet pepsi max
  10. diet pepsi max

Reasons why answers differ:

Image captions:

  1. A bottle of soda is displayed on the side of the box
  2. A picture on the television of a bottle of diet Pepsi.
  3. An advertisement for diet Pepsi max which is an invigorating cola.
  4. appears to be a picture of a TV screen
  5. Backlit advertisement for Diet Pepsi Max on a vending machine.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_train_00018959.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A laptop computer sitting on top of a bed.

Visual question: What is it?

Answers:

  1. laptop computer
  2. computer
  3. laptop
  4. laptop
  5. laptop
  6. computer
  7. laptop
  8. computer
  9. dti
  10. laptop

Reasons why answers differ:

Image captions:

  1. A laptop opened on a round table and powered on.
  2. A laptop screen turned on and placed on a table.
  3. A wonderful view of the fog windows in the room is very thick
  4. I see a laptop on the in the dark
  5. Laptop computer open on a table with screen brightly lit.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 48: VizWiz_val_00000948.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Is it still checking for problems?

Answers:

  1. yes
  2. yes
  3. yes
  4. yes
  5. yes
  6. yes
  7. yes
  8. yes
  9. yes
  10. yes

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor is pictures, with a black background screen, and a startup repair window open, as it seems unable to start.
  2. a computer screen with a progress bar window on it
  3. A pop-up box on a computer screen with Startup repair and a attempting repair slider.
  4. A portion of a computer screen with text, a progress bar, and a button to click.
  5. Computer screen with a black background and open window containing a warning message.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00007468.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A computer keyboard sitting in front of a building.

Visual question: What is this?

Answers:

  1. keyboard monitor
  2. keyboard
  3. computer keyboard
  4. keyboard
  5. keyboard
  6. keyboard
  7. keyboard
  8. keyboard
  9. computer keyboard
  10. keyboard

Reasons why answers differ:

Image captions:

  1. A black, oval, split hand computer keyboard in front of a computer screen, on top of a work surface.
  2. A computer monitor and black keyboard keys with white letters.
  3. Appears to be a picture of a computer keyboard
  4. Ergonomically designed computer keyboard with the left and right hand keys separated.
  5. Pictured is an up close view of a black keyboard and a bright computer screen.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 50: VizWiz_train_00012500.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a television with a light on it.

Visual question: Is the screen on now?

Answers:

  1. no
  2. yes
  3. nothing
  4. no
  5. no
  6. no
  7. no
  8. no
  9. off
  10. no

Reasons why answers differ:

Image captions:

  1. A black computer monitor is sitting on a desk.
  2. A black computer screen with the letters IBM on the top.
  3. a black older IBM monitor with a flash on it
  4. An IBM flat screen monitor is on a desk near an Apple device.
  5. An image of a computer monitor screen with operating buttons on the lower right

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:2 / 5 annotators

Showing images 0 - 0 out of 0 matching images.