Images are displayed from Training and Validation sets only.

Image 1: VizWiz_val_00001481.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a train on a wall.

Visual question: what's this? what's this?

Answers:

  1. thomas train
  2. i see thomas train
  3. thomas train
  4. thomas tank engine
  5. thomas train
  6. thomas train
  7. train
  8. thomas train
  9. thomas train
  10. fdgdfg

Reasons why answers differ:

Image captions:

  1. A grey smiley face is on a blue train engine car with Chinese symbols at the bottom.
  2. A large image of a blue cartoon train is seen
  3. a screen displaying a cartoon with a circular head
  4. An image of an item with Thomas the Tank Engine on it.
  5. Thomas the Train character, blue and grey with a smiling face

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 2 / 5 annotators

Image 2: VizWiz_val_00001811.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bright light with a sun in the background.

Visual question: What's this?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unsuitable
  9. flash
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A bright glare is in the center of a dark room.
  2. A dark surface containing the words signal up.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 3: VizWiz_train_00016605.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What flavor of pop tarts are these?

Answers:

  1. frosted smores
  2. smores
  3. smores
  4. frosted smores
  5. smores
  6. smores
  7. smores
  8. smores
  9. smores
  10. frosted smores

Reasons why answers differ:

Image captions:

  1. A box of frosted S'more flavored Pop Tarts with chocolate and marshmallow filling.
  2. A close up of the picture on a box of pop tarts.
  3. Box of Frosted S'mores Pop Tarts with an image of a s'more and that it is a good source of 8 vitamins and minerals
  4. Close up of a popular name brand breakfast snack in a blue box with white text and a picture of s'mores on the front
  5. I see a box of pop tarts on the table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 4: VizWiz_train_00005996.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book sitting on top of a wooden table.

Visual question: What is this?

Answers:

  1. unsuitable
  2. unanswerable
  3. croutons
  4. salad
  5. snack
  6. unsuitable
  7. food
  8. packaged food
  9. unsuitable
  10. food

Reasons why answers differ:

Image captions:

  1. A plastic package of ready to cook food is laid face down on a table.
  2. A sealed pouch of some sort of side dish sitting on a counter
  3. A wonderful view of the fog windows in the room is very thick
  4. an unopened bag of a product with food images
  5. Instructions are on the back of the food package

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 5: VizWiz_train_00013612.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sunset in the sun.

Visual question: What is on the screen?

Answers:

  1. unsuitable
  2. unsuitable
  3. crowd people
  4. light
  5. large crowd people
  6. sports
  7. unsuitable
  8. people
  9. people
  10. large crowd people gathering in celebration

Reasons why answers differ:

Image captions:

  1. A group of people are seen on the screen of a TV.
  2. A picture of a TV screen with many people.
  3. A television monitor with a crowd of people on TV.
  4. A TV screen with a gathering of people on it.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 2 / 5 annotators

Image 6: VizWiz_val_00005251.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A luxpro digital thermostat on a beige wall.
  2. a programmable thermostat with two triangular controls on top right
  3. A room control thermometer temperature reading 74 degrees with up and down buttons.
  4. A white digital thermostat hanging on a tan wall.
  5. A white thermostat is on top of the wall.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 7: VizWiz_val_00001013.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What letters do I type in to win the prize?

Answers:

  1. tlz
  2. tlz
  3. tlz
  4. tlz
  5. tlz
  6. tlz
  7. tlz
  8. tlz
  9. tlz
  10. tlz

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor with a window open on it.
  2. A computer screen shows the user has just won ten swag bucks.
  3. A computer screen wanting the user to fill a captcha field.
  4. A computer screen with a tab on the swagbucks website.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 8: VizWiz_val_00004763.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A very wonderful view and worth seeing at all times, my friend
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 9: VizWiz_val_00004837.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A hand is in front of a white bathroom sink
  2. A person is holding their hand over a sink.
  3. A person's left hand and thumb on top of a white ceramic surface
  4. A person's thumb is held over the sink.
  5. Someone hand is above a white sink and only the thumb is clearly seen.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 10: VizWiz_train_00020517.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A can of mushroom soup is shown in blurry close up on a plain counter top.
  2. A can of soup of on the table.
  3. A partially shown can of low sodium mushroom soup sits on a counter top.
  4. A white can of golden mushroom soup on a flat beige counter top.
  5. Canned good with white label and black text with picture of canned good on the label.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 11: VizWiz_train_00022527.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A microwaves LCD screen is shown with the time
  2. Digital timer and control button panel on a microwave oven.
  3. Input controls for a microwave, display reads 00:00.
  4. Owen indicator is open and it shows zero minutes
  5. Touchpad and screen for a microwave oven

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 12: VizWiz_val_00005741.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up of a red and yellow graphic pattern or a painting.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Red fabric with various textures and yellow square objects attached to it

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 13: VizWiz_train_00008650.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a toothbrush on a table.

Visual question: What is this?

Answers:

  1. triscuits
  2. triscuits
  3. triscuit snacks
  4. triscuits
  5. crackers
  6. triscuit
  7. triscuit
  8. triscuit
  9. crackers
  10. triscuit

Reasons why answers differ:

Image captions:

  1. A box of crackers is on top of a table.
  2. A yellow box of triscuits and a container of drink are shown on a wooden table.
  3. Part of a Triscuit package and green tea jug are seen sitting on a wood surface.
  4. The box is rectangular and has the word Triscuit on with a small red triangle on the top corner.
  5. The top of a box of Triscuit crackers on a wooden table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 14: VizWiz_train_00013407.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A wooden chair sitting on top of a table.

Visual question: What about now, please?

Answers:

  1. what
  2. no
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. unanswerable
  10. chair

Reasons why answers differ:

Image captions:

  1. A brown chair is in the kitchen by the table
  2. A brown wooden kitchen chair pushed into the table.
  3. A light brown wooden kitchen chair pushed under a kitchen table.
  4. A light colored wooden chair is pushed into a kitchen table
  5. A wooden chair with vertical slats is pulled up to the kitchen table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 15: VizWiz_val_00005407.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Back view of a box of food showing instructions on how to cook it.
  2. box of food with cooking instructions facing forward
  3. Cooking instructions on the back of a Marie Callender's frozen dinner.
  4. Microwave instructions are shown on the back of this meal.
  5. the back of a package of food showing preparation instructions

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 16: VizWiz_train_00000693.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a dark room.

Visual question: Is it a computer?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. no
  6. no
  7. no
  8. unsuitable
  9. no
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A dark room with very little light showing a tile floor
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 17: VizWiz_train_00019902.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A wooden board is next to a box.

Visual question: What's in this picture?

Answers:

  1. casserole
  2. casserole
  3. casserole
  4. casserole box
  5. casserole
  6. casserole
  7. unsuitable
  8. casserole
  9. unanswerable
  10. table box

Reasons why answers differ:

Image captions:

  1. a box of some sort of desserts on a table
  2. A casserole packaged in a box from the brand M&M.
  3. Food is in a box on the table, and it says M&M casserole on the side.
  4. Picture of a box containing a dessert of some kind.
  5. Pre-packaged orange casserole in a box reading Les Aliments M&M.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 18: VizWiz_train_00007624.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: What flavor is this?

Answers:

  1. jamaican rum
  2. unsuitable
  3. unsuitable
  4. jamaican
  5. unanswerable
  6. unsuitable
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a bottle of an unknown consumable with the word Jamaican on the label
  2. A hand holding a brown bottle with a gold paper label.
  3. A hand holds a bottle of something mostly out of frame.
  4. A person is holding a brown bottle with a gold label
  5. Someone holding a bottle that appears to be from Jamaica.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 19: VizWiz_train_00019032.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a computer and a keyboard.

Visual question: What is this?

Answers:

  1. keyboard
  2. keyboard
  3. keyboard
  4. keyboard
  5. keyboard
  6. computer keyboard
  7. keyboard
  8. keyboard
  9. keyboard
  10. keyboard

Reasons why answers differ:

Image captions:

  1. a device which has keys used for typing
  2. A pink color keyboard is on the floor.
  3. A silver keyboard with a slight dirty gray keyboard pads.
  4. Close up view of a keyboard with the f5 key in the top left corner and the M key in the lower right corner.
  5. High quality computer keyboard is shown by this image.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 20: VizWiz_train_00003753.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a window with a man on it.

Visual question: Book is this?

Answers:

  1. italian songs arias
  2. italian songs
  3. italian songs arias
  4. italian songs arias
  5. songs
  6. italian songs arias
  7. italian songs
  8. italian songs
  9. songs
  10. card

Reasons why answers differ:

Image captions:

  1. A book of Italian Songs and Arias with an image of a building and statue on the cover
  2. A book or an album cover with the title Italian Songs and Arias.
  3. a bunch of papers bind together which contains songs
  4. a paperback book of Italian songs and arias
  5. Here is a book of Italian song and arias

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 21: VizWiz_train_00011257.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man laying in a bed with a book.

Visual question: How much

Answers:

  1. 1000
  2. 1000 yen
  3. 1000
  4. 1000
  5. 1000 international currency
  6. 1000
  7. 1000
  8. 1000
  9. 1000
  10. 1000

Reasons why answers differ:

Image captions:

  1. A 1,000 Asian bank note is sitting on top of a table.
  2. A 1000 note bill sitting on a wooden table.
  3. a large euro currency bill meant to buy stuff with
  4. A multicolored bank note that displays a person's portrait and the value of 1000.
  5. A piece of money for a country other than the USA sits on a brown desk.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 22: VizWiz_train_00021191.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bag of fresh potatoes sit on a countertop
  2. A counter with shelves in the background containing various food items
  3. A large plastic bag full of chicken and behind it on the counter is a dozen eggs and a bag of potatoes.
  4. A plastic package of raw chicken breasts is on the table in front of a microwave.
  5. Package of Farm Fresh food in front of a microwave.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 23: VizWiz_val_00005692.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A canister of V05 hair gel on a black counter.
  2. A container of hair gel wax is on a counter.
  3. A jar of v05 Obey and play gel wax
  4. a small container of V05 gel wax on top of a counter
  5. An image showing a multivitamin pot on the floor.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 24: VizWiz_train_00001661.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bowl on a table.

Visual question: what country is this coin from?

Answers:

  1. norway
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. crotia
  6. unanswerable
  7. mexico
  8. canada
  9. canada
  10. new zealand

Reasons why answers differ:

Image captions:

  1. A Canadian loonie one dollar coin on a coffee table.
  2. A close up of a coin with a whale engraved on it.
  3. A round silver metal object that looks like a coin sitting on a wooden surface
  4. A silver coin with a whale on it sitting on top of a dark wood surface.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 25: VizWiz_train_00013976.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue and white plate.

Visual question: What's in this container?

Answers:

  1. cream cheese
  2. cream cheese
  3. plain cream cheese
  4. plain cream cheese
  5. unanswerable
  6. cream cheese
  7. imitation cream cheese
  8. imitation cream cheese
  9. cream cheese
  10. imitation cream cheese plain

Reasons why answers differ:

Image captions:

  1. A container of cream cheese showing the ingredients list
  2. A cream cheese lid is showing its ingredients label.
  3. A label of the back of a imitation cream cheese showing its ingredients.
  4. Imitation cream cheese package showing the ingredients on the label.
  5. Plain cream cheese container with ingredients and bar code.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 26: VizWiz_train_00015728.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A stack of books sitting on top of a book.

Visual question: Is this dayquil or nightquil?

Answers:

  1. unsuitable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. package ingredients
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A back view of an opened box of a food product that shows its ingredients and directions to use.
  2. Close up of dosing instructions on the back of a medicine with a blue a white label
  3. Instructions for cold medicine are shown on the back of this package.
  4. Quality issues are too severe to recognize visual content.
  5. The back of a white food package with blue printing containing nutritional facts.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 27: VizWiz_train_00017922.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on a table.

Visual question: What are these?

Answers:

  1. campbells chunky chicken broccoli potatoes soup
  2. chicken broccoli potatoes soup
  3. chunky soup
  4. soup cans
  5. soup
  6. cambells chunky chicken broccoli potato soup
  7. cans soup
  8. chicken broccoli cheese potatoes
  9. cans soup
  10. rwgrwg

Reasons why answers differ:

Image captions:

  1. A red can of Campbell's Chunky brand chicken broccoli with potatoes soup.
  2. A red can of Campbell's chunky chicken broccoli
  3. Two cans of Campbell's Soup on a counter top.
  4. Two cans of Chunky Campbell's soup are sitting next to each other.
  5. Two red cans of Campbell's chunky soup, one being Chicken, broccoli and cheese with potatoes

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 28: VizWiz_train_00012511.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s head.

Visual question: What is it say on this tag?

Answers:

  1. blank
  2. nothing
  3. nothing
  4. unanswerable
  5. nothing
  6. nothing
  7. nothing to say
  8. nothing
  9. nothing
  10. nothing

Reasons why answers differ:

Image captions:

  1. A hand holds a white card, no text visible.
  2. A person holds a white card in their hand.
  3. A white card has no text or anything on it
  4. In this picture is a image of a badge
  5. Persons hand holding a large white piece of cardboard.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 29: VizWiz_train_00008056.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a wall with a sign on it.

Visual question: What flavor is this coffee, please? Thank you. Have a nice day.

Answers:

  1. unanswerable
  2. scmalra
  3. i dont know
  4. sumatra
  5. starbucks sumatra
  6. sumatra
  7. unsuitable
  8. starbucks
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A 50 gram package of ground Starbucks Sumatra coffee.
  2. a one point seven six ounce bag of Starbucks Sumatra ground coffee
  3. A package of Starbucks product is on an old white table
  4. Bog of Starbucks Sumatra coffee that appears to be unopened.
  5. It looks like a bag of coffee beans from Starbucks

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 30: VizWiz_val_00006461.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A message on white paper with black text saying the water will be shut off.
  2. A paper has a apology note and a time written in blue pen.
  3. A piece of paper with text that explains to a tenant that their water will be shut off from 12pm-3pm and also says "Sorry for any inconvenience".
  4. a white paper showing some text notifying the tenants that the water in the house will be turned off between 12pm to 3pm
  5. Letter from Maintenance held up by a hand.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 31: VizWiz_train_00002960.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cell phone sitting on top of a couch.

Visual question: What is this?

Answers:

  1. cell phone
  2. smart phone
  3. blackberry cell phone
  4. blackberry
  5. blackberry phone
  6. blackberry
  7. iphone
  8. blackberry
  9. blackberry
  10. blackberry phone

Reasons why answers differ:

Image captions:

  1. A BlackBerry cell phone from AT&T sits on the leg of a person wearing blue jeans.
  2. A BlackBerry cell phone with a blank screen sitting on someone's leg.
  3. BlackBerry mobile phone on top of blue jeans.
  4. Rectangle, with smooth screen, says blackberry on the top, right green button, next to this is six white dots in a circle with one in the center, silver rimmed black button next, curved arrow next, red power button next, says at&t on bottom
  5. some type of old blackberry device that is not being turned on

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 32: VizWiz_train_00020642.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A picture of some rows of cards or pictures.
  2. A strip of postage stamps displaying flags of the different States on them.
  3. A strip of stickers or postage stamps is sitting on a beige surface.
  4. A strip of what looks like six labels or stamps is on a yellowish surface.
  5. I believe it is a roll of stamps or stickers.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 2 / 5 annotators

Image 33: VizWiz_val_00002388.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cell phone sitting on top of a tv.

Visual question: What is this?

Answers:

  1. iphone
  2. iphone
  3. iphone
  4. cell phone
  5. cell phone
  6. smart phone
  7. iphone
  8. phone
  9. i phone
  10. iphone screen

Reasons why answers differ:

Image captions:

  1. A white cellular phone with cracks in the screen.
  2. a white plastic cell phone with the screen featuring various apps
  3. An iPhone that has a cracked screen and is showing the phone's home screen.
  4. an older iPhone version that is white and black
  5. Square yellow phone with bright small squares in it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 34: VizWiz_train_00017511.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man is standing in front of a wall.

Visual question: Name of that please?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. Looks like a stick of deodorant or a container or shampoo, and possibly a thumb and hand in the background.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 35: VizWiz_train_00020696.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A green small bottle of water in a living room
  2. A large bottle of Martinelli's Sparkling Apple Cider
  3. alcohol sparkling cider in green bottle liquid sweet taste
  4. Glass bottle of Martinelli's sparkling cider, patterned rug in background
  5. Sparkling cider in a glass bottle in front of a carpet.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 36: VizWiz_train_00011816.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person sitting at a table next to a pizza.

Visual question: Hi, what is this can?

Answers:

  1. black olives
  2. olives
  3. olives
  4. black olives
  5. black olives
  6. olives
  7. olives
  8. black olives
  9. black olives
  10. black olives

Reasons why answers differ:

Image captions:

  1. A box of Moist Deluxe cake mix, a can of black olives and a blue & black jacket sitting around a table.
  2. A box of yellow cake mix sitting on a light wood colored table.
  3. A can and box is placed on a table.
  4. A can of olives and cake mix sitting on a desk.
  5. An unopened canned good and a box of cake mixture both lying on the table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 37: VizWiz_val_00007592.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A clear cup with red writing on it of water on a table with a curtain behind it.
  2. a clear glass beverage cup with clear liquid and some ice in it
  3. A clear glass from Golden City Rayong that has ice and a clear liquid.
  4. a glass of water in a golden city Rayong cup on a table
  5. glass contain ice water and it is marked in the glass golden city Rayong in red color.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 38: VizWiz_train_00015731.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a remote control on a table.

Visual question: What does this say?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unanswerable
  9. instructions
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A piece of paper containing instructions for assembling a shelf, a remote control, and some buttons
  2. An instruction booklet, a remote and a button designed piece of art sit on a table.
  3. Instruction guide with a list of all the parts included
  4. Instructions for building a shelf with a TV remote in the bottom of the picture with random buttons.
  5. Instructions on how to assemble a shelving system.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00003143.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a surfboard on a wall.

Visual question: What kind of hot pockets are these?

Answers:

  1. spinach artichoke
  2. spinach artichoke
  3. spinach artichoke
  4. lean
  5. spinach artichoke
  6. spinach artichoke
  7. culinary creations spinach artichoke
  8. spinach artichoke
  9. spinach artichoke
  10. spinach artichoke

Reasons why answers differ:

Image captions:

  1. A box of Lean Pockets Culinary Creations in a Spinach flavor.
  2. A close up of a box of frozen food.
  3. A close up picture of a Lean Pockets Culinary Creations box.
  4. A spinach and artichoke sandwich made by Lean Pockets.
  5. A white and green box containing 2 lean pockets filled with spinach and artichoke.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 40: VizWiz_train_00019846.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a red and white sign.

Visual question: please what is this store? thank you

Answers:

  1. burger king
  2. burger king
  3. unanswerable
  4. burger king
  5. burger king
  6. burger king
  7. burger king
  8. burger king
  9. burger king
  10. burger king

Reasons why answers differ:

Image captions:

  1. a red and white Burger King sign with Christmas decor above it
  2. a red Burger King sign with white text
  3. Store front for the Burger King fast food chain.
  4. The entrance to Burger King looking up at the large sign.
  5. The front signage of a Burger King restaurant.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00019831.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man reading a picture on a bed.

Visual question: What is this CD artist and title?

Answers:

  1. gibson
  2. gibson
  3. gibson
  4. unsuitable
  5. gibson
  6. gibson
  7. unsuitable
  8. unanswerable
  9. gibson
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A cd case has the picture of two men
  2. A CD case has two men on it and one is holding a guitar.
  3. A CD sitting up right on a white fabric material.
  4. A CD with the title Gibson and displays a picture of two men.
  5. CD case or small record cover with two men pictured.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_train_00012880.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a black and white mouse.

Visual question: What is this?

Answers:

  1. cup
  2. coffee cup
  3. black white mug
  4. coffee cup
  5. coffee cup
  6. coffee mug
  7. coffee mug white black stripe
  8. coffee cup
  9. mug
  10. teacup

Reasons why answers differ:

Image captions:

  1. a black and white colored mug on a black background
  2. A mug for that all important cup of coffee or tea.
  3. A white ceramic coffee cup with a black band on the top.
  4. A white coffee cup with a black stripe at the top.
  5. A white mug with a black rim at the top.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 43: VizWiz_train_00017163.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of an image of a clock.

Visual question: describe what is on the front of my shirt.

Answers:

  1. black patch
  2. unsuitable
  3. black
  4. unsuitable
  5. unsuitable
  6. patch
  7. unknown
  8. embroidery
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. An article of clothing that has stitching on it.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 44: VizWiz_val_00003717.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: What is this?

Answers:

  1. wine
  2. wine
  3. bottle wine
  4. castillo de monséran cariñena
  5. beer
  6. castillo de monsgran carinena
  7. castillo de
  8. bottle
  9. wine
  10. bottle

Reasons why answers differ:

Image captions:

  1. A bottle of alcohol is on top of a pair of jeans.
  2. A bottle of wine with a sticker from Best Buy wine enthusiasts
  3. A Castillo de Monseran bottle of wine is shown.
  4. appears to be a picture of a can of beer
  5. Pictured is the label on a bottle of wine.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 45: VizWiz_val_00005119.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A yellow and white box of artificial sweetener brand of splenda
  2. Box of splenda on a green fabric background
  3. Quality issues are too severe to recognize visual content.
  4. the top portion of a box of splenda sugar substitute
  5. Yellow packaging for a food item on a green cushion.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_train_00005628.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bathroom sink with a cup of water on it.

Visual question: What is this?

Answers:

  1. unsuitable
  2. unsuitable
  3. sink
  4. unsuitable
  5. sink
  6. unanswerable
  7. unanswerable
  8. sink
  9. unsuitable
  10. kitchen sink

Reasons why answers differ:

Image captions:

  1. A container is sitting on the edge of a kitchen sink.
  2. a container of food in front of a sink
  3. A container that is white with a pink lid and nutrition facts is at the forefront and red soap and a sink are in the background
  4. a kitchen sink with bottles in a window sill and a flower pot in the foreground.
  5. A tub of cleaner on the edge of a sink counter that is cluttered with cleaners.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_val_00001474.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a keyboard and a laptop.

Visual question: What is this?

Answers:

  1. keys
  2. keyboard
  3. keyboard
  4. keyboard
  5. keyboard
  6. keyboard
  7. keyboard
  8. keyboard
  9. laptop keyboard
  10. keyboard

Reasons why answers differ:

Image captions:

  1. A closeup of the keys to a computer keyboard.
  2. a keyboard with white and black lettering on it
  3. A partial white computer keyboard containing black keys
  4. A silver keyboard that is part of a laptop that has black keys.
  5. The middle section of a keyboard is showing and the keys are black with white lettering.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00003111.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a refrigerator with a sticker on it.

Visual question: What is the temperature?

Answers:

  1. unanswerable
  2. unanswerable
  3. 2
  4. 72
  5. unanswerable
  6. 72
  7. 72
  8. unanswerable
  9. 72
  10. 72

Reasons why answers differ:

Image captions:

  1. a thermostat control panel with an up and down button and the cool, off, and heat settings on the bottom right
  2. A wall clock hanging in the wall are described
  3. Honeywell home security system is shown by here..
  4. Quality issues are too severe to recognize visual content.
  5. UP CLOSE SNAPSHOT OF A WHITE TEMPERATURE GAUGE

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 49: VizWiz_train_00008610.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a train on a wall.

Visual question: I think I turned on a light. What temperature is the oven set at?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. 0
  7. off
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box with various clocks and watches for timing.
  2. an analog display on an old oven showing a twist timer with white numbers and a black background
  3. An oven timer that is old with scratches.
  4. Appears to be an old cooking range with mechanical timers, a beat up black dashboard, and a white cooktop
  5. knobs on an old oven They have a clock and temperature reading with clock type hands

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 50: VizWiz_train_00023025.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a computer screen with an assessment question and multiple choice answers next to a graph
  2. A laptop computer screen showing a quiz question with a graph.
  3. A photo of a computer screen showing part of a quiz question and graph
  4. I see a stock chart written on a box
  5. On a PC screen a display of a multiple choice formal question is displayed next to a graph.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators
