Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00022102.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black device on the lap of an individual wearing denim pants and a white and maroon plaid shirt.
  2. a black plastic device on a person's lap
  3. A black remote control in someone's lap who is wearing blue jeans.
  4. A man wearing blue jean sitting with one leg placed above another and something kept on lap.
  5. Someone in blue jeans sitting down with a black object on their lap.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 2: VizWiz_train_00006429.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of coffee sitting on a table.

Visual question: What kind of soda pop?

Answers:

  1. unsuitable
  2. unanswerable
  3. coke 0
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unsuitable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A can of Coke is on a coaster on a table.
  2. A can of juice on top of a green coaster.
  3. Black can of soda can but the name of it on the other side.
  4. cylindrical black can with white nutrition facts label showing, placed on top of a round green flat object, sitting on a brown glass table next to a green cushioned bench to the left of it.
  5. I see a black can of soda sitting on the table

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_val_00006516.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A person is wearing a blue shirt and is sitting on a couch.
  2. A person's left arm while wearing a short sleeved shirt.
  3. part of a blue shirt someone is wearing
  4. Quality issues are too severe to recognize visual content.
  5. The front arm or shoulder area of an individual in a blue shirt.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 4: VizWiz_train_00017518.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee sitting on top of a table.

Visual question: What is this?

Answers:

  1. cup
  2. coffee mug
  3. coffee cup
  4. coffee
  5. coffee mug
  6. coffee
  7. coffee cup saucer
  8. wq
  9. cup
  10. mug

Reasons why answers differ:

Image captions:

  1. a Christmas mug, sitting on a saucer, a green lighter and some tissues all sitting on a table
  2. A coffee cup with a depiction of a teddy bear is on a saucer on the table.
  3. A dark tabletop containing a mostly empty cup of coffee, a box of tissues, a green lighter and an ashtray.
  4. A white coffee mug with red trim sitting on top of a white coaster on a wooden dresser.
  5. a white cup is placed on a wooden surface

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 5: VizWiz_train_00021297.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A box of Aunt Jemima buttermilk pancake mix.
  2. A box of buttermilk pancake and waffle mix made by Aunt Jemima.
  3. A box of pancake mix on a counter top.
  4. A red box of buttermilk pancake and waffle mix sitting on a counter
  5. some aunt Jemima buttermilk pancakes in a box

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_train_00020552.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A hp printer with some notebooks behind it.
  2. A old printer is black and silver in color
  3. a printer device that is black and silver with text on it
  4. A silver and black printer next to a spiral notebook
  5. All in one printer scanners like this can come in handy for many things.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 7: VizWiz_train_00016791.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a picture of a sign.

Visual question: What color is this shirt?

Answers:

  1. black
  2. black orange yellow green
  3. black read yellow green writing
  4. black
  5. black red yellow green letters
  6. black
  7. black
  8. black
  9. black red yellow blue quicksilver logo
  10. black

Reasons why answers differ:

Image captions:

  1. A black t shirt with "Quick Silver" writing on it in red, yellow and green color shades.
  2. A black tee shirt with red yellow and green lettering on it.
  3. in this picture is a image of a shirt
  4. Quality issues are too severe to recognize visual content.
  5. Red, yellow, and green lettering on a black background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00016750.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black meter sitting on top of a wooden floor.

Visual question: In front of you there's two coasters. I need to know which one is salt and pepper. Let me know with a left or right, which is salt and which is pepper. Alright, thanks.

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. left pepper right salt
  5. salt right pepper left
  6. left pepper right salt
  7. unanswerable
  8. unsuitable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A black salt and pepper shaker set on a counter, in front of a wooden bread box.
  2. A salt and pepper shaker made of stainless steel.
  3. A set of silver metallic salt and pepper shakers.
  4. Grey metallic salt and pepper shakers in front of a brown wood bread box
  5. Salt and pepper shakers are on top of a table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 9: VizWiz_val_00003426.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of water on a table.

Visual question: What is this?

Answers:

  1. water
  2. water
  3. natural crystal water
  4. natures crystal water
  5. water bottle
  6. water bottle
  7. water bottle
  8. water
  9. water bottle
  10. water

Reasons why answers differ:

Image captions:

  1. a plastic bottle of nature's Crystal Spring water empty
  2. A bottle of water with the text "nature's crystals"
  3. A water bottle made of relatively thin plastic that can be squished and regain its shape to some extent has a twist off cap.
  4. An empty bottle of Nature's Crystal Spring water
  5. Clear plastic bottle of water with a blue lid and blue and white label.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_train_00019846.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a red and white sign.

Visual question: please what is this store? thank you

Answers:

  1. burger king
  2. burger king
  3. unanswerable
  4. burger king
  5. burger king
  6. burger king
  7. burger king
  8. burger king
  9. burger king
  10. burger king

Reasons why answers differ:

Image captions:

  1. a red and white Burger King sign with Christmas decor above it
  2. a red Burger King sign with white text
  3. Store front for the Burger King fast food chain.
  4. The entrance to Burger King looking up at the large sign.
  5. The front signage of a Burger King restaurant.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 11: VizWiz_val_00007211.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A Vodafone compensation with chip on the somewhere.
  2. Chip technology like the one in this credit card have come a long way.
  3. part of a Vodafone Wellness credit card on a surface
  4. some sort of wellness credit card that is blue and red
  5. Upper left corner of a Vodafone wellness card with an electronic chip in it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 12: VizWiz_train_00000747.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food on a table.

Visual question: What flavor is this?

Answers:

  1. unsuitable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. granola chocolate
  6. unanswerable
  7. kashi honey almond flax flavored bar
  8. unsuitable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A box is featuring a chewy granola bar.
  2. A box of granola bars on top of a hard dark pink surface.
  3. back of a box of Kashi granola bars on a plastic surface
  4. Small bits of almonds, soy grahams, and 7 types of grains are combined together in a small rectangular bar for consumption.
  5. The back of a granola bar box describing ingredients.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 13: VizWiz_train_00012446.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pair of scissors.

Visual question: What color is this?

Answers:

  1. bisque chestnut
  2. beige
  3. oak
  4. cable
  5. black
  6. brown
  7. black
  8. tan
  9. kakhi
  10. tan

Reasons why answers differ:

Image captions:

  1. A black cord on a hardwood floor.
  2. A sustainable plywood and part of a black phone charger.
  3. A USB cord connector laid on a hardwood floor.
  4. A wonderful view of the fog windows in the room is very thick
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 14: VizWiz_train_00005128.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue and white picture.

Visual question: What color is this?

Answers:

  1. blue
  2. royal blue white
  3. blue white writing
  4. blue
  5. blue shirt
  6. blue
  7. blue
  8. blue
  9. blue white
  10. blue color

Reasons why answers differ:

Image captions:

  1. A blue piece of fabric with print on it is pictured.
  2. A blue shirt with white logo and lettering.
  3. A piece of fabric that has a logo on the front and some wording on top.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 15: VizWiz_train_00013331.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of an oven with a remote.

Visual question: What does the thermostat say?

Answers:

  1. unanswerable
  2. 26 73
  3. 26 73
  4. 73
  5. 73
  6. unsuitable
  7. 73 degrees
  8. unsuitable
  9. unanswerable
  10. 26 73

Reasons why answers differ:

Image captions:

  1. A LCD screen of a thermostat on a wall is shown
  2. a thermostat displaying the time and temperature on the wall
  3. Photo is of a wall mounted clock that is displaying the time.
  4. Quality issues are too severe to recognize visual content.
  5. The dial on the wall mounted thermostat shows a display of the temperature.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 16: VizWiz_train_00017671.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book that is sitting on a table.

Visual question: What's in this box?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. dont»t know label barcode
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. bile

Reasons why answers differ:

Image captions:

  1. A box that is taped closed with barcodes on it.
  2. a cardboard box with a bar code on it
  3. A package of sandpaper with two UPC codes on it lays flat on another object.
  4. a white plastic with barcode and some numbers on it
  5. universal product code for what looks to be a cork table mat

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_val_00001208.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a baseball player in a uniform.

Visual question: What video game is this?

Answers:

  1. espn nfl 2k
  2. football
  3. nfl 2k
  4. espn nfl 2k5
  5. espn 2k5
  6. espn 2k football
  7. nfl 2k
  8. football
  9. nfl 2k
  10. football

Reasons why answers differ:

Image captions:

  1. A PlayStation 2 ESPN NFL 2K5 (used): Video Games football number 81.
  2. a SEGA game disc label of ESPN 2K
  3. A video game case with a signed autograph of a football player.
  4. a white color paper showing a basketball player
  5. PS4 rugby games from a gaming company with a player

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 18: VizWiz_train_00006966.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white refrigerator on a table.

Visual question: Can you tell me who this card is from?

Answers:

  1. unsuitable
  2. unanswerable
  3. no
  4. ab co
  5. no
  6. no
  7. no
  8. unanswerable
  9. no
  10. cab co

Reasons why answers differ:

Image captions:

  1. A small piece of paper with some fill in the blank spaces.
  2. Form with a blank dollar-amount line at the bottom.
  3. Paper card with lettering stating in part "ab Co." and a dollar sign below.
  4. Quality issues are too severe to recognize visual content.
  5. Rectangular piece of cardboard with 'AB CO.' written on it and blank dashed lines.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_train_00020095.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a box of food being photographed on its side
  2. A zoomed in ready to eat meal that has mashed potatoes in a black background.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 20: VizWiz_train_00023039.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up of a MacBook laptop computer keyboard
  2. A keyboard with letters and numbers is behind a screen
  3. a open silver laptop keyboard with black keys,
  4. A portion of a silver mac laptop's black keyboard.
  5. The left of a black Apple laptop keyboard

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 21: VizWiz_train_00002711.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: Please tell me what this is. Thank you.

Answers:

  1. dr pepper
  2. dr pepper soda
  3. dr pepper
  4. dr pepper
  5. dr pepper soda
  6. dr pepper
  7. dr pepper
  8. dr pepper
  9. dr pepper can
  10. soda can

Reasons why answers differ:

Image captions:

  1. A can of Dr Pepper and a thumb with a Waste Management flyer posted on a wall in the background.
  2. A person is holding a red can of Dr pepper.
  3. A single can of Dr Pepper soft drink.
  4. canned drink, which is sustained by a person.
  5. Front label DR Pepper soda can in hand

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 22: VizWiz_train_00017698.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a person ' s book.

Visual question: Per the cooking instructions.

Answers:

  1. meat lasagn
  2. unanswerable
  3. unanswerable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. no
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box containing a frozen family style meat lasagna
  2. A conventional and microwave instruction for cooking meat lasagna.
  3. The backside of a frozen food box containing its instructions on how to cook it
  4. Trader Giotto's family style meat lasagna with the ingredients list and cooking instructions telling you to preheat to 375F, pull back the corners to vent, cook for 60-70 minutes, remove the film for another 10 minutes, and let it cool for 5 minutes.
  5. Trader Giotto's meat lasagna cooking directions on the back of the box

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_val_00005408.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A symbol of apple product was captured with a bright spot.
  2. A apple MacBook laptop top cover that is clear and white
  3. The Apple icon on the back of a laptop.
  4. The lid of a white apple laptop with the apple logo on it.
  5. up close photo of a computer with apple symbol on it that is white

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 24: VizWiz_train_00006313.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white wall on a table.

Visual question: I know this is a check and can you read it and tell me how much it is for? Thank you.

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a white page with gray writing on the corner
  2. An open textbook with one blank page.
  3. Open book that has some text written or typed on the right side page.
  4. opened book with a blank page on left and words on right
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 25: VizWiz_train_00011741.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a person ' s face.

Visual question: what color is this

Answers:

  1. grey
  2. grey
  3. unsuitable
  4. grey
  5. grey
  6. grey
  7. dark olive green
  8. tan
  9. white
  10. grey

Reasons why answers differ:

Image captions:

  1. A piece of furniture that has blue and brown paint on it.
  2. a rumpled bed with a dark gray sheet.
  3. Bed sheets like this one can be so cool in the summer.
  4. Quality issues are too severe to recognize visual content.
  5. The back of a steamed light gray dress shirt.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 26: VizWiz_train_00009291.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a remote control.

Visual question: What is it?

Answers:

  1. remote control
  2. tv remote
  3. remote control
  4. remote control
  5. remote control
  6. tv remote
  7. remote
  8. tv remote
  9. tv remote
  10. remote

Reasons why answers differ:

Image captions:

  1. A black remote with different colored buttons sitting on a persons lap.
  2. A person has a TV remote right on top of their lap.
  3. A Sony remote control is resting on someone's lap.
  4. A TV remote is on top of a person's lap.
  5. television remote control placed sideways on jeans-clad legs

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 27: VizWiz_train_00007163.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cell phone sitting on a wooden table.

Visual question: Can I please get a specific answer? What is on this card and is it right side up or upside down? Thank you.

Answers:

  1. 9 upside down
  2. upside down 9 card
  3. 9 wands inverted
  4. upside down 9
  5. candles upside down
  6. upside down 9 images
  7. unsure says 9 upside down
  8. 9 wands right side up
  9. upside down playing card
  10. arrows upside down

Reasons why answers differ:

Image captions:

  1. A card laying on the table that has the word nine on the top.
  2. a red and green color game card showing mine
  3. Nine candle-like figures on card on a wooden desk.
  4. Part of a tarot card for the nine of wands sitting on top of a wooden surface.
  5. Tarot card showing nine swords with a sunset colored background

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_val_00005433.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black and white mug on a wood surface.
  2. A mug holding a paper cup sitting on a wooden surface.
  3. A white cup with a black mug holder on a wooden surface.
  4. In this picture lies a coffee maker mug sitting.
  5. SMALL CUP PLACED ON THE ARM OF A COUCH

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00001716.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A blue car parked on the side of the road.

Visual question: What is it?

Answers:

  1. car
  2. car
  3. car
  4. car
  5. car
  6. car
  7. car
  8. car
  9. car
  10. sedan

Reasons why answers differ:

Image captions:

  1. A blue, four door car with the windows rolled down.
  2. A shiny sedan is parked outside of a store.
  3. A small blue car is parked in the room.
  4. Bright blue four door vehicle with black interior.
  5. Side view of a blue car in a garage.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 30: VizWiz_train_00004897.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a green plant on the ground.

Visual question: I think this is spinach but can you tell me what kind of vegetable this is?

Answers:

  1. mustard
  2. spinach arugula lettuce
  3. arugula
  4. lettuce
  5. spinach
  6. spinach
  7. collard greens
  8. spinach
  9. spinach
  10. lettuce

Reasons why answers differ:

Image captions:

  1. A bundle of green veggies with a red elastic around it
  2. a section showing leaves of plant scattered on the floor
  3. Bunch of green leaves placed on the table
  4. Green leaves with a single red stripe holding together part of the green leaves
  5. some leaves wrapped with a rope on a white piece of cloth

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 31: VizWiz_train_00011381.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a table.

Visual question: Make out what this says?

Answers:

  1. candle in jar
  2. no
  3. lavender chamomile
  4. lavender chamomile
  5. lavender chamomile
  6. lavender chamomile
  7. lavender chamomile
  8. lavender chamomile
  9. scented candle
  10. lavender chamomile

Reasons why answers differ:

Image captions:

  1. A blue jar of Lavender Chamomile in a person's lap.
  2. A lavender Chamomile candle in a mug jar on a man's lap.
  3. A lavender chamomile candle, placed within a glass jar with a handle on the side, and a twist top.
  4. image shows a food bottle on a leg named Lavender Chamomile.
  5. Our own candle company lavender chamomile candle with handle.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 32: VizWiz_train_00012038.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a clock on a table.

Visual question: I know this is coffee creamer, are you able to tell what flavor?

Answers:

  1. vanilla latte
  2. vanilla latte
  3. vanilla latte
  4. vanilla latte
  5. vanilla latte
  6. vanilla
  7. no
  8. unanswerable
  9. vanilla latte
  10. vanilla

Reasons why answers differ:

Image captions:

  1. A package of 24 Coffee creamers in single containers.
  2. A package of single serve vanilla latte coffee creamers.
  3. a small box of coffee creamer pods inside of it
  4. Box of Coffee House vanilla latte creamer on a counter top.
  5. Front package of a 20 count of vanilla latte coffee creamer singles.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 33: VizWiz_train_00003232.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue blanket covered in snow.

Visual question: What is this?

Answers:

  1. carpet
  2. unanswerable
  3. carpet
  4. off white
  5. carpet
  6. carpet
  7. carpet
  8. rug
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A bottle of unknown pills is laying on the carpet.
  2. A bottle with a blue cap sitting on plush carpeting.
  3. a raw material which is used to make pillows,cushion
  4. A white bottle with a blue cap placed horizontally on cotton or woolen cloth.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 34: VizWiz_train_00014502.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a window with a dark background.

Visual question: What does it say on the box?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. ask
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 35: VizWiz_val_00001949.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a book on a bed.

Visual question: what is this

Answers:

  1. chicken
  2. unsuitable
  3. creamy parmesan chicken
  4. food
  5. tv dinner
  6. food
  7. mac cheese
  8. food
  9. marie collenders meal
  10. creamy parmesan chicken

Reasons why answers differ:

Image captions:

  1. A hand is holding a container of noodle mix.
  2. A microwave dinner that says creamy parmesan chicken.
  3. A person is holding up a TV dinner.
  4. An instant food product is colored yellow and has an illustration of pasta on it.
  5. Creamy Parmesan Chicken flavored Marie Callender's microwavable meal.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 36: VizWiz_train_00000401.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man is standing on a suitcase with a white wall.

Visual question: What is this item

Answers:

  1. box
  2. unanswerable
  3. barcode
  4. box
  5. bottom box only barcode visible
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. box
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A hand is holding a white cardboard box with a UPC label showing, and there is a printed shipping label, plastic stool, and area rug that can be seen underneath it.
  2. A large white box resting on a person's knees with a barcode showing.
  3. A white box with a barcode in the middle
  4. White box with barcode sitting on lap.
  5. White package that shows a barcode on the surface

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 37: VizWiz_train_00017452.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is on the side of the road.

Visual question: Can you tell me the content of this notice?

Answers:

  1. no smoking
  2. no smoking
  3. no smoking
  4. yes
  5. no smoking
  6. no smoking maximum penalty $5000
  7. no smoking
  8. no smoking
  9. no smoking
  10. no smoking

Reasons why answers differ:

Image captions:

  1. An instruction is given at the top of some surface
  2. Looks like a photo of the inside Metro car.
  3. Quality issues are too severe to recognize visual content.
  4. Stickers on a gray surface warning no smoking, buckle up and mind your head
  5. Warning labels on the wall of a public transportation vehicle.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_val_00001149.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: If there is any text on the screen, what does it say? Thanks.

Answers:

  1. os x utilities
  2. os x utilities
  3. os x utilities
  4. unsuitable image
  5. os x utilities
  6. dsx utilities
  7. os x utilities
  8. computer utilities
  9. os x utilities
  10. osx utilities

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A mac laptop with the os x utilities screen open.
  2. a picture of a laptop showing os x utilities page and clutter sitting around it
  3. a tablet screen with a os x utilities screen up
  4. Laptop with a grey screen and a white and grey pop up box in front of a window with white blinds closed.
  5. Utility window options are being shown on the computer screen.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00012006.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A television sitting on top of a wooden table.

Visual question: What is this?

Answers:

  1. speaker
  2. speaker
  3. speaker
  4. silver speaker
  5. speaker
  6. small speaker
  7. speaker
  8. speaker
  9. speaker
  10. speaker

Reasons why answers differ:

Image captions:

  1. A silver mini speaker is shown on a reddish-brown table.
  2. A small silver speaker sits against the wall on the top left corner of a desk.
  3. A small speaker is right up against the wall.
  4. A small speaker with a silver screen sitting on a wooden desk against the wall.
  5. speaker fixed in a table with black luggage in room

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 40: VizWiz_train_00013798.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A white toilet sitting on top of a wall.

Visual question: Starting from twelve o'clock, in that direction from top to bottom, going clockwise, what are the different readings on this dial? Thank you.

Answers:

  1. rinse
  2. cycles
  3. rinse spin off 2nd rinse spin off permanent press 10 6 4
  4. spin off 2nd rinse spin off 10 permanent press 6 4
  5. rinse spin 2nd rinse permanent press
  6. spin off 2nd rinse spin off permanent press
  7. rince spin off 2nd rince spin permanent press
  8. spin off 2nd rinse spin off 10 permanent press 6 4
  9. spin off 2nd rinse spin dry permanent press rinse
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A dial on a washing machine with text describing its settings.
  2. A portion of a white washing machine's control knob displays the different wash cycles.
  3. A white dial on a washing machine with grey and black lettering
  4. The upper right corner of a washing machine.
  5. washing machine dial, showing the permanent press cycle.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00005787.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a cell phone on a table.

Visual question: Can you tell what this is

Answers:

  1. beans
  2. yes
  3. beans
  4. can beans
  5. unsuitable
  6. can beans
  7. bushs beans
  8. baked beans
  9. beans
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A can of beans is being held on its side on a wooden table or counter.
  2. A person is holding a can of Bush's beans on a kitchen table.
  3. A picture of the food is on the packaging.
  4. Can of Busch's baked beans on it's side on top of a table
  5. hand holding a can that is light blue and blue letters

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_val_00005034.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black cat is sitting on top of a white toilet, a person's hand is also near the car.
  2. A black cat laying on a toilet and looking at the camera.
  3. A black cat looking up from where it is laying on a closed toilet seat.
  4. A black furry cat with light blue eyes lying on the toilet.
  5. A long haired black cat sitting on the closed lid of a toilet seat in the bathroom.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 43: VizWiz_train_00005004.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on top of a table.

Visual question: What is in this can?

Answers:

  1. unsuitable
  2. unsuitable
  3. cheese
  4. cheese soup
  5. cheese
  6. unsuitable
  7. unsuitable
  8. cheese
  9. unsuitable
  10. soup

Reasons why answers differ:

Image captions:

  1. A can of a food item is sitting on a table.
  2. A can of soup on a brown table.
  3. A red can of some type of food sitting on a wooden table.
  4. can of food on a table next to a laundry machine
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 44: VizWiz_train_00014807.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cat ' s head.

Visual question: Describe the picture.

Answers:

  1. girl sleeping
  2. back head
  3. blurry
  4. hair
  5. girls head
  6. girl black hair blue shirt
  7. unsuitable
  8. hair
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A child's long brown hair is put up in a hair tie.
  2. A woman has her hair tied up in a bun
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. someone is laying down with a bunch of hair on them

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 45: VizWiz_train_00017432.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a keyboard on a table.

Visual question: What product is this?

Answers:

  1. book
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. coffee
  6. unsuitable
  7. unanswerable
  8. coffee
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. Black wording adorns the outside of a silver package that rests on a brown table.
  2. book end with blurred printed words, looks folded
  3. Quality issues are too severe to recognize visual content.
  4. The side of a plastic gray container showing its instructions in Black
  5. The side of some type of tube that is in front of a wood table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_train_00000313.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A living room with a couch and a television.

Visual question: What is in front of me and the wall?

Answers:

  1. stand
  2. television
  3. looks like entertainment center tv
  4. tv stand
  5. rug
  6. entertainment center
  7. entertainment center
  8. entertainment center television
  9. dog bed
  10. entertainment center

Reasons why answers differ:

Image captions:

  1. A living room with a wooden entertainment center with a TV in it.
  2. A living room with green carpet, a wooden entertainment center that holds a TV and a red and green couch.
  3. A wooden cabinet next to a sofa in a living room.
  4. Living room with green carpet with wooden TV stand.
  5. The inside of a person's living room with a red couch

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 47: VizWiz_train_00007081.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black suitcase sitting on top of a bed.

Visual question: What is it?

Answers:

  1. wallet
  2. wallet
  3. wallet
  4. wallet
  5. wallet
  6. wallet
  7. wallet
  8. wallet
  9. eyeglass case
  10. bag

Reasons why answers differ:

Image captions:

  1. a black colored wallet with a black rope tied around it
  2. A black leather-like wallet wrapped with a black cord sitting on decorative fabric.
  3. A red with a red pattern on the middle and a black wallet tied with a black string made with leather
  4. A wallet has a black string around it.
  5. Leather wallet on top of a table that has tablecloth and placemat

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 48: VizWiz_train_00011392.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a can of food in a refrigerator.

Visual question: What is in this can?

Answers:

  1. unsuitable
  2. unsuitable
  3. cat food
  4. cat food
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. tuna
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A can of some type of food being held up.
  2. A hand holding a canned food product with a white French doors opened in the background.
  3. A man holding a small plastic object container
  4. A person is holding a bottle of water.
  5. A person is holding a can of food that is short.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 49: VizWiz_train_00010187.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A yellow toy bear sitting on top of a table.

Visual question: Just like to know what color this stegosaurus is, as my color recognizers don't seem to pick it up too well.

Answers:

  1. green yellow
  2. yellow little green
  3. yellow body lime green on top armor brown on points
  4. yellow green specks
  5. yellow green
  6. tips brown back green rest body yellow
  7. yellow green
  8. toy
  9. yellow green upper body orange tipped spikes
  10. mostly yellow

Reasons why answers differ:

Image captions:

  1. A green and yellow dinosaur toy standing on a counter with the bathroom in background
  2. A toy dinosaur that has yellow and green colors on it.
  3. a yellow and green plastic dinosaur toy figurine
  4. A yellow and green plastic toy dinosaur with a spiky back
  5. The dinosaur is bright yellow in color and has a spiky horn going across its back.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 50: VizWiz_train_00002632.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A painting of a wall with graffiti on it.

Visual question: What note is it?

Answers:

  1. 5 pounds
  2. 5 pounds
  3. pound
  4. 5 pounds
  5. 5 pounds
  6. 5 pounds
  7. 5 pounds
  8. doller
  9. 5 pounds
  10. england 5 pounds

Reasons why answers differ:

Image captions:

  1. A British pound note is held up in front of a table.
  2. A currency from England that says 5 Pounds.
  3. A foreign currency is held up in front of a table.
  4. A Paper bill worth Five European dollars that has a rectangular shape
  5. five English pounds bill displayed in front of a water bottle

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Showing images 0 - 0 out of 0 matching images.