Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00001964.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A blurry picture of a person on a couch.

Visual question: What is in this picture?

Answers:

  1. table
  2. unsuitable
  3. table
  4. dog
  5. table clothes dishes
  6. cat
  7. unanswerable
  8. unanswerable
  9. room
  10. blanket 2 people box baby bottle

Reasons why answers differ:

Image captions:

  1. A plate of food is on a wooden table next to a bottle and napkin,
  2. a table top with a bunch of random stuff on it
  3. A white bag is on the kitchen table by the plate.
  4. A wooden table with things on top of it like a baby's bottle and a plate.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 2: VizWiz_train_00007058.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of food.

Visual question: Please can you tell me what this product is?

Answers:

  1. white rolls
  2. ready to bake white rolls
  3. dough
  4. rolls
  5. fresh dough
  6. white rolls
  7. dough
  8. dough for crusty white rolls
  9. jus roll bake fresh dough
  10. canned rolls

Reasons why answers differ:

Image captions:

  1. A box of easy open bake-able biscuits is on it's side on the table.
  2. A can of refrigerated dough for six rolls with picture of rolls on front.
  3. a container of jus rol bake it fresh dough
  4. a round container of biscuit dough in front of a water pitcher.
  5. Tasty and sweet Jus-Rol dough and this is have six crusty white rolls.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_train_00000804.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of some food on a table.

Visual question: What is this?

Answers:

  1. pineapple
  2. pineapple
  3. pineapple
  4. pineapple
  5. pineapple
  6. pineapple
  7. top pineapple
  8. food
  9. pineapple
  10. pineapple

Reasons why answers differ:

Image captions:

  1. A close-up of the golden top of a pineapple shown on a kitchen counter or table
  2. A piece of fruit is on top of a table.
  3. A pineapple sitting sideways on a table, shot with the top of the pineapple in the center of the image
  4. A yellow fruit is on top of a wooden surface.
  5. Top of a very ripe pineapple where the leaves have turned brown.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 4: VizWiz_train_00023297.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 5: VizWiz_val_00006236.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. What looks like a wall or some other green surface.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 6: VizWiz_train_00006357.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book that is sitting on the floor.

Visual question: Can you determine what kind of hot cereal this might be? Thanks for your help.

Answers:

  1. apples cream
  2. oatmeal
  3. no
  4. regular
  5. no show other side
  6. strawberry
  7. strawberry oatmeal
  8. food
  9. unsuitable
  10. oatmeal

Reasons why answers differ:

Image captions:

  1. A packet of a mixer for liquid has instructions.
  2. a single packet of food with preparation directions on it
  3. A single-serving pouch of oatmeal with cooking instructions on the package.
  4. A small package of microwavable oatmeal is located on a counter.
  5. Tan colored with red letters of oatmeal packet on a granite style countertop.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_train_00023089.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up of light colored, wooden plywood.
  2. A wooden floor or some kind of wood paneling that has a large crack between the pieces, could be a table with an extension as well.
  3. Pale colored wooden boards sitting next to each other.
  4. Quality issues are too severe to recognize visual content.
  5. Two boards made of wood to make some kind of flooring or platform

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 8: VizWiz_train_00002240.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book that is sitting on a table.

Visual question: What is this?

Answers:

  1. this listen free
  2. advertisement
  3. channel lineup
  4. radio station guide
  5. sirius channel guide
  6. sirius radio listen free chart
  7. television channel guide
  8. list cable music channels
  9. radio channel guide
  10. sirux xm channel card

Reasons why answers differ:

Image captions:

  1. A list of free songs to listen in a songbook
  2. A menu of free radio stations for the dates November 3rd thru November 17th.
  3. A partial channel listing for Sirius music stations
  4. screen for sirius radio showing the listings available.
  5. Sirius radio genre channel menu on wood table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 9: VizWiz_val_00000544.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Is this?

Answers:

  1. laptop
  2. computer screen
  3. screen
  4. unanswerable
  5. computer
  6. monitor screen
  7. unanswerable
  8. unanswerable
  9. computer screen
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue screen with a pop up window on it.
  2. A computer screen showing some sort of program active
  3. A Windows blue screen with some Piriform software open.
  4. An error message popup on a Windows-based computer.
  5. an open dialogue window on a windows computer screen

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_train_00014451.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a white water.

Visual question: Can you tell the name of this eye-drop?

Answers:

  1. no
  2. no
  3. unanswerable
  4. unanswerable
  5. no
  6. unanswerable
  7. unanswerable
  8. no
  9. no
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A plastic container of topical ophthalmic antibiotics displayed on a counter.
  2. A white bottle of pills with black lettering on the side.
  3. Quality issues are too severe to recognize visual content.
  4. Small white bottle on its side that is on a white surface
  5. The bottom of a plastic bottle of medicine sits on a white surface.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 11: VizWiz_train_00009587.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person wearing a bag with a black jacket.

Visual question: What this?

Answers:

  1. brown leather coat
  2. coat
  3. leather jacket
  4. jacket
  5. jacket
  6. leather jacket
  7. jacket
  8. leather jacket
  9. jacket
  10. jacket

Reasons why answers differ:

Image captions:

  1. A brown bag is on the center of a green surface.
  2. A brown leather jacket hanging on a yellow hook next to a green wall.
  3. A leather jacket hanging from a yellow hook.
  4. brown leather jacket hanging from yellow peg in front of a green wall
  5. I see a brown leather bomber jacket handing on a hook.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 12: VizWiz_train_00022038.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a picture of an entertainment set up in a home
  2. A TV screen is sitting on a wood TV stand underneath a wall clock.
  3. In a white living room, a TV is on.
  4. Older box TV on sitting on wood cabinet with glass doors and drawers.
  5. Television sitting on a wooden cabinet with glass doors at each side.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 13: VizWiz_val_00004053.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a black and white toilet.

Visual question: What number is this?

Answers:

  1. 26
  2. 26
  3. 26
  4. 26
  5. 26
  6. paper
  7. 26
  8. 26
  9. 26
  10. 26

Reasons why answers differ:

Image captions:

  1. a card is printed with the number "26"
  2. A sheet of paper with the number 26 printed on it in large font.
  3. A very close shot of a table number at a restaurant.
  4. A white object that has a number two and six in black.
  5. a white, flat object with the number twenty six printed on the front in large, black font

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 14: VizWiz_train_00023138.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black box of a branded frozen pizza with large lettering.
  2. A box of thin and crispy New York pizza is sideways on a table.
  3. A pizza box with a picture of a pizza and the words, "Edge Thin & Crispy" written on it.
  4. BLACK PIZZA BOX PLACED ON A COUNTER TOP
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 15: VizWiz_train_00016041.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wall next to a window.

Visual question: Can someone tell me the content of this tin?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. no
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A green framed document is displayed against a beige background
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 16: VizWiz_train_00006304.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: What is the title of this cd?

Answers:

  1. doris day
  2. unsuitable
  3. unanswerable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. boy
  8. unsuitable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A picture has a woman with wavy hair in it.
  2. a side view photo of a white woman
  3. A very wonderful view and worth seeing at all times, my friend
  4. picture or postcard with pink wall and blue curtain in the background
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 17: VizWiz_train_00023270.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A brown and white dog laying with his head down on a striped blanket.
  2. A brown dog sleeping on a colorful blanket.
  3. a long haired brown and white dog lying on a towel
  4. BROWN AND WHITE DOG LAYING ON A BLANKET
  5. The dog is laying on it's side sleeping on a bright orange and white striped blanket.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 18: VizWiz_val_00007186.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A brown leather bag, satchel, or holster of some kind.
  2. A brown medieval equipment it has a shape of a shell with threads around it
  3. A cone-shaped pouch in a dark grey material with a metallic blue lining and cord is lying on a white tile floor.
  4. A dark, crumpled cone-shaped object on a tile floor.
  5. an object looks decorative and it is on the floor

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 19: VizWiz_val_00005667.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black keyboard mouth with white lettering on the top.
  2. A black Logitech brand wireless wheel mouse sitting on a red and white surface
  3. A black, Logitech computer mouse that has no wire
  4. A computer mouse is placed on a red surface and it is from a logitech brand.
  5. Wireless logitech mouse with scroll wheel on red mouse pad.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 20: VizWiz_train_00005987.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a vase on a table.

Visual question: What is this item?

Answers:

  1. unanswerable
  2. diffuser
  3. vase
  4. incense
  5. air freshener
  6. vase
  7. vase
  8. vase
  9. vase
  10. decor

Reasons why answers differ:

Image captions:

  1. A box for a new vase has text and images printed on it.
  2. In this is a picture of a image of a magazine
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Sandal color flower vash is on a tray and near a small glass tumbler.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 21: VizWiz_train_00015857.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cat ' s head.

Visual question: What is it, again?

Answers:

  1. unsuitable
  2. blanket
  3. unanswerable
  4. unsuitable
  5. mattress pad
  6. blanket
  7. blanket
  8. blanket
  9. mattress
  10. blanket

Reasons why answers differ:

Image captions:

  1. A comforter or other type of quilted fabric.
  2. A white comforter has stitching in a straight line
  3. Picture is an up close view of white fabric.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 22: VizWiz_train_00013512.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wooden table with a mirror on it.

Visual question: What is this?

Answers:

  1. napkin holder napkins
  2. napkins
  3. filled napkin holder
  4. napkins in napkin holder
  5. napkin holder
  6. napkins
  7. paper napkins in holder
  8. napkin holder
  9. napkin holder napkins
  10. napkins

Reasons why answers differ:

Image captions:

  1. A metal napkin holder is sitting on a table with napkins.
  2. A metal napkin holder that has square white napkins in it on top of a wooden table
  3. a silver flower napkin holder with white napkins in it
  4. Napkins are on display in a silver napkin holder on the table.
  5. White napkins inside of a napkin stand on a brown table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 23: VizWiz_val_00000756.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What room number is that?

Answers:

  1. 1503c
  2. door sign
  3. 1503c
  4. 1503c
  5. 1503c
  6. 1503c
  7. 1503c
  8. 1503c
  9. 1503c
  10. 1503c

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a sign on a great room door with the room number and indicates its in use
  2. appears to be a picture of a paper saying great room
  3. Hotel room that has a signpost outside the door that says 1503C Great Room, In use.
  4. Steel plaque for a hotel meeting room hanging on a wooden door.
  5. The sign on the door says great room and has the daily schedule listed.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 24: VizWiz_train_00023931.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: How much property do I have?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unsuitable image
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unsuitable image
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 25: VizWiz_train_00000600.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a blue and white luggage in the window.

Visual question: What do you see on the windows boot up?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. nothing
  6. machine
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. some sort of backpack or bag that is very blue
  3. Some type of TV screen with a USB and a memory stick plugged into it, underneath is a blue bag
  4. The corner of a screen, with some cords plugged into something and a blue bag behind it.
  5. Unidentified electronic device with USB cord and thumb drive connected next to vinyl bag with padded arm strap.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 26: VizWiz_train_00018217.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: Is this box right side up?

Answers:

  1. yes
  2. yes
  3. yes
  4. yes
  5. yes
  6. yes
  7. purex
  8. yes classic purex
  9. yes
  10. yes

Reasons why answers differ:

Image captions:

  1. A blue and yellow box of Classic Purex Renuzit.
  2. A blue, white and yellow box labeled Classic Purex Renuzit with directions listed on the bottom.
  3. A box of Purex cleaning product with a blue label.
  4. Blue box of classic Purex on a brown table
  5. Pictured is part of a box of Purex laundry detergent.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 27: VizWiz_val_00001462.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a white car.

Visual question: What color is the car in this photograph?

Answers:

  1. white
  2. white
  3. white
  4. white
  5. white
  6. white
  7. white
  8. white
  9. white
  10. white

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A garage floor with a white vehicle on top of it.
  2. A wheel and wheel well of a vehicle.
  3. A white car is parked over a crack in the pavement.
  4. An image of a white vehicle showing a partial tire.
  5. The bottom of a white car showing the bottom of the tire and door.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 28: VizWiz_train_00012747.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A chair sitting on top of a wooden floor.

Visual question: What is this?

Answers:

  1. chair
  2. office chair wheels
  3. chair
  4. turnable chair
  5. chair
  6. chair
  7. office chair
  8. office stool
  9. chair
  10. stool

Reasons why answers differ:

Image captions:

  1. A gray doctors chair is shown in a waiting room against a white wall with a dark door in the background
  2. A grey circular barstool on wheels that has a silver colored frame
  3. A lone gray stool chair leaned against the white wall.
  4. A rolling chair against the wall with a door in the background.
  5. Adjustable Doctors chair with wheels on tile floor

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00001706.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup on a table.

Visual question: What is the price?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. 10 dollars
  5. unanswerable
  6. 0
  7. cup
  8. unanswerable
  9. unanswerable
  10. no prica

Reasons why answers differ:

Image captions:

  1. A plum colored travel coffee mug with blue floral-like design on a granite countertop with a blue audio device and a piece of paper.
  2. A purple container sitting on a countertop with several items in the background
  3. a thermos that's red and has blue floral designs around the bottom
  4. A travel coffee cup that is maroon and gray that is sitting on a table.
  5. An purple and grey coffee much with blue flowers swirling around the cup, sitting on granite countertops.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 30: VizWiz_train_00010659.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man sitting under a bed with a light.

Visual question: What colour are these socks? Please describe.

Answers:

  1. black
  2. black
  3. unsuitable
  4. black
  5. black blurry
  6. lack
  7. black
  8. black
  9. black
  10. black

Reasons why answers differ:

Image captions:

  1. A black object is on top of a wooden counter in a room.
  2. A black sock is resting on a counter top.
  3. Individual with a checkered shirt with an article of clothing on the counter with a band near the top of it.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 31: VizWiz_val_00006598.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue, red and white plaid fabric with white fringe.
  2. A green and red blanket with frilly white lace on the edges.
  3. A piece of blue and red clothing is on top of a bed inside.
  4. A plaid design piece of clothing with a white lace running through it.
  5. appears to be a picture of colored fabric

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 32: VizWiz_train_00018032.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black sky with a dark background on it.

Visual question: What color is this bag?

Answers:

  1. image blank
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. black
  9. nothing
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A black picture with no light images showing.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 33: VizWiz_train_00009744.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a small dog.

Visual question: What color is this pepper?

Answers:

  1. red
  2. orange
  3. orange
  4. red
  5. orange
  6. orange
  7. red
  8. red
  9. orange
  10. orange

Reasons why answers differ:

Image captions:

  1. A hand holding a shiny orange red pepper.
  2. A human hand is holding a large orange bell pepper.
  3. A man's hand is holding a bright orange pepper.
  4. a person holding a large orange bell pepper
  5. A person is holding a red/orange pepper in their hand, which appears to be large in size.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 34: VizWiz_val_00002650.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue blanket with a black background.

Visual question: What is this floor made out of?

Answers:

  1. carpet
  2. carpet
  3. carpet
  4. carpet
  5. carpet
  6. unanswerable
  7. carpet
  8. unanswerable
  9. agtfsfd
  10. carpet

Reasons why answers differ:

Image captions:

  1. A blue cushion is laying on a light brown couch.
  2. An area rug in shades of brown is shown in front of piece of brown fabric furniture.
  3. Blue jeans next to a brown couch with shoes on
  4. Quality issues are too severe to recognize visual content.
  5. The blue comforter is next to the brown carpet on the floor.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 35: VizWiz_train_00019800.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person ' s hand is holding something.

Visual question: What is the name of this bottle?

Answers:

  1. 0
  2. unsuitable
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. not clear image
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. lysol

Reasons why answers differ:

Image captions:

  1. A close up of a hand holding onto something long and purple and shiny.
  2. A hand appears to be holding a cup.
  3. A hand holding a light purple glass or can.
  4. a person that is holding something next to a white wall
  5. I am looking at a picture of a person holding a purple object.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 36: VizWiz_train_00009794.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bright white sky.

Visual question: What's this about?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 37: VizWiz_train_00020353.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A purple box that says Yahoo Korea on it
  2. A purple laptop or notebook from Yahoo Korea
  3. A purple rectangular box is lying on a brown table.
  4. A purple Yahoo! branded box is resting on a wooden table.
  5. A rectangular purple box with the Yahoo logo is sitting a wooden table

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_train_00004145.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a book.

Visual question: Could you please tell me what that is? Thank you.

Answers:

  1. can
  2. beans
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. can
  7. unsuitable
  8. soup
  9. unanswerable
  10. can soup

Reasons why answers differ:

Image captions:

  1. A hand holding a can of food with the nutritional label shown.
  2. Hand holding a package of nuts and showing the back label with the nutrition facts
  3. the back of a can of food showing nutritional information
  4. The nutrition facts from a can of food.
  5. The side of a can of progresso soup where the nutrition facts are displayed.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00011125.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s face.

Visual question: What color is this?

Answers:

  1. white
  2. white
  3. white
  4. black brown tan
  5. white
  6. brown
  7. white brown black
  8. grey
  9. bed
  10. white

Reasons why answers differ:

Image captions:

  1. A gray piece of fabric has a person's hand shake on it.
  2. A shadow of a person holding a camera over a grey blanket.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. white and brown fabric that looks like a shirt or a sheet.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 40: VizWiz_val_00002700.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a building with a light.

Visual question: What flavor is this energy bar?

Answers:

  1. unsuitable
  2. cream
  3. unsuitable
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. pink

Reasons why answers differ:

Image captions:

  1. a container/ box / bottle that contains liquid / goods.
  2. A product package, tubular in shape, with blurry black text and a UPC symbol on it.
  3. A white packet is placed on a piece of cloth.
  4. The back of a bar wrapper laying on a rug.
  5. The back of a candy bar where ingredients are.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00020207.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A hand is holding a coke can and there are blue curtains and a computer screen in the background.
  2. A PERSON HOLDING A SMALL RED SODA CAN
  3. A person's hand holding up a can of Coca Cola with a blue curtain in the background and other furniture items
  4. Caucasian right hand holding a coca cola aluminum can in front of a blue curtain.
  5. Hand holding a can of Coca Cola, in front of a curtained window

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_train_00003340.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a picture of a sign.

Visual question: What is the code please?

Answers:

  1. 633
  2. 633
  3. 633
  4. 633
  5. 633
  6. 633
  7. 633
  8. 633
  9. 633
  10. 633

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A phone screen with a word verification box in the center.
  2. A screenshot of a smartphone that says 633 Truecaller.
  3. A screenshot of an Arabic language app with numbers
  4. imagine how you would describe this image on the phone to a friend.
  5. Truecaller Arabic name and the number 633 mobile phone screen shot.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_train_00020599.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A document that needs to be signed and read is displayed.
  2. A form for the Office of Disability Services that requires filling out.
  3. An unfilled out form regarding requirements for disabilities.
  4. Quality issues are too severe to recognize visual content.
  5. verification statement for disability testing to be signed by authorized professional

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 44: VizWiz_train_00018803.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A remote control sitting on a red couch.

Visual question: What is this?

Answers:

  1. sony remote control
  2. remote
  3. remote control
  4. remote control
  5. sony remote control
  6. remote control
  7. sony remote control
  8. remote control
  9. remote
  10. remote control

Reasons why answers differ:

Image captions:

  1. a black and silver Sony remote control with black and colored buttons
  2. a large black Sony brand television remote on a red chair
  3. A Sony brand remote control on a red leather couch.
  4. A very wonderful view and worth seeing at all times, my friend
  5. black rectangle with buttons laying on a red bean bag

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 45: VizWiz_val_00004425.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A partial Nestle cookie or brownie mix box is being held up for a photograph.
  2. A box of cookie and brownie mix, the brand is nestle toll house.
  3. A yellow and brown box of Nestle Cookie-Brownie Delights
  4. Nestle Toll House cookie package kit partially visible
  5. The front of a package of Nestle Toll house brownie cookie mix that is 4 pounds and 10 ounces in weight.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_train_00018827.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a refrigerator on a wall.

Visual question: What flavor is this?

Answers:

  1. pink lemonade
  2. pink lemonade
  3. pink lemonade
  4. pink lemonade
  5. pink lemonade
  6. pink lemonade
  7. pink lemonade
  8. pink lemonade
  9. pink lemonade
  10. pink lemonade

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A package of pink lemonade kool aid sits on a white cabinet.
  2. A package of pink lemonade Kool-Aid is the image.
  3. A pink lemonade Kool-Aid branded drink mix packet.
  4. one packet of Kool Aid brand drink mix
  5. Unopened package of Pink Lemonade flavored Kool-Aid powder.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_val_00003467.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What is the flavor of this bag of noodles?

Answers:

  1. chicken
  2. creamy chicken
  3. creamy chicken
  4. creamy chicken
  5. creamy chicken
  6. creamy chicken
  7. chicken
  8. creamy chicken
  9. creamy chicken
  10. creamy chicken

Reasons why answers differ:

Image captions:

  1. A package of Chicken ramen soup with noodles and chicken displayed on the pack
  2. A package of instant ramen noodle soup is unopened.
  3. A packet of food stuff is quite attractive as it is printed on the packet.
  4. A packet of creamy chicken flavored instant ramen soup.
  5. Package of a microwaveable noodle food item on a white counter.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00023330.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A can of snuff that is sitting on a bed and has the brand name of the snuff.
  2. A canister of Copenhagen dipping tobacco on a fabric surface.
  3. a copper Copenhagen lid on a fabric surface
  4. Can of chewing tobacco on off white fabric
  5. SMALL ROUND CONTAINER SITTING ON A COUNTER TOP

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00011617.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cat that is sitting on the floor.

Visual question: What color is my dog?

Answers:

  1. brown white
  2. brown white
  3. white brown
  4. tan white
  5. white brown grey
  6. light brown white
  7. tan white dark brown
  8. tan white
  9. brown white tan
  10. tan

Reasons why answers differ:

Image captions:

  1. A brown/white Pomeranian dog sitting on yellow table.
  2. A small brown and tan dog standing on the table
  3. A small dog is being handled by a person.
  4. a small dog sitting on a yellow surface
  5. A small dog with lots of fur sitting on a table with plants in the background

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 50: VizWiz_train_00022118.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Showing images 0 - 0 out of 0 matching images.