Showing images 1 - 50 out of 50 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00022188.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 2: VizWiz_train_00013370.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a laptop on a table.

Visual question: What is this color?

Answers:

  1. unanswerable
  2. white
  3. red white
  4. red
  5. red white
  6. red white
  7. red
  8. red white
  9. red
  10. red

Reasons why answers differ:

Image captions:

  1. A ace of hearts playing card on a black and white background
  2. A very fresh Ace of hearts playing card.
  3. Ace of hearts playing card on embroidery background
  4. An ace of hearts playing card laid face up.
  5. An Ace of Hearts playing card lying on a table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 3: VizWiz_train_00012244.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A couch that is laying on the floor.

Visual question: What is this?

Answers:

  1. couch
  2. dog biscuit
  3. dog bone
  4. dog treat
  5. dog bone
  6. dog bone
  7. dog bone
  8. dog bone next to couch
  9. bone
  10. dog bone

Reasons why answers differ:

Image captions:

  1. A beige couch with a bright carpet and a brown dog bone.
  2. A living room area showing a coach and a piece of bone for dogs.
  3. A medium sized brown dog toy on the floor next to a couch
  4. a room with various pieces of furniture in it and a dog bone on the floor
  5. The side of a grey Living room couch, and a dog chewy on the carpet.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 4: VizWiz_val_00003645.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue and white sign.

Visual question: What does this label read?

Answers:

  1. 5th season
  2. 5th season
  3. spice
  4. wefwef
  5. seasoning
  6. 5th season
  7. 5th season
  8. move camera back i can only see that 5th season brand
  9. 5th season
  10. 5th season

Reasons why answers differ:

Image captions:

  1. 5th season food product that could be food seasoning
  2. A blue container of 5th season brand seasoning, being held in someone's hand.
  3. A spice shaker bottled from the brand 5th season.
  4. Photo is of a product from the 5th Season brand.
  5. Small plastic blue bottle with white and blue text on it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_train_00019413.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a cell phone next to a building.

Visual question: Hi, Can you tell me what this is? thank you again you've been very helpful.

Answers:

  1. unsuitable
  2. unsuitable
  3. electronic device
  4. unsuitable
  5. unsuitable
  6. no
  7. unsuitable
  8. hide key
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A hand and an unopened pack of a USB charger are on the tile floor.
  2. A package with a phone charger inside of it.
  3. Package containing a multiple power source USB charger.
  4. Pictured is a USB charger in the middle of the floor.
  5. Unopened package for a multiple power source USB charger lying on a white tiled floor

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_val_00006245.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black machine has a pause and play button as well as volume buttons.
  2. A close up of the volume buttons on a remote control.
  3. A close-up of an old audio equipment remote control with black plastic buttons.
  4. a remote controller for electronics with many command buttons on it
  5. The brown and silver hand-held control for a music device that plays tapes and cd is shown.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_train_00017791.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a laptop on a table.

Visual question: What is this green?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. you sure you want to exit startup restart your computer
  7. unanswerable
  8. paper
  9. yes to exit
  10. no

Reasons why answers differ:

Image captions:

  1. A computer screen made by Lenovo has a window open.
  2. a few pop ups on a computer screen with the mouse in the middle
  3. a Lenovo brand laptop is displaying a pop up on the screen.
  4. a windows screen of a Lenovo laptop with a grey notification
  5. A wonderful view of the fog windows in the room is very thick

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00007769.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a sign.

Visual question: What does this say

Answers:

  1. marshalls
  2. marshalls 6.99
  3. 6.99
  4. marshalls young mens compare at $14.00 $6.99
  5. marshals compare at 14.00 6.99
  6. marshalls young mens 6.99
  7. price tag
  8. marshall compare at $14 $6.99
  9. $6.99
  10. marshalls young mens $6.99

Reasons why answers differ:

Image captions:

  1. A Marshalls price tag for an item of young men's clothing marked down from fourteen dollars to six dollars and ninety nine cents.
  2. A price tag for Marshall's in the young men's department.
  3. a tag for a clothing item from the store Marshalls for
  4. A tag from Marshalls marked six dollars and ninety nine cents is attached to a grey shirt.
  5. I see a tag that is listed as Marshalls six dollars

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 9: VizWiz_train_00019009.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: This is a picture of the wall in the room.

Visual question: What is this item?

Answers:

  1. door
  2. door
  3. door
  4. door
  5. door
  6. door
  7. door
  8. door
  9. door
  10. door

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A brown wooden door in a bathroom is shown.
  2. A closed brown door with patterns is present.
  3. A pale yellow wall and wood trim, and a light colored wood panel door with a gold handle.
  4. A wooden door with a metallic lever handle is surrounded by a white trim.
  5. Wooden door with a silver lever style handle and an off-white frame

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 10: VizWiz_train_00019619.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a purple and white vase.

Visual question: What is this

Answers:

  1. empty cd holder
  2. cd case
  3. cd case
  4. cd case
  5. cd case
  6. cd case
  7. unsuitable
  8. cd holder
  9. cd
  10. empty dvd case

Reasons why answers differ:

Image captions:

  1. A dark purple empty CD case setting on a white counter.
  2. A purple cd case with a clear front.
  3. a purple colored compact disk case placed on a white surface
  4. Plastic CD case with purple layer reflecting light.
  5. some type of CD that is in a case

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 11: VizWiz_train_00015780.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a person laying on a bed.

Visual question: What is this?

Answers:

  1. package bowtie pasta
  2. pasta
  3. pasta
  4. bow tie pasta
  5. pasta
  6. unanswerable
  7. bag pasta
  8. food
  9. bowtie egg noodles
  10. bow tie pasta

Reasons why answers differ:

Image captions:

  1. A clear plastic bag containing bow tie pasta.
  2. A red and green bag of pasta sitting on a grey countertop.
  3. A red and transparent bag of food placed on a grey table.
  4. a snack item which is packed in packet
  5. Pasta is in the bag on the table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 12: VizWiz_val_00004768.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a bright white light with a light blue top right edge
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 13: VizWiz_train_00019783.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a child in a mirror.

Visual question: What is this CD?

Answers:

  1. singer patti page
  2. unanswerable
  3. hard copy
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. artist patti page
  8. unanswerable
  9. unsuitable
  10. unsure

Reasons why answers differ:

Image captions:

  1. A CD of DVD case that looks to be at least 50 years old and
    features a female recording artist
  2. A Patti Paige music CD lying across a man's hairy knees.
  3. CD cover of a female with short blonde hair laying on top of a man's hairy thighs.
  4. I see a singer artist on someone lap
  5. The CD case from a female country singer sits on a man's lap.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 14: VizWiz_val_00005208.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A red Hong Kong company law handbook from 2002.
  2. appears to be a picture of something pink with words
  3. Cover of Butterworths Hong Kong Company Law Handbook
  4. Red cover of a handbook on corporate law
  5. The cover of a red book about company law is shown.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 15: VizWiz_train_00008470.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a painting of a vase.

Visual question: What is the item contained in the box in the picture?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. i dont know
  6. unsuitable
  7. unanswerable
  8. food
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A box with brown, yellow, white and light blue cover.
  2. A square object with colorful brown, blue and green pattern occupies most of the frame.
  3. A surface that is brown and blue and white sitting on a counter.
  4. a yellow brown and blue piece of abstract art
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 16: VizWiz_val_00004856.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A backyard scene with swing set and various toys.
  2. A children's playset, some balls, and some kid size toy cars.
  3. A playground set that is next to a tree.
  4. A swing set and toys appear in a backyard.
  5. Showing the side branches of the tree and background with clouds.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 17: VizWiz_val_00006589.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A box of cereal is on a black counter.
  2. A box of food on a metal table.
  3. A box of Honey Bunches of Oats cereal with a yellow and blue label.
  4. A yellow and blue wrapper with of Oa written on it.
  5. Blue cereal box with oats on top of black table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 18: VizWiz_train_00018357.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A wooden counter topped with lots of food.

Visual question: Can you tell me the layout of this cabinet?

Answers:

  1. seasonings
  2. messy
  3. left celery salt 2 back 1 right oregano behind that mrs dash packet cold brew tea
  4. unanswerable
  5. spices packet tea on shelf
  6. deep
  7. tea bag in front spices wrap from left edge all around back
  8. spices tea
  9. no
  10. table

Reasons why answers differ:

Image captions:

  1. A shelf with various McCormick spices on it
  2. A space full of various spices and a pack of Lipton tea.
  3. A stash of seasonings in a cupboard with a tea bag in the front.
  4. A tea bag is on the table by the spices.
  5. A wooden cabinet shelf with seasonings, celery salt, oregano, Mrs Dash, pepper, lipton cold brew tea bag, and others.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_train_00007015.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man sitting in front of a living room.

Visual question: describe the kitty in the picture

Answers:

  1. unsuitable
  2. unanswerable
  3. very blurry kitty
  4. unsuitable
  5. unanswerable
  6. unsuitable
  7. white grey
  8. white grey
  9. brownish white face
  10. grey white

Reasons why answers differ:

Image captions:

  1. A person is sitting right by the podium in the room.
  2. A room lit by a wall lamp with a doorway and a counter
  3. Picture of a cat and a light fixture in a living room.
  4. The face of a cat who is looking to the left side.
  5. The side of the face of a cat.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 20: VizWiz_train_00007940.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person wearing a hat.

Visual question: What is the name of this DVD?

Answers:

  1. unsuitable
  2. unanswerable
  3. unanswerable
  4. stams
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A close up view of some type of card with people's faces.
  2. a close-up of a Christmas movie is shown, with a picture of a man looking into the distance.
  3. A cover to a DVD movie with houses and people.
  4. A white man is in the foreground and a white woman is in the background.
  5. The front cover of something showing 2 men wearing red clothes

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 21: VizWiz_train_00020455.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black and white piece of clothing with an orange "Puma" label depicting a stylized puma as a logo.
  2. An article of black and white clothing with its Puma brand label showing
  3. An article of clothing with an orange tag from the brand Puma.
  4. An orange PUMA tag on a piece of clothing.
  5. tag from and article of clothing from Puma.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 22: VizWiz_train_00004056.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a coffee cup on a table.

Visual question: Is that a beer or a coke?

Answers:

  1. diet coke
  2. that coke
  3. coke
  4. coke
  5. coke
  6. coke
  7. diet coke
  8. coke
  9. coke
  10. diet coke

Reasons why answers differ:

Image captions:

  1. A white soda can is on top of a black desk.
  2. An open can of diet coke sitting on a brown table.
  3. An open can of diet coke sitting on a black surface.
  4. an open can of diet coke soda pop
  5. I see a can of diet coke on a table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_train_00008948.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person ' s hand holding a remote.

Visual question: At Christmas.

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. brother
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. An electrical equipment or appliance with a label that says Brother on it.
  2. brother model sticker on the bottom of product
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The label on a Brother brand labeling cartridge.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 24: VizWiz_train_00021300.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A green color with black shaded full surface.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Teal blue blur with lighter color on the right side and no images.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 25: VizWiz_train_00019424.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a book that is on the floor.

Visual question: Please could you tell me what's in the packet?

Answers:

  1. food
  2. unanswerable
  3. cookies
  4. gluten free cookies
  5. cookies
  6. cookies
  7. unsuitable
  8. cookies
  9. cookies
  10. no

Reasons why answers differ:

Image captions:

  1. A box of food laying on a blue surface.
  2. a red and green color paper was placed on a blue surface
  3. package of a cookie bake recipe sitting on a blue counter.
  4. Package of gluten free, wheat free, and egg free cookies
  5. packet of free from cookies, on top of a black table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 26: VizWiz_train_00003465.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bed that is on it.

Visual question: What color are the beads in the charm on this necklace?

Answers:

  1. silver
  2. peachy pink
  3. metal
  4. silver
  5. orange
  6. silver
  7. white
  8. silver
  9. unsuitable
  10. silver light purple

Reasons why answers differ:

Image captions:

  1. A beaded necklace with pendant beats a silver and clear
  2. A quilt with a necklace with many beads and a pendant in the center on it.
  3. a silver and white necklace on a bedspread
  4. A silver beaded necklace is on the table.
  5. I see a fancy necklace on top of a bed's comforter.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 27: VizWiz_train_00013898.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a man with glasses and a tie.

Visual question: What design is this?

Answers:

  1. poker
  2. cards
  3. face
  4. poker face
  5. tiki face playing cards
  6. tiki poker
  7. cards
  8. tiki poker cards
  9. poker tiki man
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a black colored article of clothing with a picture of a tiki head and cards on it with text stating poker face puffin' and bluff
  2. a shirt emblem that reads "Poker Face, Puffin bluff"
  3. A shirt with a skeleton face wearing glasses.
  4. A stylized logo with text is displayed on a black t-shirt
  5. Black t-shirt reading 'poker face' and 'puffin bluffin' with a smoking tiki head in the enter

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_train_00012320.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a glass of a white wall.

Visual question: Does this look clean?

Answers:

  1. yes
  2. yes
  3. yes
  4. yes
  5. yes
  6. yes
  7. yes
  8. yes
  9. unsuitable
  10. yes

Reasons why answers differ:

Image captions:

  1. A picture looking down into a bathtub in a house.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. the bottom of a beige bathtub and The tub is empty

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00021828.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A digital control panel showing readouts and buttons.
  2. an electronic fitness device with different number readouts
  3. an old treadmill that shows distance and time information as well as controls for changing speed or workout type
  4. Appliance handle with a monitor that shows numbers and buttons in a room.
  5. The control panel of a treadmill showing several different measurements.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 30: VizWiz_train_00017879.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A room with a bed and a window.

Visual question: Does the purple go with these capris?

Answers:

  1. unsuitable
  2. yes
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. yes
  8. unsuitable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A dark colored side table that contains a porcelain cat, a box of kleenex, and a brown basket of snacks.
  2. A desk with a tissue box, makeup and a ceramic cat.
  3. a person with white pants and a pink top in front of a coffee table
  4. I am looking at a picture of a table in a room with a white ceramic cat under the table.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 31: VizWiz_train_00006194.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of food sitting on top of a table.

Visual question: What is in this can please?

Answers:

  1. unanswerable
  2. cream based soup
  3. food
  4. unanswerable
  5. soup
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. cream based soup
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a can of Campbell's brand chunky soup of unknown variety
  2. A can of soup is on top of the counter.
  3. A can of soup with a red, green and blue label.
  4. A rounded can have different color on it and some printed letters.
  5. Up close of a can of soup with a red and green label.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 32: VizWiz_val_00006261.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black plastic computer keyboard marked with a Dell logo.
  2. A charging cord is laying on the keyboard of a laptop.
  3. A cord is in front of a laptop computer.
  4. dell laptop but half of the monitor is only visible
  5. Pictured is the keyboard of a dell laptop with a cord in front of it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 33: VizWiz_train_00002808.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a remote control controller.

Visual question: What is this item?

Answers:

  1. remote
  2. remote control
  3. remote
  4. remote control
  5. remote control
  6. remote control
  7. remote
  8. remote control
  9. remote
  10. remote

Reasons why answers differ:

Image captions:

  1. A black and grey remote control for a television on a tiled surface
  2. a black remote control on a piece of fabric
  3. A black remote controller with white and black buttons on top of a brown fabric blanket.
  4. A black remote laying on dark tile surface
  5. Black remote control for a TV sitting on a blanket

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 34: VizWiz_train_00003197.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pile of books.

Visual question: Does this box have instructions for 8X8 pan size. If so, can I know what they are?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unsuitable
  5. yes
  6. unanswerable
  7. not shown
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box of Ghirardelli triple chocolate brownie mix.
  2. A box of Ghirardelli triple chocolate brownie mix.
  3. A box of triple chocolate brownie mix made by Ghirardelli
  4. Ghirardelli triple chocolate brownie mix on a counter
  5. Image shows a Ghirardelli Triple Chocolate brownie box.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 35: VizWiz_train_00008168.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person with a light.

Visual question: Caffeinated or decaffeinated coffee?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. caffeinated
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a dark pink object that is shiny and has some sort of decorative bend on the top
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 36: VizWiz_train_00011498.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person laying on top of a bed.

Visual question: What is the brand and name of the set of glasses?

Answers:

  1. anchor
  2. anchor
  3. anchor
  4. anchor
  5. anchor
  6. anchor
  7. anchor
  8. anchor
  9. anchor
  10. anchor

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A anchor glass packet keep in the bed described in the image.
  2. A box of glasses sitting on a beautiful quilt.
  3. A white box with a green anchor logo in the upper left corner, blue lettering spelling Anchor and pictures of a series of glasses.
  4. Box of Anchor brand clear glass drinking glasses, quantity of twelve.
  5. Front view of a box containing Anchor brand see through drinking glasses.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 37: VizWiz_val_00002329.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A can of coffee sitting on top of a table.

Visual question: What kind of soup is this? Thank you.

Answers:

  1. progresso exact flavor cannot be seen
  2. unsuitable
  3. unanswerable
  4. progresso vegetable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. progresso
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a blue can of progresso brand soup with a recipe on it
  2. a blue can of progresso soup of which type cannot be properly identified
  3. A can of Progresso soup with cooking directions on the back label
  4. A can of soup on a counter with the back of the label in view.
  5. Vegetable soup is in the can on the counter

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_train_00008580.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book and a plate of food on a table.

Visual question: What is this?

Answers:

  1. kellogg low fat granola rains
  2. kelloggs low fat granola raisins
  3. granola raisins cereal
  4. cereal
  5. kelloggs low fat granola raisins cereal
  6. granola raisins
  7. granola cereal raisins
  8. granola
  9. cereal
  10. cereal

Reasons why answers differ:

Image captions:

  1. a box o Kellogs low fat granola cereal with raisins
  2. A box of cereal laying down on a wood surface.
  3. A box of Kellogg's Low Fat Granola with Raisins laying flat on a wood surface.
  4. A box of low fat granola cereal lying on a wooden surface.
  5. A box of low fat granola cereal on a wood countertop.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00023089.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up of light colored, wooden plywood.
  2. A wooden floor or some kind of wood paneling that has a large crack between the pieces, could be a table with an extension as well.
  3. Pale colored wooden boards sitting next to each other.
  4. Quality issues are too severe to recognize visual content.
  5. Two boards made of wood to make some kind of flooring or platform

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 40: VizWiz_val_00003608.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A toilet in a pot sitting on the floor.

Visual question: Alright. Now we're recording. I will record a bit and then Robert will record a bit. One, two, three, four, five, six, seven, eight.

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A cat's playing tower is filled with toys next to a cabinet near a door.
  2. A three tier grey and white cat tree lined with grey carpet and white twine
  3. Carpet covered cat scratch toy approximately three feet high with three levels for play.
  4. Large cat tower and also a small cupboard behind it.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 41: VizWiz_train_00015304.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a picture.

Visual question: What is this?

Answers:

  1. unanswerable
  2. food label
  3. magazine
  4. girl
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. unanswerable
  9. dvd
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A photo of a person near some flowers and buds and in an advertisement with the words like floral and creative writing
  2. A screenshot of a female with writing above her head.
  3. A view of a magazine page with a celebrity of some sort.
  4. A woman is seen in front of purple flowers.
  5. Advertisement for 'Getting creative with salmon and floral cocktails'

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_train_00003269.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A red and white sign on a table.

Visual question: How much sodium is in this product?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unsuitable
  7. toppers
  8. unanswerable
  9. ingredients on other side box
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a box of multi grain town house toppers crackers
  2. A package of Town House Toppers is on a table.
  3. A picture of food is on the packaging
  4. blue and red box of multi grain crackers
  5. I see a toppers townhouse with grain

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_train_00014825.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of wine.

Visual question: What is this?

Answers:

  1. mans body wash
  2. axe
  3. axe phoenix body wash
  4. axe
  5. soap
  6. unsuitable
  7. axe body spray
  8. axe deodorant
  9. deodorant
  10. axe phoenix body wash

Reasons why answers differ:

Image captions:

  1. A black bottle of an Axe body wash.
  2. A black, white and blue bottle of AXE Phoenix.
  3. Are you black ax bottle showing the ingredient label
  4. some type of liquid that is in a container
  5. The back of a bottle of AXE Phoenix sitting on a table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 44: VizWiz_train_00002562.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on top of a table.

Visual question: What is the product contained in the bottle in the picture?

Answers:

  1. pledge revitalizing oil
  2. pledge oil
  3. revitalizing oil
  4. pledge
  5. furniture polish
  6. revitalizing oil
  7. revitalizing oil
  8. cleaner
  9. pledge
  10. pledge cleaning oil

Reasons why answers differ:

Image captions:

  1. a bottle of pledge dishwashing soap on top of a wooden surface
  2. A bottle of Pledge revitalizing oil is sitting on the floor in front of some objects.
  3. A container of orange pledge revitalizing oil orange scented
  4. An orange bottle of wood cleaner with orange oil.
  5. Cleaning solution in a bottle that is on a desk.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 45: VizWiz_train_00018070.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a sign on the wall.

Visual question: What does this say?

Answers:

  1. infestation program manual
  2. manifestation program manual
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. something station program manual
  8. unsuitable
  9. manifestation program manual
  10. festival program manue

Reasons why answers differ:

Image captions:

  1. A book stating it's a program manual of some sorts.
  2. A gift card with a colorful painting lined with a pink border
  3. A manual for some kind of program with an abstract painting on the cover.
  4. A manual with a water colored front sits on a fabric area.
  5. A pink book that is a manual for learning

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_train_00012318.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on the wall.

Visual question: Hi, I don't need you to read this I just need to know if this document is right side up or not. Thank you.

Answers:

  1. upside down
  2. no
  3. upside down
  4. upside down
  5. upside down
  6. not right side up
  7. upside down
  8. not
  9. upside down
  10. upside down

Reasons why answers differ:

Image captions:

  1. A partial page of text with some sentence fragments describing musical notation like "F minor."
  2. Close up of a white paper print out with black text printed on it.
  3. part of a page from a book, white with black text, discussing music.
  4. Quality issues are too severe to recognize visual content.
  5. Text written on white paper in a foreign language I can't read.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_val_00005288.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a packet of beef stroganoff shown in the image
  2. A skillet meal box is laying on a grey surface.
  3. Great Value brand of Beef Stroganoff Skillet Meal
  4. In the image there is a box of beef stroganoff.
  5. some type of box of beef stroganoff that is not opened

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 48: VizWiz_train_00021041.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a close up picture of what looks like a white and brown puppy dog
  2. A cute puppy is napping on the floor next to his toys.
  3. A dog laying on the carpet with an orange stuffed animal besides.
  4. a white and brown haired dog on a carpeted area
  5. A white/brown dog lying down on the carpet.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 49: VizWiz_train_00007680.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is next to a wall.

Visual question: What is this?

Answers:

  1. barcode on can
  2. unanswerable
  3. unanswerable
  4. upc
  5. jar
  6. sriracha sauce
  7. bar code #0 24463 06116 3
  8. tomato sauce
  9. bar code
  10. qr code

Reasons why answers differ:

Image captions:

  1. a barcode from a bottle of sriracha that is half empty
  2. A bottle of red sauce displays nutrition facts
  3. Close up of a Sriracha brand plastic bottle of hot sauce.
  4. The barcode on the side of a bottle of some kind of food product is displayed.
  5. the UPC code of some kind of red condiment, label reads 24463 06116

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 50: VizWiz_val_00004022.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white toilet in a room.

Visual question: What is this? What color of fish, please. Thank you.

Answers:

  1. unanswerable
  2. white
  3. white sheet
  4. white sheet no fish
  5. unanswerable
  6. white sheet
  7. sheet
  8. white fabric
  9. white fabric
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A crumpled up white blanket is being shown.
  2. A white shirt with lines running through it
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. White sheet unfolded on the ground underneath other white background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Showing images 1 - 50 out of 50 matching images.