Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_val_00005391.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A can of food has ingredients and a nutrition label.
  2. A can of food is on top of a table.
  3. A can of salmon on a wooden shelf.
  4. A small can of pink salmon with French language nutrition information
  5. Can of food product on its side with green, red and white label with gold and black writing.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 2: VizWiz_train_00020297.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A 20 oz box containing Stouffer's brand macaroni and cheese.
  2. A box of Stouffer's frozen macaroni and cheese.
  3. A red box of macaroni left on a table
  4. A Stouffer's macaroni and cheese microwave dinner on a counter top.
  5. Stouffer's brand single serving package of mac and cheese

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_train_00011735.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup on a table.

Visual question: What is that?

Answers:

  1. bottle vitamins
  2. sports drink bottle bottle fiber gummies
  3. fiber gummies gatorade bottle
  4. bottle
  5. plastic bottle
  6. gatorade bottle
  7. fiber gummy vitamins
  8. bottle
  9. gatorade bottle
  10. bottle

Reasons why answers differ:

Image captions:

  1. A bottle of Fiber gummies and a bottle of Gatorade sitting on a white counter.
  2. a bottle of gatorade already started and a bottle of fiber advance
  3. A bottle of gummy vitamins and a Gatorade bottle
  4. A drink bottle and a bottle of Fiber Advance gummies are sitting on a white counter.
  5. Two bottle jar on the table on bottle was include some toys

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 4: VizWiz_train_00002159.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on a table.

Visual question: Can you tell what flavor this is?

Answers:

  1. extra sweet tea
  2. sweet tea
  3. sweet tea
  4. extra sweet tea
  5. ultra sweet tea
  6. extra sweet tea
  7. extra sweet tea
  8. sweet tea
  9. extra sweet tea
  10. sweet

Reasons why answers differ:

Image captions:

  1. A brown bottle of extra sweet tea on a white flat surface.
  2. A large bottle of sweet tea laying sideways on a white surface.
  3. A photo of Tradewinds extra sweet tea bottle on a counter top.
  4. An image of a bottle of dark tea.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_val_00001785.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pink wall with a purple background.

Visual question: What color is this shirt?

Answers:

  1. red
  2. red
  3. red
  4. red
  5. red
  6. red
  7. red
  8. red
  9. pink
  10. red

Reasons why answers differ:

Image captions:

  1. A bright pink polo shirt has a neck placket.
  2. A person is wearing a pink shirt with a collar.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 6: VizWiz_train_00005814.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person standing in front of an orange frisbee.

Visual question: The container in the picture contains body butter. What flavor is the body butter?

Answers:

  1. satsuma beurre corporel
  2. satsuma
  3. orange
  4. madarian oranges
  5. satsuma
  6. unanswerable
  7. satsuma beurre corporel
  8. satsuma
  9. peach
  10. satsuma

Reasons why answers differ:

Image captions:

  1. A container sitting on the edge of a partial sink that says body butter on it.
  2. A round tub of body butter rests on a counter with a shell-shaped sink.
  3. Satsuma Body Butter on a counter with tile floor showing.
  4. Satsuma body butter round container on white sink counter.
  5. Someone is standing next to a bathroom sink, with a round orange container of body butter.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_train_00014737.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a book on a table.

Visual question: What candy is this?

Answers:

  1. tea candy
  2. tea candy
  3. tea candy
  4. tea candy citrus green tea
  5. tea candy
  6. unsuitable
  7. tea candy
  8. tea candy
  9. tea candy
  10. packet

Reasons why answers differ:

Image captions:

  1. A box of tea candy citrus flavor has a orange box.
  2. a box of tea candy citrus green tea kind sitting on a white carpet
  3. A box of tea candy with some citrus green tea.
  4. A yellow box of all natural citrus green tea.
  5. a yellow package of tea with black writing on it

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00000974.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person using a keyboard.

Visual question: What are these buttons labeled as?

Answers:

  1. unanswerable
  2. letters numbers
  3. cell phone keypad
  4. unsuitable
  5. alphabet
  6. qwerty keyboard
  7. letters
  8. qwerty keyboard
  9. unsuitable
  10. character

Reasons why answers differ:

Image captions:

  1. A phone displayed horizontally with a small black keyboard with white letters.
  2. A picture of a hand holding a small black keyboard.
  3. A small keyboard is black with white and blue characters.
  4. A small wireless keyboard the size of a small cell phone is held in a hand.
  5. half of a black computer QWERTY keyboard in someone's hands

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 9: VizWiz_train_00022902.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue container with red coloring and white lettering sits on a wooden table.
  2. a container/ box / bottle that contains liquid / goods.
  3. A picture of the food is on the packaging.
  4. Front side of a cardboard box filled with milk
  5. I see product packaging with a glass of milk and a cow on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_train_00015724.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a car in the oven.

Visual question: What time is it? I need to know now. I need to know now. What time is it?

Answers:

  1. 11:26
  2. 11:26
  3. 11 26
  4. 11:26
  5. 11:26
  6. 11:26
  7. 11:26
  8. 11:26
  9. 1126
  10. 11:26

Reasons why answers differ:

Image captions:

  1. Alarm clock showing 11 26 sitting on a table
  2. Alarm clock with 11:26 on it sitting on top of a table
  3. An alarm clock is upside down on a table.
  4. Answering machine with a clock that reads 11:26 is sitting on a brown table.
  5. Digital message recorder that also displays the time.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 11: VizWiz_train_00022135.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The fabric has a plaid pattern that appears to be blue and pink.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 12: VizWiz_train_00010821.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a cup of food on a table.

Visual question: hello can you please tell me what a caN of beans is thank you

Answers:

  1. unanswerable
  2. garbanzo chick peas
  3. bush brand pinto beans
  4. unanswerable
  5. something you eat
  6. pento
  7. unanswerable
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A hand holding something, in which the letters " the vegetables with more" is printed.
  2. A large can of name brand baked beans with a blue label and yellow logo
  3. a person hand holding a blue color can with a label of the vegetable with more
  4. Can of beans showing "the vegetable with more" symbol
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 13: VizWiz_train_00003996.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a book that is on the floor.

Visual question: What is this?

Answers:

  1. unsuitable
  2. unsuitable
  3. house
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. package
  10. box

Reasons why answers differ:

Image captions:

  1. A box containing a single serving of macaroni and cheese.
  2. A box of macaroni and cheese is on a person's lap who is wearing ripped jeans.
  3. A person is kneeling right by the dresser.
  4. Macaroni cheese packaged food box placed on lap.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 14: VizWiz_train_00019552.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A green plate filled with broccoli on a table.

Visual question: Does this broccoli look moldy at all?

Answers:

  1. no
  2. no
  3. no
  4. no
  5. yes
  6. no
  7. maybe spots
  8. no
  9. no
  10. no

Reasons why answers differ:

Image captions:

  1. a bunch of raw broccoli florets in a bowl
  2. A fresh vegetable is on the plate and shown in this image.
  3. a green ceramic plate with green plain broccoli florets in a pile on it
  4. A plate full of raw broccoli sitting atop a wooden table.
  5. A teal plate with raw broccoli florets on it

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 15: VizWiz_train_00000298.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a person on a cell phone.

Visual question: What is the title of this DVD please?

Answers:

  1. unanswerable
  2. unsuitable
  3. back box
  4. unsuitable
  5. masterclass
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. masterclass in finding nation s funny bone
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a DVD case for a comedy show by Michael McIntyre
  2. A DVD case is laying on a wood surface.
  3. some type of movie DVD case with a movie DVD in it
  4. the back of a DVD case sitting on a table
  5. The back of a DVD of a comedy routine

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 16: VizWiz_train_00006816.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A laptop computer sitting on top of a keyboard.

Visual question: What the screen says. Because I'm reinstalling windows and I'm an end-user. The last time I had someone read it out and say it was twenty minutes left.

Answers:

  1. unsuitable
  2. to improve appearance ofadjust your screen resolution
  3. unanswerable
  4. unanswerable
  5. says to improve appearanceadjust your screen resolution
  6. unable to see full screen move right up slightly
  7. unsuitable
  8. unsuitable
  9. unanswerable
  10. to improve appearance adjust your screen resolution

Reasons why answers differ:

Image captions:

  1. A black laptop that is open with a pop up screen showing.
  2. A black laptop with a blue screen and a gray window sits in a flat surface
  3. a computer message about adjusting a screen resolution
  4. A computer screen is displaying a window with a message
  5. Laptop monitor displaying Windows style blue error screen.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_train_00023135.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a black colored plastic Compaq brand wired computer mouse
  2. A black Compaq computer mouse on the edge of a desk, with part of a CD at the right hand corner of the desk,
  3. A black mouse, DVD, and book on a desk
  4. A Compaq branded black wired mouse is on the desk.
  5. a computer mouse that is primarily used to control your computer

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 18: VizWiz_val_00003152.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of toothpaste on a table.

Visual question: What medicine is in this bottle?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unknown
  6. unsuitable
  7. pain reliever
  8. pain killer
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A blurry image of a bottle of equate medicine.
  2. a bottle of advil or some type of medicine
  3. A bottle of equate medicine sits on a couch
  4. A bottle of medicine is right on the carpet.
  5. A white bottle has blue and white writing on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_train_00003761.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person wearing glasses.

Visual question: Is this woman confused?

Answers:

  1. yes
  2. no
  3. unanswerable
  4. yes
  5. no
  6. unanswerable
  7. maybe
  8. no
  9. no
  10. thinking

Reasons why answers differ:

Image captions:

  1. A picture of a lady holding a pencil on a cover of a magazine.
  2. A portrait of a grandmother wearing black glasses is on an advertisement copy.
  3. A woman wears eyeglasses and holds a pencil in a thoughtful pose on the cover of a magazine.
  4. An older woman is shown with grayish blonde hair and glasses, resting her chin on her fist and holding a pencil with blue words partially visible across her forehead.
  5. Appears to be a cover of a magazine, there is an older woman pictured holding a pencil to her chin deep in thought.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 20: VizWiz_train_00019979.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What does this say?

Answers:

  1. dewalt
  2. dewalt
  3. dewalt manual
  4. dewalt
  5. dewalt
  6. dewalt
  7. dewalt
  8. default 1 hour charger
  9. dewalt
  10. dewalt 1 hour charger

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A number of papers are pinned together and the top paper has many prints.
  2. an instruction manual for a Dewalt brand 1 hour charger
  3. an instruction manual for a dewalt electric tool
  4. An instruction manual for a DeWalt one hour charger.
  5. The warranty or manual for a dewalt tool.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 21: VizWiz_train_00003610.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a small red and white sign.

Visual question: What does that say?

Answers:

  1. schlarfly beer brewed in saint louis
  2. schlafly
  3. clafly
  4. schlafly
  5. schlafly beer
  6. brewed in schlafly beer saint louis
  7. schlafly beer brewed in saint louis
  8. brewed in schlafly beer st louis
  9. schlafly beer
  10. hand

Reasons why answers differ:

Image captions:

  1. A wooden coaster has the logo of a beer company
  2. Hand holding a coaster for a brewing company.
  3. I see a table with four legs on it and a person holding a can
  4. In a room with tan carpet and a folding table with blue plastic cups on it, a hand holds a drink coaster from Schlafly Beer Co.
  5. Orange and white coaster reading Schlafly Beer Brewed in Saint Louis.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 22: VizWiz_train_00017520.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black and white clock sitting on a table.

Visual question: Could you tell me what this card says or what it appears to be?

Answers:

  1. son birthday card
  2. son
  3. card says son
  4. son
  5. son
  6. son
  7. son
  8. son
  9. son
  10. son

Reasons why answers differ:

Image captions:

  1. a bedroom that has a window, and it's dark outside
  2. A brown card or package with SON written on it is in focus in a room with a cardboard box sitting in front of a white framed window and a very large poster with a photo of a woman on it, next to an open door.
  3. A photo frame is in a room near a box.
  4. a window with a white blind, a poster of a woman and the corner of a book
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 23: VizWiz_train_00008621.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s tie.

Visual question: What type of chocolate is this?

Answers:

  1. crunchie
  2. candy
  3. munching
  4. crunchie
  5. crunchy
  6. crunchie
  7. munchie
  8. crunchie
  9. crunch
  10. munchie

Reasons why answers differ:

Image captions:

  1. A candy bar sitting on a white button up shirt.
  2. A picture of a person's torso in a white button up shirt taken from a top down angle while they are laying down with a candy bar with crumbs on sitting on their chest.
  3. Candy bar sitting on top of a white button down shirt
  4. person in a white shirt with a candy bar wrapper sitting on chest area
  5. Pictured is a candy wrapper laying on top of light pink shirt.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 24: VizWiz_train_00015231.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a computer on a table.

Visual question: What is in this bottle?

Answers:

  1. lysol bathroom cleaner 4 in 1
  2. bathroom cleaner
  3. bathroom cleaner
  4. bathroom cleaner
  5. bathroom cleaner
  6. bathroom cleaner
  7. bathroom cleaner
  8. bathroom cleaner
  9. cleaner
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A bottle of bathroom cleaner is laying on a table.
  2. A bottle of Lysol bathroom cleaner 4 in 1.
  3. A spray bottle of four in one bathroom cleaner
  4. Quality issues are too severe to recognize visual content.
  5. The bottom part of a bathroom cleaner bottle.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 25: VizWiz_train_00018825.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: An orange sitting on top of a table.

Visual question: what is that

Answers:

  1. orange
  2. orange
  3. orange
  4. lemon
  5. orange
  6. orange
  7. orange
  8. lemon
  9. lemon
  10. orange

Reasons why answers differ:

Image captions:

  1. a lemon sitting on a white linoleum countertop
  2. An average sized orange that is fairly light in color
  3. An orange sitting on a countertop taken from the top down
  4. Shiny orange citrus fruit sitting on a light gray surface.
  5. the top of a lemon that is sitting on a gray carpet

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 26: VizWiz_train_00023567.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Is my light on, question number one, and the other question I have is what time is it in Central Daylight Time?

Answers:

  1. unanswerable
  2. light on unanswerable
  3. yes unanswerable
  4. yes 5:20 pm
  5. yes light on unanswerable
  6. yes 11:30 cdt
  7. yes.12:09 pm
  8. yes
  9. yes 11:12 am
  10. yes unanswerable

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 27: VizWiz_train_00012951.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign with a clock on it.

Visual question: What type of spice is this?

Answers:

  1. oregano
  2. oregano
  3. oregano
  4. oregano
  5. oregano
  6. oregano
  7. oregano
  8. oregano
  9. oregano
  10. oregano

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bottle with a label of oregano captured horizontally.
  2. A container of Marshalls Creek Spices Oregano with a yellow, white, and red label on the front.
  3. A Marshalls Creek Spices label on a bag of oregano.
  4. a plastic container of 5 ounces of oregano spice
  5. Front label of a container of Marshalls Creek Oregano spice weighing 5 ounces.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_val_00006945.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A flat screen iPad tablet is powered on and displaying the home screen.
  2. A turned on iPad on it's home screen on wooden table.
  3. A yellow background with a white iPad laying on it showing the icons home page
  4. An older white iPad on showing the basic install apple apps.
  5. The computer has many icons on the front and fake raindrops.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 29: VizWiz_train_00002881.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a cloudy sky with a large light.

Visual question: what color is this?

Answers:

  1. grey
  2. grey
  3. unsuitable
  4. grey
  5. white
  6. white
  7. white
  8. unsuitable
  9. grey
  10. grey

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 30: VizWiz_train_00008797.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a table.

Visual question: What type of kcup is this, coffee or tea? And what flavor?

Answers:

  1. unable to see box move camera up
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. i dont know
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A shiny white surface with a yellow box on top of it.
  2. A white object with an orange object on top of it.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. White object with a shiny finish, and several dirt and scrape marks.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 31: VizWiz_train_00000340.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cat sitting on a bed next to a wall.

Visual question: What is this?

Answers:

  1. unsuitable
  2. floor
  3. mattress
  4. unsuitable
  5. floor
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. sofa cushion
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A bed sitting on a beige colored piece of carpet.
  2. A cloth seat is shown near a black object.
  3. A very light colored carpet directly in front and to the left is a bed with just the boxspring visible.
  4. I see a cream carpet next to a beige box spring mattress.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 32: VizWiz_val_00000537.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: what type of tablets are these?

Answers:

  1. unanswerable
  2. unsuitable image
  3. unanswerable
  4. unsuitable image
  5. unsuitable image
  6. unsuitable image
  7. unsuitable image
  8. unsuitable image
  9. unsuitable image
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a baby blue and white package with purple text
  2. A blue box with information of the product sitting on a brown surface.
  3. a box of pills showing the ingredients on a granite counter
  4. Quality issues are too severe to recognize visual content.
  5. The side of a cardboard product box sitting on a black surface.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 33: VizWiz_val_00001008.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: whats this?

Answers:

  1. tv
  2. television
  3. television
  4. old tv monitor
  5. tv
  6. tv
  7. monitor
  8. television screen
  9. tv
  10. monitor

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A large grey Samsung CRT television with audio/video output cables.
  2. A television, which is on, sitting on a small wooden table on top of a carpet.
  3. a white Samsung television placed on a wooden standing
  4. Front view of a television that is on with three items plugged into the front, left plus is yellow, middle plug is white and right plug is red.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 34: VizWiz_train_00015369.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A purple picture of a pink sky.

Visual question: What color is this fabric?

Answers:

  1. pink
  2. pink
  3. purple
  4. pink
  5. pink
  6. purple
  7. unsuitable
  8. pinkish
  9. unanswerable
  10. pink

Reasons why answers differ:

Image captions:

  1. a pink striped fabric or t shirt clothing
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Up close view of pink and white striped shirt.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 35: VizWiz_train_00010038.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of wine sitting on top of a bed.

Visual question: What's in this bottle?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. lotion
  5. unanswerable
  6. shampoo
  7. unsuitable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A green bottle of shampoo that is resting in someone's lap.
  2. A product in a light green tube on top of a gray fabric piece.
  3. A small plastic bottle being held in a lap.
  4. A very wonderful view and worth seeing at all times, my friend
  5. Plastic bottle situated between the pant legs of an individual wearing a sweater.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 36: VizWiz_train_00017070.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man laying on a banana on a skateboard.

Visual question: What is this that I am looking at?

Answers:

  1. foot
  2. foot
  3. foot shoe
  4. foot
  5. carpet foot black sandal
  6. slipper
  7. sandal
  8. leg woman
  9. foot
  10. foot

Reasons why answers differ:

Image captions:

  1. A foot wearing a flip flop with red nail polish.
  2. a woman with red toenails wearing a black leather sandal
  3. A women wearing sandals with red toenail paint on a dark carpet.
  4. Appears to be a picture of a foot with sandals
  5. Someone with painted toenails is standing in black sandals on a grey patterned carpet.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 37: VizWiz_train_00001999.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a picture of a laptop screen.

Visual question: Please solve this captcha.

Answers:

  1. ek7s
  2. ek7s
  3. ek7s
  4. ek7s
  5. ek7s
  6. ek7s
  7. ek7s
  8. ek7s
  9. ek7s
  10. ek7s

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a captcha image on a computer reading Keys
  2. A caption text is displayed on a phone screen
  3. A phone screen with a verification box to enter letters to continue with the application being used to complete a download with buttons along the bottom for updates more downloads, to search and to file as well as get to the home screen
  4. A screenshot of a captcha from an iPhone device.
  5. A screenshot on a phone showing a CAPTCHA screen

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_train_00013399.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a drink.

Visual question: What is this can?

Answers:

  1. soup
  2. soup
  3. campbells soup
  4. canned soup
  5. campbells soup
  6. soup
  7. vegetable soup
  8. soup
  9. soup
  10. campbells soup

Reasons why answers differ:

Image captions:

  1. A Campbell's soup can held in someone's hand on a black counter.
  2. a can of Campbell's soup being held by a human hand
  3. A can of soup on a counter from Campbell's.
  4. a person's hand holding a can of Campbell's soup
  5. Can of soup with photo of soup displayed on the front and a red top label

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_val_00004764.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A red can of baked beans and sausages, the nutritional information is also listed on the front.
  2. A red can of mixed beans and sausages
  3. A red can of small baked beans and cooked sausages.
  4. A regular sized can of baked beans with sausages
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 40: VizWiz_train_00021229.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A beige long thing being held by a hand.
  2. A person is holding an electronic device in their hand.
  3. Quality issues are too severe to recognize visual content.
  4. Someone holding a long narrow something in their hand.
  5. Someone is holding something golden and rectangular and thin in their right hand.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 41: VizWiz_train_00007019.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a table.

Visual question: What is in that bowl?

Answers:

  1. nothing
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. water
  9. unsuitable
  10. pan

Reasons why answers differ:

Image captions:

  1. A black piece of plastic on top of a rusted metal surface.
  2. A metal object with a recycle logo stamped in the lower left corner on top of a piece of yellow speckled granite.
  3. A recyclable table tray sits on the countertop
  4. black rounded square object on a wooden table
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 42: VizWiz_val_00001854.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a table.

Visual question: What's the name of the perfume?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. no name
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A glass of water has fingers under it.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 43: VizWiz_train_00007283.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of water.

Visual question: What is this?

Answers:

  1. water bottle
  2. water bottle
  3. bottled water
  4. water
  5. nestle pure life water
  6. can
  7. water
  8. nestle pure life
  9. bottled water
  10. bottled water

Reasons why answers differ:

Image captions:

  1. A bottle of water and a Sprite soda can are sitting on a wooden surface.
  2. A clear plastic bottle with a blue label that reads Nestle Pure Life water.
  3. A half empty bottle of Nestle water sits on a wood table and a can of Sprite is behind it.
  4. A half-empty bottle of Nestle Pure Life water sitting on a wooden table
  5. Beautiful view from behind the walls hidden under dark mist

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 44: VizWiz_train_00003442.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a gray, of the sky.

Visual question: what is this please

Answers:

  1. unsuitable
  2. unsuitable
  3. nothing
  4. unsuitable
  5. blank image
  6. unsuitable
  7. unanswerable
  8. nothing
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a gray pebbled counter top surface in a stone material
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 45: VizWiz_train_00009918.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What is in this box?

Answers:

  1. beef stroganoff
  2. beef stroganoff
  3. beef stroganoff
  4. beef stroganoff
  5. beef stroganoff
  6. beef stroganoff
  7. beef stroganoff
  8. beef strognoff
  9. beef stroganoff
  10. beef stroganoff

Reasons why answers differ:

Image captions:

  1. A container of prepared microwaveable beef stroganoff meal.
  2. A microwave beef stroganoff dinner with red label on a woven placemat.
  3. A packed box of beef stroganoff put on table
  4. Image is a beef stroganoff red in color
  5. Photo is of a frozen Beef Stroganoff microwavable meal.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_train_00018954.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A window with a sign on the side of it.

Visual question: Is there anybody in front of me?

Answers:

  1. no
  2. no
  3. no
  4. gbgdf
  5. no
  6. no
  7. unanswerable
  8. no
  9. no
  10. no

Reasons why answers differ:

Image captions:

  1. A view from a condo apartment balcony of a nighttime city skyline
  2. a window looking out to a small deck.
  3. A yellow wooden cut patterned divider behind glass doors.
  4. Glass window or door with yellow railing outside of the door or window and a view of what looks like a city but is too hard to tell because of a glare in the glass
  5. Reflections from camera flashes have made it difficult to see the railing out this window.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 47: VizWiz_train_00000386.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A little girl sitting in front of a sign.

Visual question: What is the title of this book?

Answers:

  1. grandpas teeth
  2. grandpas teeth
  3. grandpas teeth
  4. grandpas teeth
  5. grandpas teeth
  6. grandpas teeth
  7. grandpas teeth
  8. grandpas teeth
  9. grandpas teeth
  10. grandpas teeth

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A couple of publications with a graphic of an elderly man and red lettering.
  2. A pamphlet or book that reads "Rod Clement Grandpa's Teeth" with a picture of an old man on the front, and the top shown of a copy of the same book behind it
  3. A rectangular book about an old person's teeth.
  4. Quality issues are too severe to recognize visual content.
  5. The front album of a DVD cover with a carpet

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 48: VizWiz_train_00011608.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A can of coffee sitting next to a refrigerator.

Visual question: What flavor of pop is this?

Answers:

  1. cherry cola
  2. cherry coke
  3. cherry
  4. cherry
  5. cherry coke 0
  6. cherry coke 0
  7. cherry
  8. coke cherry 0
  9. cherry coke
  10. cherry coke

Reasons why answers differ:

Image captions:

  1. A can of soda and paper bowl are on top of a table.
  2. An old tube TV is on a cabinet mostly off frame showing a show with a yellow background
  3. An opened can of Coke cherry zero calories.
  4. Cherry coke is in the can on the table.
  5. The can of cola is on the table by a white bowl.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00003796.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a pink flower on a purple surface.

Visual question: what color is this top ?

Answers:

  1. green
  2. pink
  3. purple
  4. dark pink embroidered leaves butterfly
  5. purple
  6. pink
  7. pink
  8. pink
  9. pink
  10. purple

Reasons why answers differ:

Image captions:

  1. A pink top or sweater with an embroidered butterfly and leaf design.
  2. a purple blanket with butterflies and flowers embroidered
  3. A stitched pattern on top of purple fabric.
  4. Embroidery of a pink butterfly on a pink shirt.
  5. The torso of a fuschia shirt, showing a v-neck cut with thread embroidery of a pink and green butterfly next to two blue circles and two brown sets of leaves.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 50: VizWiz_train_00019895.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue and white plate.

Visual question: I would like to know the expiration date on the yogurt.

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. yogurt
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. yogurt

Reasons why answers differ:

Image captions:

  1. A container of low-fat yogurt sold by a company called PriceRite
  2. A plastic package of nonfat yogurt is shown on a light wooden table
  3. The lid of a container of Price Rite brand Low fat Yogurt
  4. The white container has blue text on the lid
  5. top of Price Rite Low fat Yogurt on top of a wood table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Showing images 0 - 0 out of 0 matching images.