Images are displayed from Training and Validation sets only.

Image 1: VizWiz_train_00008805.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person sitting on a table.

Visual question: What flavor are these energy bars?

Answers:

  1. peanut butter crunch
  2. unanswerable
  3. peanut butter crunch
  4. peanut butter crunch
  5. peanut butter crunch
  6. er
  7. unsuitable
  8. unsuitable
  9. peanut butter crunch
  10. peanut butter crunch

Reasons why answers differ:

Image captions:

  1. A box of PureFit branded weight loss drink.
  2. a weight loss snack bar peanut butter flavored by pur fit
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. some sort of granola bars that is brown and white

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 2: VizWiz_train_00015021.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A plate of food with a sign on it.

Visual question: What kind of cereal is this?

Answers:

  1. corn flakes
  2. corn flakes
  3. corn flakes
  4. cornflake cereal
  5. corn flakes
  6. corn flakes
  7. cornflake
  8. corn flakes
  9. corn flakes
  10. spartan corn flakes

Reasons why answers differ:

Image captions:

  1. a box of cornflake cereal with a picture of the cereal and red raspberries in a spoon on the front
  2. A box of generic Corn Flakes is facing the camera.
  3. A white 18 oz box of corn flake cereal.
  4. A white and red box of corn flake cereal.
  5. I see a box of awesome delicious corn flakes

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 3: VizWiz_train_00002974.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A brown and white dog on a leash.

Visual question: What is this?

Answers:

  1. lab
  2. dog
  3. dog
  4. dog
  5. dog
  6. golden retriever
  7. yellow lab
  8. dog on leash
  9. dog
  10. bassett hound

Reasons why answers differ:

Image captions:

  1. A dog wearing a harness is sitting on the ground.
  2. A sad-looking yellow lab, with a harness attached to a handle for its owner, is sitting on the sidewalk near a brick building, staring into the camera.
  3. A white colored lab dog sits with it's leash on the sidewalk.
  4. A yellow lab is sitting with a leather harness and leash looking at the camera.
  5. One of the pet dogs is a cute white dog.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 4: VizWiz_val_00005986.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A woman's blouse that must be her favorite shirt.
  2. Green, blue, and white striped fabric which is wrinkled.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The front of someone's shirt is shown in the image.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 5: VizWiz_train_00005005.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a book on a table.

Visual question: What currency and how much?

Answers:

  1. italian 2 dollars
  2. 2 dollars singapore
  3. 2 dollars
  4. singapore
  5. singapore 2 dollars
  6. 2 singapore dollars
  7. singapore 2 dollars
  8. singapore: 2 dollars
  9. 2 dollars
  10. says 2 dollars i dont know currency says singapore on

Reasons why answers differ:

Image captions:

  1. A Singapore $2 bill sits on a wood table.
  2. A two dollar bill from where I presume to be is Singapore, it is pink and blue with the picture of a sailboat/ship on it.
  3. A two dollar bill of legal tender from Singapore.
  4. A two dollar multi colored bill from Singapore.
  5. Back view of Singapore currency that shows 2 dollars.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 6: VizWiz_train_00021139.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a card which displays a coca cola of two
  2. A Coca Cola vending machine with two large visual art buttons.
  3. Close up picture of 2 side by side Coca-Cola bottles on a drink machine.
  4. Coca cola machine with large bottles in the front of it.
  5. Two bottles of twenty ounce Coca-Colas in a drink machine.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 7: VizWiz_train_00021645.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bottle of Red Robin seasoning on a wood counter.
  2. A container of Red Robin seasoning with a red cap.
  3. A person sits at a table and in the foreground is a package of seasoning.
  4. appears to be a picture of a bottle of spices
  5. Jar of Red Robin Seasoning on a wooden table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 8: VizWiz_train_00000890.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a car on the ground.

Visual question: What does this say?

Answers:

  1. talking timer
  2. nothing
  3. talking timer
  4. talking timer
  5. talking timer
  6. talking timer
  7. talking timer
  8. talking timer
  9. talking timer
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A person wearing a black shirt standing at a receptionists desk.
  2. a picture of someone stomach and black shirt with a box sitting on a marble surface saying talking timer
  3. a stainless steel nameplate that reads Talking Timer
  4. A view as if someone were looking down at their feet.
  5. Talking Timer is what the box on the counter says.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 9: VizWiz_train_00008722.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee is sitting on a table.

Visual question: What kind of soup is this?

Answers:

  1. cream mushroom
  2. campbells cream mushroom
  3. cream mushroom
  4. cream mushroom
  5. mushroom
  6. campbells cream mushroom
  7. cream mushroom
  8. cream mushroom
  9. mushroom soup
  10. cream mushroom

Reasons why answers differ:

Image captions:

  1. A can of Campbell's cream of mushroom soup sitting on top of a green counter.
  2. A can of soup sitting on a green counter top with a wood edge.
  3. A soft drink cane take in the table shown in the image.
  4. Can of Campbell's brand condensed Cream of Mushroom soup
  5. Front view of a Campbell's brand of cream of mushroom soup.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 10: VizWiz_train_00008542.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is sitting on a wall.

Visual question: What does this say? What does this say?

Answers:

  1. unanswerable
  2. to earn 2500 points after qualifying
  3. unsuitable
  4. too close to text but about travel points
  5. travel agencies
  6. unsuitable
  7. unsuitable
  8. unanswerable
  9. earn 2500 points after qualifying
  10. how to earn 2500 points

Reasons why answers differ:

Image captions:

  1. A letter with black text suggests information for traveling.
  2. A part of your travel agencies brochure, and hotels.
  3. A piece of paper with instructions on how to qualify for points for something.
  4. piece of paper with advertising copy telling you how to earn points
  5. This looks like a document someone received by staying at a hotel and receiving points because of that.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 11: VizWiz_train_00021605.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue box of pepcid AC 30-count tablets.
  2. a blue paper pack of Pepsi AC Tablet
  3. Blue box with acid reducer tablets is held in hands.
  4. Pepcid AC box containing thirty tablets for heartburn and indigestion.
  5. Someone is holding a blue box of Pepcid AC.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 12: VizWiz_train_00015502.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A can of soda sitting on a table.

Visual question: What kind of soda is this?

Answers:

  1. fdsfdsf
  2. mountain dew
  3. mountain dew
  4. mountain dew
  5. mtn dew
  6. mountain dew
  7. mountain dew
  8. mountain dew
  9. mountain dew
  10. mountain dew

Reasons why answers differ:

Image captions:

  1. A can of Mountain Dew soda rests on a table with the staircase of a home in the background.
  2. a drink which is in tin container in green color
  3. a green can with green and red text , it's a can of mountain dew
  4. A green color juice can is found in the table
  5. Mountain Dew is in the can on the table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 13: VizWiz_val_00001504.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man is holding a piece of scissors.

Visual question: Can you tell me what is written on the coin holder? Thank you.

Answers:

  1. no
  2. no
  3. no
  4. unsuitable
  5. unsuitable
  6. no
  7. unanswerable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A blurry picture of a hand holding a coin that is in a box, the hand and coin are halfway off the screen and the background is a gray floor with packaged toys on it.
  2. A person is holding in their hand a packaged collectible coin.
  3. Hand holding a commemorative coin in paper packaging
  4. someone holding a single coin in a protective white box
  5. someone is holding an old coin of some sort in their hand

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 14: VizWiz_train_00008918.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a laptop with a sign.

Visual question: What is this?

Answers:

  1. chocolate candies
  2. candy
  3. chocolate
  4. unsuitable
  5. thorntons chocolates
  6. milk dark white chocolate covered candy
  7. classic collection chocolates
  8. thorntons classic collection
  9. assorted chocolates
  10. candy

Reasons why answers differ:

Image captions:

  1. A box of chocolates with a pink and white label.
  2. a pack of Thorntons labeled as classic collection
  3. A white box of chocolate candies with pictures of candies on it.
  4. Quality issues are too severe to recognize visual content.
  5. White box of chocolates classic collection with filled centers.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 15: VizWiz_train_00007869.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a person holding a bag.

Visual question: What kind of tablets are these? Are they pain killers? Thank you.

Answers:

  1. yes pain killer
  2. ibuprofen yes
  3. ibuprofen yes
  4. ibuprofen 200 mg
  5. ibuprofen caplets
  6. ibuprofen yes
  7. ibuprofen 200mg caplets nsaid pain relievers
  8. ibuprofen 200mg
  9. ibuprofen
  10. ibuprofen yes

Reasons why answers differ:

Image captions:

  1. A finger holding a punch pouch of Ibuprofen with a couple missing.
  2. a white sachet of ibuprofen 200 mg caplets
  3. Blister pack of ibuprofen 200mg tablets, two already open.
  4. Foil pack of Ibuprofen 200mg caplets with three caps missing.
  5. press through pill holders containing 200 mg ibuprofen caplets

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 16: VizWiz_train_00005809.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a green field.

Visual question: What color is this top?

Answers:

  1. green
  2. green
  3. green
  4. green
  5. light green
  6. beige
  7. gold
  8. green
  9. grey
  10. green

Reasons why answers differ:

Image captions:

  1. A close up of a green knit fabric material stretched tight.
  2. A large piece of greenish blue thinly striped fabric.
  3. A light green blanket or pillow with no patterns.
  4. A shirt or fabric that's green and very zoomed in
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 17: VizWiz_train_00019314.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cell phone sitting next to a book.

Visual question: What is in this can?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. heinz
  5. soup
  6. bottle
  7. unanswerable
  8. soup
  9. unanswerable
  10. beans

Reasons why answers differ:

Image captions:

  1. A can of food is on top of a table.
  2. A can with a red label that reads 105 calories and low fat.
  3. An image of a can is being depicted on a wooden counter top.
  4. Quality issues are too severe to recognize visual content.
  5. the back of a can of food showing nutritional information

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 18: VizWiz_train_00022090.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close-up of the Apple logo is shown.
  2. A closeup of the Apple logo found on the top of laptops
  3. an apple logo that is white and grey
  4. In this photo is the apple symbol on a laptop
  5. White apple logo on a silver surface that has a sticker on it as well.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 19: VizWiz_val_00004537.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A dark grey tray sitting on a cooking range with one French fry lying on it and next to a brown package.
  2. A lone French fry sits on a baking pan on top of a stove.
  3. a tray with a snail on it besides it a pan and a brochure.
  4. Empty pans on top of a stove next to instructions for a breakfast mix.
  5. In this image is a cooking pan and a frying pan on a stove.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 20: VizWiz_train_00015326.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pile of books.

Visual question: How much popcorn is in this box?

Answers:

  1. dunno
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. microwave popcorn
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A bag of food with lettering that is not in the English language takes up the entire frame.
  2. A bag of Pop Secret popcorn sitting on a comforter.
  3. Blue, yellow and white paper bag of pop secret microwave popcorn.
  4. Something is packed in a paper which has a yellow, blue and black writings.
  5. The white object on the brown comforter has a lot of writing on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 21: VizWiz_train_00013706.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a computer screen.

Visual question: What comment do you see in this picture?

Answers:

  1. unanswerable
  2. no comments
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. blue

Reasons why answers differ:

Image captions:

  1. A computer monitor screen displaying a blue screen saver.
  2. A computer screen is shown with a blue screen
  3. An illuminated blue screen on a monitor of some sort.
  4. Controls for a laptop are displayed under the screen.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 22: VizWiz_val_00001501.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a computer keyboard.

Visual question: Can you see what it says on the screen?

Answers:

  1. unsuitable
  2. unsuitable
  3. no
  4. phone receiving
  5. yes
  6. no
  7. unsuitable
  8. hang up
  9. yes
  10. phone receiver off hook

Reasons why answers differ:

Image captions:

  1. A computer screen is lit up in a very dark room.
  2. a phone showing some information about sending and receiving of messages
  3. A picture of a phone informations screen in black
  4. Quality issues are too severe to recognize visual content.
  5. UP CLOSE SNAPSHOT OF A COMPUTER SCREEN.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 23: VizWiz_train_00009525.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A container of food with a knife on it.

Visual question: Can you tell me what is in this bag?

Answers:

  1. mirepoix style lend
  2. unsuitable
  3. mixed vegetables
  4. mirepoix style blend vegetables
  5. mirepoix blend
  6. mirepoix blend
  7. mirepoix style blend
  8. mirepoix style blend onions carrots celery
  9. mirepoix style blend onions carrots celery
  10. pepper blend

Reasons why answers differ:

Image captions:

  1. A bag a frozen vegetables labeled as mirepoix style blend on an dark orange background with an image of the vegetables in a bowl above the text.
  2. A bag of frozen mixed veggies from the Kroger grocery store
  3. Bag of chopped and frozen vegetable blend with onions, carrots and celery.
  4. frozen vegetables labeled mirepoix style blend onions carrots and celery
  5. Kroger mirepoix style blend bag of vegetables containing onions, carrots and celery.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 24: VizWiz_train_00019785.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man is sitting in front of a television.

Visual question: Is my TV on? I have one more question. What is the history of your company? Can you give me a brief history? I'm curious.

Answers:

  1. yes no history
  2. unanswerable
  3. yes
  4. yes no
  5. tv on
  6. yes tv on i do not know anything about company i am turk worker
  7. yes
  8. unanswerable
  9. yes
  10. yes

Reasons why answers differ:

Image captions:

  1. A black flat screen TV with two people on the screen who are standing and looking at the camera.
  2. A television sitting on a table is turned on.
  3. A television that is sitting on a desk is on and the desk has a baseball cap, a remote, and other items on it while there is a shelf with awards and soda cans along with other items above the television.
  4. Some kind of TV show with a man and woman host
  5. TV screen showing a news program, electric razor in front of the TV.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 25: VizWiz_val_00003786.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A man wearing a black shirt is standing in a room.

Visual question: What color are my pants?

Answers:

  1. black white stripes
  2. black
  3. black white stripes
  4. black
  5. black white
  6. black
  7. black white stripes down sides
  8. black
  9. black
  10. black

Reasons why answers differ:

Image captions:

  1. A close up of a person's lap wearing a black and white tracksuit.
  2. a person sitting down wearing black [ants with white stripes
  3. in this photo is a pair of legs wearing black pants and black shoes.
  4. legs on a couch wearing black sweats with white stripes on the side
  5. The photographer's black athletic pants with white stripes.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 26: VizWiz_train_00020506.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black curtain that is see through on a window.
  2. a sage green knitted curtain hung on a window
  3. A window with a loosely woven curtain hanging in front of it
  4. Here is a closeup of a brownish colored curtain at a window near a brown wooden table with a lamp on it.
  5. The old brown curtain is above the table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 27: VizWiz_train_00008909.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of milk sitting on a wooden table.

Visual question: What is this medication?

Answers:

  1. aspirins
  2. unanswerable
  3. tylenol
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A bottle of pills is on top of a table.
  2. a bottle of pills that is white and on a counter
  3. a small white plastic bottle of pills on a counter
  4. A small, white medicine bottle is sitting atop a wooden table.
  5. White bottle of medicine on wooden table surface .

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 28: VizWiz_train_00018273.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a glass of water.

Visual question: Is it a door?

Answers:

  1. no
  2. unsuitable
  3. unsuitable
  4. no
  5. unanswerable
  6. yes
  7. no
  8. no
  9. yes
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. Container surrounded with a wire grate exterior in a room with a bookcase with records.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 29: VizWiz_train_00006599.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black and white photo of a dark bed.

Visual question: What was this cassette tape?

Answers:

  1. no idea
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. no tape
  6. no cassette
  7. unanswerable
  8. bob dylan
  9. no
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A close up of human skin of a caucasian person.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 30: VizWiz_val_00003468.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A living room with a desk and a lamp.

Visual question: What is this?

Answers:

  1. unanswerable
  2. office
  3. living room
  4. tables
  5. 2 desks in room
  6. unsuitable
  7. livingroom
  8. night stand
  9. bedroom
  10. side table chair tv tray

Reasons why answers differ:

Image captions:

  1. A living room space showing a chair, a small end table and laptop on a small table.
  2. A room or living room with two small coffee tables, jewelry boxes and a small lump on one table, and a black laptop or DVD player on the other table, a gumball machine in the background.
  3. An end table next to a TV tray with a gumball machine on it.
  4. Two wooden tables with a laptop, candy machine, a lampshade and some other stuff messily lying on them.
  5. Various items of furniture, a chair, tables, and a gumball machine.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 31: VizWiz_train_00012936.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a train on a window.

Visual question: Is this shake and bake for chicken or pork?

Answers:

  1. unanswerable
  2. no
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. unable to see box
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a red chair and a wood table beside the window
  2. A red wooden chair is at the edge of a table with circle design on a black background
  3. a table that has the trunk of a tree design on it with a chair next to it
  4. Part of a table with a ring design and a red chair.
  5. some type of red chair under near a table top

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 2 / 5 annotators

Image 32: VizWiz_train_00007274.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a toilet with a hat on it.

Visual question: What do I see

Answers:

  1. toilet paper
  2. toilet tissue
  3. toilet paper
  4. toliet paper
  5. toilet paper
  6. toilet paper
  7. toilet paper
  8. toilet paper
  9. toilet paper
  10. toilet paper

Reasons why answers differ:

Image captions:

  1. a picture of toilet paper stacked up in a corner
  2. A stack of toilet paper rolls in the corner of a bathroom.
  3. A tall narrow stack of five rolls of toilet paper is in a corner.
  4. A tall stack of toilet paper in the corner of a tiled room.
  5. Five rolls of white toilet paper stacked atop each other in the corner of a tiled bathroom.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 33: VizWiz_train_00007603.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: What is in this box.

Answers:

  1. sesame chicken
  2. unanswerable
  3. sesame chicken tenders pasta
  4. unsuitable
  5. unsuitable
  6. unanswerable
  7. chicken pasta
  8. sesame chicken pasta
  9. chicken
  10. sesame breaded chicken tender pasta

Reasons why answers differ:

Image captions:

  1. A blurry box of organic chicken dinner with pasta
  2. A frozen dinner containing Sesame Breaded Chicken Tenders with pasta.
  3. A white box of food stuff with green decals and black text
  4. Partial food label advertising a chicken and pasta meal with no preservatives and made with white meat chicken.
  5. The packaging for the product has the ingredients on the back

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 34: VizWiz_train_00004905.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A computer screen with a laptop on it.

Visual question: What's this?

Answers:

  1. laptop computer showing image smart phone
  2. omoby image reception
  3. laptop computer
  4. computer
  5. omoby phone
  6. laptop
  7. unanswerable
  8. computer
  9. computer
  10. lap top computer

Reasons why answers differ:

Image captions:

  1. A laptop screen is open with the keyboard visible and on the screen is a website for a design platform.
  2. a laptop screen with omoby asking you to download for phone
  3. A laptop with a screen displaying a download for an oMoby application.
  4. Laptop screen showing the initial screen for an IQ test.
  5. Right side of standard computer keyboard with monitor behind it

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 35: VizWiz_train_00022405.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A big black colored speaker with wires hanging on the edge of a white table
  2. A dresser with several items sitting on it, most cannot be identified but there looks to be a phone which is plugged into the charge cord, and some decorative items
  3. A sliver of a bed with a butterfly bedspread has a white nightstand with a few sundries to the right of the bed.
  4. A white dresser with a white and purple blanket next to it
  5. On a dresser near a television set is a bit of children's clutter.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 36: VizWiz_val_00001609.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person sitting on it.

Visual question: What is this package?

Answers:

  1. juice box
  2. juice
  3. juice
  4. unanswerable
  5. strawberry kiwi juice
  6. juice box
  7. fruitables juice
  8. unanswerable
  9. juice
  10. strawberry kiwi

Reasons why answers differ:

Image captions:

  1. a package of strawberry kiwi flavored juice boxes
  2. A piece of a box had a strawberry and a carrot on it.
  3. A strawberry juice drink box with a clear empty straw wrapper on top.
  4. some fresh vegetables of strawberry and carrot packed and placed.
  5. Strawberry fruit juice cartons are ready to quench some thirst.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 37: VizWiz_train_00007753.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a wall.

Visual question: What movie is this?

Answers:

  1. star wars
  2. star wars trilogy
  3. star wars trilogy
  4. star wars
  5. star wars
  6. star wars
  7. star wars
  8. star wars trilogy
  9. star wars
  10. star wars trilogy

Reasons why answers differ:

Image captions:

  1. A box for the Star Wars trilogy movies with an opened dresser in the background.
  2. An original Star Wars trilogy movie box set
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Star and wars are the only words that are visible.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 38: VizWiz_train_00020996.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. The top side of a screen monitor over a men's blue pants
  5. unclear image, white, grey, tan, with bright white light, very blurry.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 39: VizWiz_train_00009323.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a keyboard with a mouse.

Visual question: What is this?

Answers:

  1. mac keyboard
  2. keyboard
  3. keyboard
  4. keyboard
  5. computer keyboard
  6. keyboard
  7. keyboard
  8. keyboard
  9. keyboard
  10. keyboard

Reasons why answers differ:

Image captions:

  1. A metallic computer keyboard placed on someone's lap
  2. A QWERTY keyboard with white keys on a silver background is shown.
  3. A silver keyboard with white keys and black lettering.
  4. An Apple Magic Keyboard from an iMac computer
  5. Part of a keyboard is shown along with the bottom part of someone's leg.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 40: VizWiz_val_00001237.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cat on the ground.

Visual question: What kind of fish is this?

Answers:

  1. unanswerable
  2. angel fish
  3. unanswerable
  4. tang
  5. angelfish
  6. angel fish
  7. sdfsdv
  8. clown
  9. clay fish
  10. exotic fish

Reasons why answers differ:

Image captions:

  1. A fish tank with a black, yellow and white fish in an aquarium.
  2. a fish tank with a yellow, white, and black fish and an abstract painting on the wall in the behind the tank
  3. a striped black and white angel fish in its fish tank
  4. Close up of a black, yellow and white fish in a tank with sand and white rocks.
  5. White, black and yellow fish in an aquarium with coral and pebbles.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 41: VizWiz_train_00001253.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bed with a blanket.

Visual question: What is this object?

Answers:

  1. bed
  2. pillow
  3. pillow
  4. unsuitable
  5. pillow
  6. pillow
  7. pillow
  8. pillow
  9. pillow bed
  10. pillow

Reasons why answers differ:

Image captions:

  1. A pillow in a blue and yellow striped case with a yellow diamond and blue accents on the front.
  2. A yellow pillow and some other clothes on a bed
  3. Bed with a pillow case and sheet that is blue and yellow in color.
  4. Bed with triangles and stripe patterns with a polka dot purse sitting on it.
  5. Pillow thrown on a bed in front of a window.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 42: VizWiz_train_00011461.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A blurry picture of a person on the ground.

Visual question: How do you cook the pizza?

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. oven
  8. oven
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A close up of pepperoni and sausage pizza toppings.
  2. A multi colored image with various shapes and sizes of different colors.
  3. a pizza box with some pepperoni on it
  4. A slice of pepperoni cheese pizza with other toppings.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 43: VizWiz_train_00014948.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food on a table.

Visual question: What's in the bowl?

Answers:

  1. pie
  2. unanswerable
  3. pie
  4. pie
  5. cherry cobbler
  6. cherry pie ice cream
  7. dessert
  8. cherry pie
  9. cherry crisp dessert
  10. cherry pie

Reasons why answers differ:

Image captions:

  1. A slice of Dutch berry pie with ice cream is in a bowl.
  2. berry crumble on a floral plate on a sunflower tablecloth
  3. cherry cobbler dessert in a white bowl with green floral trim
  4. In the image there is a piece of cherry pie on a plate.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 44: VizWiz_train_00020135.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A photo of a box with writing on it that says special with a bar scan code on the front.
  2. Appears to be a picture of a box
  3. Price tag for a package of chicken tender strips
  4. Quality issues are too severe to recognize visual content.
  5. The barcode and price tag on a package with a special tag showing.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 45: VizWiz_val_00002629.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A living room with a television and a tv.

Visual question: What is the object in this image?

Answers:

  1. monitor
  2. tv
  3. several objects
  4. rug tv
  5. unanswerable
  6. tv
  7. rug
  8. carpet
  9. rug
  10. carpet

Reasons why answers differ:

Image captions:

  1. A room with a red and color rug with a TV on a table and boxes in the back.
  2. a room with a television and a pile of trash
  3. A room with a TV sitting on a table and some boxes and other assorted material in the corner
  4. Quality issues are too severe to recognize visual content.
  5. Room with white walls, a table containing a TV, a red rug and various boxes and property.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 46: VizWiz_train_00002476.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a wall.

Visual question: Do you know this logo?

Answers:

  1. no
  2. no
  3. no
  4. no
  5. no
  6. yes asterisk
  7. no
  8. aim
  9. vector graphic
  10. no

Reasons why answers differ:

Image captions:

  1. A golden logo that is circular with eight segments and other mascots faint in the background.
  2. A yellow logo in the foreground with various other logos in the background in black and white.
  3. an icon of an 8 petaled yellow flower with a thick black lining
  4. I can see a yellow icon in the middle of the image and a silhouette of penguin on the right.
  5. Icons and mascots of different brand or companies scattered everywhere in one image with one centered and fully colored.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 47: VizWiz_train_00008030.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dark shot of a black sky at night.

Visual question: Can you tell me what is in this bag or what type of bag it is?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. no
  5. unanswerable
  6. unsuitable
  7. no
  8. no
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 48: VizWiz_train_00016662.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee sitting on a table.

Visual question: What is this?

Answers:

  1. can green beans
  2. green beans
  3. unanswerable
  4. green beans
  5. green beans
  6. green beans
  7. green beans
  8. green beans
  9. 1 can green beans
  10. green beans

Reasons why answers differ:

Image captions:

  1. A can of green beans is right by the toaster.
  2. A can of green beans is sitting on a counter in front of an oven.
  3. A can of green beans sitting on a countertop in front of a microwave.
  4. A tin can of green beans is in front of the microwave toaster.
  5. An unopened can of green beans in front of a microwave.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 49: VizWiz_train_00014776.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a clock on a wall.

Visual question: What flavor is this?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. coffee
  5. unanswerable
  6. unanswerable
  7. doesnt say unless world coffee
  8. cafe gourmet
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box with world timothy's coffee written on it
  2. A brown box of "Timothy's World Coffee" resting on a table.
  3. A brown, square box of Timothy's world Coffee.
  4. An unopened box of k cup coffee sitting on a granite counter top.
  5. the back label of a black box of coffee sitting on a gray countertop.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 50: VizWiz_train_00017406.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A plant is sitting on a wooden table.

Visual question: What is this?

Answers:

  1. plant
  2. plant
  3. plant
  4. plant
  5. plant
  6. plant
  7. plant
  8. plant
  9. potted plant
  10. plant

Reasons why answers differ:

Image captions:

  1. A potted plant is sitting near a window on a table.
  2. A tall green plant has big leaves on it
  3. A tall green plant in a black planter on a side table.
  4. In this picture is a live plant in flower pot
  5. Plant with green long stem and big leaf placed on black bucket

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators
