What to label |
All objects of the defined
categories,
unless:
If
this is not possible because too many objects, mark image as bad. |
Viewpoint |
Record
the viewpoint of the ‘bulk’ of the object e.g. the body rather than the head. Allow viewpoints within 10-20 degrees. If
ambiguous, leave as ‘Unspecified’. |
Bounding box |
Mark
the bounding box of the visible area of the object (not the estimated total extent of the object). Bounding
box should contain all visible pixels, except where the bounding box would
have to be made excessively large to include a few additional pixels (<5%)
e.g. a car aerial. |
Truncation |
If
more than 15-20% of the object lies outside the bounding box mark as
Truncated. The flag indicates that the bounding box does not cover the total
extent of the object. |
Occlusion |
If
more than 5% of the object is occluded within the bounding box, mark as
Occluded. The flag indicates that the object is not totally visible within
the bounding box. |
Image quality/
illumination |
Images
which are poor quality (e.g. excessive motion blur) should be marked
bad. However, poor illumination (e.g.
objects in silhouette) should not count as poor quality unless objects cannot
be recognised. Images
made up of multiple images (e.g. collages) should be marked bad. |
Clothing/mud/ snow etc. |
If
an object is ‘occluded’ by a close-fitting occluder e.g. clothing, mud, snow
etc., then the occluder should be treated as part of the object. |
Transparency |
Do
label objects visible through glass, but treat reflections on the glass as
occlusion. |
Mirrors |
Do
label objects in mirrors. |
Pictures |
Label
objects in pictures/posters/signs only if they are photorealistic but not if
cartoons, symbols etc. |
Aeroplane |
Includes
gliders but not hang gliders or helicopters |
Bicycle |
Includes
tricycles, unicycles |
Bird |
All
birds |
Boat |
Ships,
rowing boats, pedaloes but not jet skis |
Bottle |
Plastic,
glass or feeding bottles |
Bus |
Includes
minibus |
Car |
Includes
cars, vans, large family cars for 6-8 people etc. Excludes
go-carts, tractors, emergency vehicles, lorrys/trucks etc. Do
not label where only the vehicle interior is shown. Include
toys that look just like real cars, but not ‘cartoony’ toys. |
Cat |
Domestic
cats (not lions etc.) |
Chair |
Includes
armchairs, but not stools or benches |
Cow |
All
cows |
Dining table |
Only
tables for eating at. |
Dog |
Domestic
dogs (not wolves etc.) |
Horse |
Includes ponies, donkeys, mules etc. |
Motorbike |
Includes
mopeds, scooters, sidecars |
People |
Includes
babies, faces (i.e. truncated people) |
Potted plant |
Indoor
plants or outdoor plants clearly in a pot.
Excludes flowers in vases |
Sheep |
Sheep,
not goats |
Sofa |
Excludes
sofas made up as sofa-beds |
Train |
Includes
train carriages, excludes trams |
TV/monitor |
Standalone
screens (not laptops), not advertising displays |
What to segment |
Objects whose bounding boxes have been labelled according to the above guidelines. You may need to exclude backpacks, handbags etc. which
were included in the bounding box. |
Accuracy |
Segment
within 5 pixels. Labelled pixels MUST
be the object; This
may occasionally involve labelling pixels outside the bounding box. |
Mixed pixels |
Pixels
which are mixed e.g. due to transparency, motion blur or the presence of a
border should be considered to belong to the object whose colour contributes
most to the mix. |
Thin structures |
Aim
to capture thin structures where possible, within the accuracy constraints. Structures of around one pixel thickness
can be ignored e.g. wires, rigging. |
Objects on tables etc. |
If
a number of small objects are occluding an object e.g. cutlery/silverware on
a dining table, they can be considered part of that object. |
Difficult images |
Images
which are overly difficult to segment to the required accuracy can be left
unlabelled e.g. a nest of bicycles. |