LAMP     The Language and Media Processing Laboratory
  • Format used in the ViPER suite
  • Originally defined for evaluation purposes
  • First uses include:
    • Text detection
    • Face detection
    • Person detection
A Descriptor
  • It is a record describing some element of the video.
  • It is an object that conforms to a user-defined schema.
  • It is composed of several named, typed attributes.
  • It has a unique id and an associated span in which it is valid.
  • It is one of three types: File, Content, or Object.
    File: refers to data that reflects the video as a whole, or other metadata about the video, such as file format and frame rate.
    Content: instances of this type may only occur one at a time, and no instance may change over the course of its life. Each instance has a time span and a set of attributes.
    Object: refers to an object that may have many instances at any given time, and whose instances may change over time.
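
The descriptor model above can be sketched in a few lines of Python. This is only an illustration, not ViPER's own API: the class and function names are hypothetical, spans are assumed to be inclusive frame ranges, and "one at a time" is read here as "two Content instances of the same name may not overlap in time".

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Any, Dict, List, Tuple


class DescriptorType(Enum):
    FILE = "File"
    CONTENT = "Content"
    OBJECT = "Object"


@dataclass
class Descriptor:
    """One record describing some element of the video (hypothetical model)."""
    name: str                       # schema entry this record conforms to
    kind: DescriptorType            # File, Content, or Object
    id: int                         # unique id
    span: Tuple[int, int]           # assumed inclusive (first_frame, last_frame)
    attributes: Dict[str, Any] = field(default_factory=dict)


def spans_overlap(a: Tuple[int, int], b: Tuple[int, int]) -> bool:
    """True if two inclusive frame ranges share at least one frame."""
    return a[0] <= b[1] and b[0] <= a[1]


def content_conflicts(descriptors: List[Descriptor]) -> List[Tuple[int, int]]:
    """Return id pairs of same-named Content descriptors whose spans overlap."""
    content = [d for d in descriptors if d.kind is DescriptorType.CONTENT]
    conflicts = []
    for i, first in enumerate(content):
        for second in content[i + 1:]:
            if first.name == second.name and spans_overlap(first.span, second.span):
                conflicts.append((first.id, second.id))
    return conflicts


records = [
    Descriptor("Shot", DescriptorType.CONTENT, 0, (1, 50)),
    Descriptor("Shot", DescriptorType.CONTENT, 1, (40, 90)),   # overlaps Shot 0
    Descriptor("Person", DescriptorType.OBJECT, 0, (1, 90)),
    Descriptor("Person", DescriptorType.OBJECT, 1, (1, 90)),   # allowed for Object
]
print(content_conflicts(records))  # prints [(0, 1)]
```

Only the two overlapping "Shot" records are reported; the two simultaneous "Person" records are fine because Object descriptors may have many instances at any given time.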
An Attribute
  • Each descriptor has several attributes.
  • An attribute can be one of several data types:
    strings of characters
    enumerated value: one of several user-defined words
    one of several different shapes: bbox, polygon, etc.
    reference to another descriptor
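
As a rough sketch of how typed attributes might be checked against a schema, assuming hypothetical type labels ("string", "enum", "bbox", "reference") and a bounding box stored as an (x, y, width, height) tuple of integers. None of these names come from ViPER itself.

```python
from dataclasses import dataclass
from typing import Any, Tuple


@dataclass
class AttributeDef:
    """Schema entry for one named, typed attribute (hypothetical model)."""
    name: str
    data_type: str                   # "string", "enum", "bbox", or "reference"
    allowed: Tuple[str, ...] = ()    # permitted words, used only for "enum"


def is_valid(defn: AttributeDef, value: Any) -> bool:
    """Check a value against the attribute's declared data type."""
    if defn.data_type == "string":
        return isinstance(value, str)
    if defn.data_type == "enum":
        return value in defn.allowed
    if defn.data_type == "bbox":
        # Assumed layout: (x, y, width, height) with non-negative size.
        return (
            isinstance(value, tuple)
            and len(value) == 4
            and all(isinstance(v, int) for v in value)
            and value[2] >= 0
            and value[3] >= 0
        )
    if defn.data_type == "reference":
        # A reference is modeled here as the id of another descriptor.
        return isinstance(value, int)
    raise ValueError(f"unknown data type: {defn.data_type}")


orientation = AttributeDef("orientation", "enum", allowed=("frontal", "profile"))
location = AttributeDef("location", "bbox")

print(is_valid(orientation, "profile"))        # prints True
print(is_valid(orientation, "sideways"))       # prints False
print(is_valid(location, (10, 20, 64, 128)))   # prints True
```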
The File Format
  • Simple XML-based format.
  • The config section defines the descriptors.
  • The data section instantiates descriptors for one or more media files.


viper structure

  1. viper
    1. config
      • descriptor definition
      • descriptor definition
    2. data
      • sourcefile
        • descriptor
      • sourcefile
        • descriptor
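
The nesting in this outline can be reproduced with Python's standard `xml.etree.ElementTree` module. The element names `viper`, `config`, `data`, `sourcefile` and `descriptor` are taken from the outline; the XML attribute names (`name`, `type`, `filename`, `id`, `span`), the span notation and the file names are assumptions made for illustration and may not match the actual ViPER schema.

```python
import xml.etree.ElementTree as ET


def build_skeleton(definitions, files):
    """Build a viper > config/data tree mirroring the outline above.

    definitions: list of (descriptor_name, descriptor_type) pairs
    files: dict mapping a media file name to a list of
           (descriptor_name, id, span) instances
    """
    root = ET.Element("viper")

    # config section: one definition per descriptor in the schema
    config = ET.SubElement(root, "config")
    for name, kind in definitions:
        ET.SubElement(config, "descriptor", {"name": name, "type": kind})

    # data section: one sourcefile per media file, holding its instances
    data = ET.SubElement(root, "data")
    for filename, instances in files.items():
        sourcefile = ET.SubElement(data, "sourcefile", {"filename": filename})
        for name, ident, span in instances:
            ET.SubElement(
                sourcefile,
                "descriptor",
                {"name": name, "id": str(ident), "span": span},
            )
    return root


root = build_skeleton(
    [("Person", "Object"), ("Text", "Object")],
    {
        "clip1.mpg": [("Person", 0, "1:90")],
        "clip2.mpg": [("Text", 0, "5:20")],
    },
)
print(ET.tostring(root, encoding="unicode"))
```

Keeping the definitions in `config` separate from the instances in `data` means one schema can be shared by every `sourcefile` in the same document.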