Multimedia data differs from text data, which can take advantage of literal information such as indexes, abstracts, and keywords. Video offers no intermediate level between the whole file, its coarsest granularity, and the individual frame, its finest granularity. The traditional way to browse a video is to play it sequentially, following the timestamp of each frame. With this method, users spend too much time before they find the part that interests them. The lack of intelligent skipping makes for a poor experience ...