Scalable Mobile Video Question-Answering System
with Locally Aggregated Descriptors and Random Projection
http://tinyurl.com/7df3e5n
建議事項:
(1)可以試著用youtube的相關影片來提高正確性
(2)control dataset避免找不到圖片
(3)evaluation可以用MAP
2011年12月11日 星期日
2011年12月8日 星期四
Final Project - Proposal
- Team Member
R99922107 李勻
R00944020 曾奕翔
R00921049 周怡廷
– Problem definition
We want to automatically add tags or descriptions to videos.
Summary for video is also possible
– Possible approaches
1. Using shot detection to extract some key frames.
2. Using google search to find the most familiar picture of the key frames.
3. Extract the text description from the results by using TF IDF or other feature.
4. A problem we need to solve is If we can't find any pictures familiar enough to all the query,
this approach may fail.
features:
1. Color histogram 2. Texture
– Possible evaluation methods
We can collect several different type videos, and for each type we examine whether the
labels our program generated are related or not. Since it is hard to know how many tags are
related to the video, we may focus on precision of our result.
R99922107 李勻
R00944020 曾奕翔
R00921049 周怡廷
– Problem definition
We want to automatically add tags or descriptions to videos.
Summary for video is also possible
– Possible approaches
1. Using shot detection to extract some key frames.
2. Using google search to find the most familiar picture of the key frames.
3. Extract the text description from the results by using TF IDF or other feature.
4. A problem we need to solve is If we can't find any pictures familiar enough to all the query,
this approach may fail.
features:
1. Color histogram 2. Texture
– Possible evaluation methods
We can collect several different type videos, and for each type we examine whether the
labels our program generated are related or not. Since it is hard to know how many tags are
related to the video, we may focus on precision of our result.
訂閱:
文章 (Atom)