节点文献

一种多模态跨媒体检索的融媒体影视系统

A film and television media convergence system based on multimodal cross-media retrieval

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 李春芳刘永久王楷翔杨睿张凌飞李敏邓智铭石民勇

【Author】 LI Chunfang;LIU Yongjiu;WANG Kaixiang;YANG Rui;ZHANG Lingfei;LI Min;DENG Zhiming;SHI Minyong;School of Computer and Cyber Sciences, Communication University of China;

【机构】 中国传媒大学计算机与网络空间安全学院

【摘要】 视频是最有影响力的传播媒介,然而其非线性检索仍然困难。本文创新性工作包括:基于图像识别提取字幕,基于卷积神经网络识别人脸,通过字幕和人脸解决了影视视频的非线性检索问题;从字幕文本提取重要实体,用海量知识库和电子书补全影视关联知识,构建了文本、电子书和视频融合的跨媒体应用;以字幕词云和人物实体词云,实现影视的概览理解和检索导航;以众包实现字幕、电子书、人脸和实体信息的修正。以近代史献礼电影、中国诗词大会和科技纪录片为例系统完整地实现了一个示范性融媒体影视系统。

【Abstract】 Video is the most influential media, but it.s difficult to nonlinearly search video content. The creative work of this paper includes: Based on image processing to recognize video subtitle and convolutional neural networks to recognize faces of characters, the problem of film and TV video nonlinear retrieval is solved. Further, we extract important entities from subtitle text and enhance their relevant knowledge with large scale knowledge base and e-books, which constructs a cross-media application system of video, text, and e-book. Word cloud of subtitles and character entities are designed to facilitate video overview understanding and navigating retrieval. Crowdsourcing technology is used to update the amendments of subtitles, e-books, face recognition and entities information. A typical crossmedia convergence system are completely implemented including movies in modern history, conference of the Chinese poetry, and information technology documentary video.

【基金】 国家社科基金艺术学项目资助(18BC034);中央高校基本科研业务费资助(CUC210A008)
  • 【文献出处】 中国传媒大学学报(自然科学版) ,Journal of Communication University of China(Science and Technology) , 编辑部邮箱 ,2021年04期
  • 【分类号】J90-05;TP391.41
  • 【被引频次】1
  • 【下载频次】248
节点文献中: 

本文链接的文献网络图示:

本文的引文网络