From downloaded WebPages, we extracted content information, such as title, key words, category, time and text, by means of IE.

 
  • 利用信息抽取的方法,从下载的网页中抽取得到语料库建库所需的内容信息,如标题、关键词、类别、时间、正文等。
今日热词
目录 附录 查词历史