METHOD AND SYSTEM OF SELECTING WORD SEQUENCE FOR TEXT WRITTEN IN LANGUAGE WITHOUT WORD BOUNDARY MARKERS
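Patent holder:
Alibaba Group Holding Limited
Inventor:
DAI, Neng
Application number:
EP20090836668
Publication number:
EP2370909(A4)
Filing date:
2009.12.04
Country (region) of application:
European Patent Office
Year:
2018
Agent:
Abstract: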
The present disclosure describes a method and apparatus for selecting a word sequence for text written in a language without word boundary markers, addressing the excessive computational load that existing technologies incur when selecting an optimal word sequence. The disclosed method includes: segmenting a segment of the text to obtain different word sequences; determining a common word boundary for the word sequences; and performing optimal word sequence selection on the portions of the word sequences that precede the common word boundary. Because selection is performed only on the portions preceding a common word boundary, the text breaks into shorter independent units, which reduces the computational load of word segmentation.
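The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the patent's claimed implementation. The candidate generators (forward and backward maximum matching), the toy dictionary `VOCAB`, the scoring rule (fewest words, then fewest single-character words), and all function names are assumptions chosen only for illustration.

```python
from typing import List, Set, Tuple

# Hypothetical toy dictionary, for illustration only.
VOCAB: Set[str] = {"研究", "研究生", "生命", "起源"}


def forward_max_match(text: str, vocab: Set[str], max_len: int) -> List[str]:
    """Greedy left-to-right segmentation; unknown characters become 1-char words."""
    words, i = [], 0
    while i < len(text):
        for n in range(min(max_len, len(text) - i), 0, -1):
            if n == 1 or text[i:i + n] in vocab:
                words.append(text[i:i + n])
                i += n
                break
    return words


def backward_max_match(text: str, vocab: Set[str], max_len: int) -> List[str]:
    """Greedy right-to-left segmentation; unknown characters become 1-char words."""
    words, j = [], len(text)
    while j > 0:
        for n in range(min(max_len, j), 0, -1):
            if n == 1 or text[j - n:j] in vocab:
                words.append(text[j - n:j])
                j -= n
                break
    return words[::-1]


def boundaries(words: List[str]) -> Set[int]:
    """Character offsets at which a word ends in this word sequence."""
    ends, pos = set(), 0
    for w in words:
        pos += len(w)
        ends.add(pos)
    return ends


def common_boundaries(candidates: List[List[str]]) -> List[int]:
    """Offsets that are word boundaries in every candidate word sequence."""
    return sorted(set.intersection(*[boundaries(c) for c in candidates]))


def slice_words(words: List[str], start: int, end: int) -> List[str]:
    """Words of one candidate that lie inside the character span [start, end)."""
    out, pos = [], 0
    for w in words:
        if pos >= start and pos + len(w) <= end:
            out.append(w)
        pos += len(w)
    return out


def score(piece: List[str]) -> Tuple[int, int]:
    """Assumed preference: fewer words, then fewer single-character words."""
    return (len(piece), sum(1 for w in piece if len(w) == 1))


def select_word_sequence(text: str, vocab: Set[str], max_len: int = 3) -> List[str]:
    # Step 1: segment the text to obtain different word sequences.
    candidates = [
        forward_max_match(text, vocab, max_len),
        backward_max_match(text, vocab, max_len),
    ]
    # Step 2: determine the word boundaries shared by all sequences.
    result, start = [], 0
    for end in common_boundaries(candidates):
        # Step 3: choose the best portion before each common boundary,
        # independently of the rest of the text.
        pieces = [slice_words(c, start, end) for c in candidates]
        result.extend(min(pieces, key=score))
        start = end
    return result


print(select_word_sequence("研究生命起源", VOCAB))
# ['研究', '生命', '起源']
```

In this example the two candidates disagree only in the first four characters, so the common boundaries at offsets 4 and 6 split the text into two units that are scored separately rather than comparing whole-text sequences.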
Source website:
China Knowledge Centre for Engineering Sciences and Technology (中国工程科技知识中心)
Source URL:
http://www.ckcest.cn/home/
