您的位置: 首页 > 农业专利 > 详情页

名称を体系的に大量に正規化する方法
专利权人:
ワークデイ,インコーポレーテッド
发明人:
ギバーツ・ブラディミル,シーガル・バーゼル・クリフ
申请号:
JP20160531747
公开号:
JP6118468(B2)
申请日:
2014.07.21
申请国别(地区):
日本
年份:
2017
代理人:
摘要:
A method for normalizing raw titles to canonical titles is described. The method includes designating a set of canonical titles, generating a set of n-grams for each canonical title, assigning a set of attributes to each n-gram, assigning a set of labels to each of the attributes, and storing the labeled canonical title and labeled n-grams in a database. In some examples, a new title may be mapped to an existing canonical title in the database by generating a set of n-grams for the new title, looking up the n-grams in the database of canonical titles, retrieving the set of labels assigned to n-grams in the database that match n-grams from the new title, and assigning those labels to the corresponding attributes of the new title. The new title may then be mapped to a canonical title on the basis of similarly labeled attributes.
来源网站:
中国工程科技知识中心
来源网址:
http://www.ckcest.cn/home/

意 见 箱

匿名:登录

个人用户登录

找回密码

第三方账号登录

忘记密码

个人用户注册

必须为有效邮箱
6~16位数字与字母组合
6~16位数字与字母组合
请输入正确的手机号码

信息补充