Invalidity search model for chinese patent based on the claim structure information
LIU Yuqin,WANG Xuefeng,LV Lin
(School of Management & Economics, Beijing Institute of Technology, Beijing 100081, China)
Abstract:Chinese patent independent claim contains a preamble portion and a characterizing portion. Invalidity search model for Chinese patent proposed in the paper draws on the structure information. Forty split words are extracted from patent database artificially; these words can divide independent claims into preamble portion and characterizing portion effectively and automatically. For it is impossible to compute similarity on the whole database twostep search method is used in practice: at 1step Boolean query is applied to improve recall, at 2step vector space model is used to compute similarities of preamble portion and characterizing portion between applying patent (query) and previous patents (documents) obtained at 1step respectively, and then combines them properly to sort the search results in order to improve precision. Experiment data set comes from SIPO; search results with split claims are contrasted with that without them; different methods of termweighting are compared. Evaluation results show that the model works well.......