http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
SSF: Sentence Similar Function Based on word2vector Similar Elements
Yuan, Xinpan,Wang, Songlin,Wan, Lanjun,Zhang, Chengyuan Korea Information Processing Society 2019 Journal of information processing systems Vol.15 No.6
In this paper, to improve the accuracy of long sentence similarity calculation, we proposed a sentence similarity calculation method based on a system similarity function. The algorithm uses word2vector as the system elements to calculate the sentence similarity. The higher accuracy of our algorithm is derived from two characteristics: one is the negative effect of penalty item, and the other is that sentence similar function (SSF) based on word2vector similar elements doesn't satisfy the exchange rule. In later studies, we found the time complexity of our algorithm depends on the process of calculating similar elements, so we build an index of potentially similar elements when training the word vector process. Finally, the experimental results show that our algorithm has higher accuracy than the word mover's distance (WMD), and has the least query time of three calculation methods of SSF.
SSF: Sentence Similar Function Based on word2vector Similar Elements
Xinpan Yuan,Songlin Wang,Lanjun Wan,Chengyuan Zhang 한국정보처리학회 2019 Journal of information processing systems Vol.15 No.6
In this paper, to improve the accuracy of long sentence similarity calculation, we proposed a sentence similaritycalculation method based on a system similarity function. The algorithm uses word2vector as the systemelements to calculate the sentence similarity. The higher accuracy of our algorithm is derived from twocharacteristics: one is the negative effect of penalty item, and the other is that sentence similar function (SSF)based on word2vector similar elements doesn’t satisfy the exchange rule. In later studies, we found the timecomplexity of our algorithm depends on the process of calculating similar elements, so we build an index ofpotentially similar elements when training the word vector process. Finally, the experimental results show thatour algorithm has higher accuracy than the word mover’s distance (WMD), and has the least query time ofthree calculation methods of SSF.
Research on Fault Diagnosis of Wind Power Generator Blade Based on SC-SMOTE and kNN
Cheng Peng,Qing Chen,Longxin Zhang,Lanjun Wan,Xinpan Yuan 한국정보처리학회 2020 Journal of information processing systems Vol.16 No.4
Because SCADA monitoring data of wind turbines are large and fast changing, the unbalanced proportion of data in various working conditions makes it difficult to process fault feature data. The existing methods mainly introduce new and nonrepeating instances by interpolating adjacent minority samples. In order to overcomethe shortcomings of these methods which does not consider boundary conditions in balancing data, an improved oversampling balancing algorithm SCSMOTE (safe circle synthetic minority oversampling technology) is proposed to optimize data sets. Then, for the balanced data sets, a fault diagnosis method based on improvedknearest neighbors (kNN) classification for wind turbine blade icing is adopted. Compared with the SMOTE algorithm, the experimental results show that the method is effective in the diagnosis of fan blade icing fault and improves the accuracy of diagnosis.