python中lcut什么意思_Python posseg.lcut方法代碼示例

本文整理匯總了Python中jieba.posseg.lcut方法的典型用法代碼示例。如果您正苦於以下問題:Python posseg.lcut方法的具體用法?Python posseg.lcut怎麽用?Python posseg.lcut使用的例子?那麽恭喜您, 這裏精選的方法代碼示例或許可以為您提供幫助。您也可以進一步了解該方法所在模塊jieba.posseg的用法示例。

在下文中一共展示了posseg.lcut方法的9個代碼示例,這些例子默認根據受歡迎程度排序。您可以為喜歡或者感覺有用的代碼點讚,您的評價將有助於我們的係統推薦出更棒的Python代碼示例。

示例1: extract_dictionary_feature

​點讚 3

# 需要導入模塊: from jieba import posseg [as 別名]

# 或者: from jieba.posseg import lcut [as 別名]

def extract_dictionary_feature(file_name, col_tag=0, col_content=1):

# ????

adv = codecs.open('./data/vocabulary/adv.txt', 'rb', encoding='utf-8').read().split('\n')

inverse = codecs.open('./data/vocabulary/inverse.txt', 'rb', encoding='utf-8').read().split('\n')

negdict = codecs.open('./data/vocabulary/negdict.txt', 'rb', encoding='utf-8').read().split('\n')

posdict = codecs.open('./data/vocabulary/posdict.txt', 'rb', encoding='utf-8').read().split('\n')

contents = pd.read_excel(file_name, header=None)

print 'cut words...'

cw = lambda x: [pair for pair in psg.lcut(x) if pair.word not in stopwords]

contents['pairs'] = contents[col_content].apply(cw)

matrix = reviews2matrix(list(contents['pairs']), posdict, negdict, inverse, adv)

x = matrix2vec(matrix)

y = list(contents[col_tag])

return x, y

開發者ID:wac81,項目名稱:Book_DeepLearning_Practice,代碼行數:18,

示例2: delNOTNeedWords

​點讚 3

# 需要導入模塊: from jieba import posseg [as 別名]

# 或者: from jieba.posseg import lcut [as 別名]

def delNOTNeedWords(content,customstopwords=None):

# words = jieba.lcut(content)

if customstopwords == None:

customstopwords = "stopwords.txt"

import os

if os.path.exists(customstopwords):

stop_words = codecs.open(customstopwords, encoding='UTF-8').read().split(u'\n')

customstopwords = stop_words

result=''

return_words = []


本文来自互联网用户投稿,文章观点仅代表作者本人,不代表本站立场,不承担相关法律责任。如若转载,请注明出处。 如若内容造成侵权/违法违规/事实不符,请点击【内容举报】进行投诉反馈!

相关文章

立即
投稿

微信公众账号

微信扫一扫加关注

返回
顶部