Keras[文本多分类]训练和测试的准确性好，但预测能力差

stop_words = set(stopwords.words('english')) for index, row in data.iterrows(): print ("Index: ", index) txt_clean = ' '.join(re.sub("([^a-zA-Z ])", " ", data.loc[index,'txt_clean']).split()).lower() word_tokens = word_tokenize(txt_clean) filtered_sentence = [w for w in word_tokens if not w in stop_words] cleaned_text = '' for w in filtered_sentence: cleaned_text = cleaned_text + ' ' + w data.loc[index,'txt_clean'] = cleaned_text

model = Sequential() model.add(Embedding(50000, 100, input_length=500)) model.add(SpatialDropout1D(0.2)) model.add(LSTM(150, dropout=0.2, recurrent_dropout=0.2)) model.add(Dense(6, activation='softmax')) model.summary() model.compile(loss='categorical_crossentropy', optimizer='rmsprop', metrics=['accuracy']) history = model.fit(X_train, Y_train, epochs=epochs, batch_size=batch_size, validation_split=0.1) accr = model.evaluate(X_test,Y_test) print('Test set\n Loss: {:0.3f}\n Accuracy: {:0.3f}'.format(accr[0],accr[1]))

model = load_model('model.h5') data = data.sample(n=15000) model.compile(optimizer = 'rmsprop', loss = 'categorical_crossentropy', metrics = ['accuracy']) tokenizer = Tokenizer(num_words=50000) tokenizer.fit_on_texts(data['txt_clean'].values) (Prediction data sample values and not the same as in the training)) CATEGORIES = ['A','B','C','D','E','F'] for index, row in data.iterrows(): seq = tokenizer.texts_to_sequences([data.loc[index,'txt_clean']]) padded = pad_sequences(seq, maxlen=500) pred = model.predict(padded) pred = pred[0] print (pred, pred[np.argmax(pred)]))

2条回答

网友

1楼 · 编辑于 2024-04-24 12:44:25

请尝试使用pickle或joblib对您的标记器进行酸洗，以保存您的keras标记器使用它进行训练和预测。你知道吗

下面是保存keras标记器的示例代码：

import pickle

# saving
with open('tokenizer.pickle', 'wb') as handle:
    pickle.dump(tokenizer, handle, protocol=pickle.HIGHEST_PROTOCOL)

# loading
with open('tokenizer.pickle', 'rb') as handle:
    tokenizer = pickle.load(handle)

网友

2楼 · 编辑于 2024-04-24 12:44:25

这是ML或DL中存在的一个经典问题。可能有几个原因

过度拟合，尝试使模型更深入或添加一些规范化。你知道吗
训练和测试数据集是不同的
使用与培训期间相同的预处理步骤
类不平衡，即训练数据集包含测试数据集所缺少的特定类的更多数据。你知道吗
尝试使用双向LSTM或GRU改变模型架构

相关问题更多 >

编程相关推荐

热门问题

热门文章