且构网

Using pre-trained word embeddings in a Keras model?

Updated: 2023-12-02 19:54:16

I think your doubt is valid. The example code was changed in this commit so that it keeps the word with index = MAX_NUM_WORDS. Before that, there was a commit to Tokenizer that made it keep num_words words instead of num_words - 1 words, but that change to Tokenizer was reverted afterwards. So I guess the author of the example update assumed that Tokenizer kept num_words words when the update was committed.
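To make the off-by-one concrete, here is a minimal pure-Python sketch. It does not call Keras; build_word_index and texts_to_sequences are hypothetical stand-ins that only mimic the rule described above (indices start at 1, and only words with index below num_words survive), and the tiny MAX_NUM_WORDS value and sample text are made up for illustration.

```python
from collections import Counter

MAX_NUM_WORDS = 4  # hypothetical small cap, chosen only for illustration


def build_word_index(texts):
    """Rank words by frequency; indices start at 1 (0 is left for padding)."""
    counts = Counter(word for text in texts for word in text.lower().split())
    ranked = sorted(counts.items(), key=lambda kv: -kv[1])
    return {word: rank for rank, (word, _) in enumerate(ranked, start=1)}


def texts_to_sequences(texts, word_index, num_words):
    """Assumed filtering rule: keep only words whose index is below num_words."""
    return [
        [word_index[w] for w in text.lower().split() if word_index[w] < num_words]
        for text in texts
    ]


# Five distinct words with distinct frequencies: a=5, b=4, c=3, d=2, e=1.
texts = ["a a a a a b b b b c c c d d e"]
word_index = build_word_index(texts)
sequences = texts_to_sequences(texts, word_index, MAX_NUM_WORDS)

# Indices that can actually reach the embedding layer.
kept_indices = sorted({i for seq in sequences for i in seq})

# Rows an embedding matrix would fill if the loop kept i <= MAX_NUM_WORDS.
rows_inclusive = {i for i in word_index.values() if i <= MAX_NUM_WORDS}

# Rows that get filled but are never looked up.
unused_rows = rows_inclusive - set(kept_indices)

print("kept indices:", kept_indices)
print("filled but never used:", sorted(unused_rows))
```

Under these assumptions only num_words - 1 distinct words ever appear in the sequences, so a matrix row filled for index = MAX_NUM_WORDS is harmless but never used.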