且构网

How do I use output_projection (weights, biases) in seq2seq?

Updated: 2023-12-02 12:11:34

The syntax is unclear to me: are they "creating" the weight and bias variables here, or are these variables that already exist and are only being retrieved?

The variables are created when the graph is first built. On each subsequent batch the same parameters are reused and updated by training; they are not created again.

Using output_projection

It is not exactly the logits; it is the projected output. The output projection variables are used to project the output of the previous cell, and that projected output is then combined with the current cell's inputs.

projected_output = tf.matmul(outputs, w) + b
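
To make the shapes in the line above concrete, here is a minimal sketch of the same computation in plain Python, without TensorFlow. The `project` helper, the tiny example values, and the layout (outputs as [batch, hidden], w as [hidden, vocab], b as [vocab]) are assumptions chosen for illustration, not taken from the original code.

```python
def project(outputs, w, b):
    """Compute outputs @ w + b for row-major nested lists.

    Assumed shapes: outputs [batch, hidden], w [hidden, vocab], b [vocab].
    """
    hidden = len(w)
    vocab = len(b)
    projected = []
    for row in outputs:
        if len(row) != hidden:
            raise ValueError("outputs width must match the number of rows in w")
        # One projected row: dot product with each column of w, plus the bias.
        projected.append([
            sum(row[k] * w[k][j] for k in range(hidden)) + b[j]
            for j in range(vocab)
        ])
    return projected


outputs = [[1.0, 2.0], [0.0, 1.0]]       # batch of 2, hidden size 2
w = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0]]   # hidden size 2 -> 3 output units
b = [0.5, 0.5, 0.5]
projected_output = project(outputs, w, b)
print(projected_output)  # [[1.5, 2.5, 4.5], [0.5, 1.5, 1.5]]
```

Each row of the result has one entry per column of w, so the projection changes the width of the cell output from the hidden size to the size of the projection target.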