I used gensim to build a word2vec embedding of my corpus. Currently I'm converting my (padded) input sentences into word vectors using the gensim model. These vectors are then used as input for the model.
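Roughly, the conversion looks like this (a simplified sketch; `w2v_model`, `sentences_to_vectors`, and `training_sentences` are placeholder names, not my actual code):

import numpy as np

# One (MAX_SEQUENCE_LENGTH, dim) matrix per sentence; padding and
# out-of-vocabulary positions stay all-zero so the Masking layer can skip them.
def sentences_to_vectors(sentences, w2v_model, max_len, dim):
    out = np.zeros((len(sentences), max_len, dim), dtype=np.float32)
    for i, sentence in enumerate(sentences):
        for j, token in enumerate(sentence[:max_len]):
            if token in w2v_model.wv:
                out[i, j] = w2v_model.wv[token]
    return out

training_sentences_vectors = sentences_to_vectors(
    training_sentences, w2v_model, MAX_SEQUENCE_LENGTH, dim)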
from keras.models import Sequential
from keras.layers import Masking, Bidirectional, LSTM

model = Sequential()
# Mask the all-zero padding vectors so the LSTM ignores them
model.add(Masking(mask_value=0.0, input_shape=(MAX_SEQUENCE_LENGTH, dim)))
model.add(Bidirectional(
    LSTM(num_lstm, dropout=0.5, recurrent_dropout=0.4, return_sequences=True)
))
...
model.fit(training_sentences_vectors, training_labels, validation_data=validation_data)
Are there any drawbacks to using the word vectors directly, without a Keras Embedding layer?
I'm also currently adding additional (one-hot encoded) tags to the input tokens by concatenating them to each word vector. Does this approach make sense?
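For illustration, the per-token concatenation looks like this (a sketch; `word_vector` and `tag_one_hot` are hypothetical names for one token's embedding and its one-hot tag vector):

import numpy as np

# The per-token input width becomes dim + num_tags, so the Masking layer's
# input_shape has to use that combined size.
token_input = np.concatenate([word_vector, tag_one_hot])  # shape: (dim + num_tags,)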
In your current setup, the drawback is that the word vectors are fixed inputs: you cannot make them trainable, so you cannot fine-tune them for your task.
What I mean by this is that gensim has only learned a language model: it understands your corpus and its contents, but it knows nothing about the downstream task you are using Keras for. The rest of your model's weights will still adapt to the task, but you will likely see better performance if you extract the embeddings from gensim, use them to initialize a Keras Embedding layer, and pass in word indices instead of word vectors as input.
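A minimal sketch of that approach (assuming a gensim 4.x model named `w2v_model`; in gensim 3.x use `wv.index2word` instead of `wv.index_to_key`):

import numpy as np
from keras.models import Sequential
from keras.layers import Embedding, Bidirectional, LSTM

# Build an index -> vector matrix from the gensim model; row 0 is reserved
# for padding so that mask_zero=True can ignore padded positions.
vocab = w2v_model.wv.index_to_key
word_index = {word: i + 1 for i, word in enumerate(vocab)}
embedding_matrix = np.zeros((len(vocab) + 1, dim))
for word, i in word_index.items():
    embedding_matrix[i] = w2v_model.wv[word]

model = Sequential()
model.add(Embedding(input_dim=len(vocab) + 1,
                    output_dim=dim,
                    weights=[embedding_matrix],  # initialize with the gensim vectors
                    mask_zero=True,              # replaces the explicit Masking layer
                    trainable=True,              # lets the vectors be fine-tuned
                    input_length=MAX_SEQUENCE_LENGTH))
model.add(Bidirectional(
    LSTM(num_lstm, dropout=0.5, recurrent_dropout=0.4, return_sequences=True)
))
# ... rest of the model as before; the inputs are now padded sequences of
# integer word indices (0 = padding), not sequences of word vectors.

If you want to keep the pretrained vectors fixed at first, you can set trainable=False and unfreeze the Embedding layer later.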