SemanticScuttle - klotz.me » Tags: nlp+word2vec+text

A Word is Worth a Thousand Vectors | Stitch Fix Technology – Multithreaded

2015-08-26 Tags: nlp, word2vec by klotz

gensim: Topic modelling for humans

2015-08-26 Tags: gensim, similarity, nlp, word2vec by klotz

From word2vec to doc2vec: an approach driven by Chinese restaurant process | Kifi Engineering Blog

2016-03-15 Tags: word2vec, nlp by klotz

Byte Rot: Five crazy abstractions my Deep Learning word2vec model just did

2016-03-15 Tags: word2vec, nlp by klotz

word2vec usage questions : MachineLearning

2016-03-15 Tags: word2vec, context, nlp by klotz

Voynich Manuscript: word vectors and t-SNE visualization of some patterns | Pyevolve

2016-03-30 Tags: voynich, nlp, word embedding, word2vec by klotz

NLP Lecture 7 - Lexical Semantics and Word Embeddings

Word embeddings are suitable for use with neural network language models (as will be discussed later); they can also be used to enhance conventional feature-based models (MEMM, CRF). The best ways to incorporate embeddings into such feature-based models are still being explored. The simplest approach uses the vector components directly as features (Turian et al., Word Representations: A Simple and General Method for Semi-Supervised Learning, ACL 2010; Nguyen and Grishman, ACL 2014). Less direct approaches include building clusters from the embeddings and using the clusters as features, or selecting prototypical examples of each type and using embedding similarity to these prototypes as features. Early results on NE tagging indicate a small advantage for the indirect methods (Guo et al., Revisiting Embedding Features for Simple Semi-supervised Learning, EMNLP 2014). Models based on word embeddings are producing the best performance on named entity recognition (Passos et al., Lexicon Infused Phrase Embeddings for Named Entity Resolution, CoNLL 2014) and are effective for chunking (Turian et al., ACL 2010).
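The three feature strategies named in this description (vector components, cluster membership, similarity to prototypes) can be sketched roughly as follows. This is a minimal illustration and not code from the lecture: the toy vectors, the two centroids (standing in for the output of a real clustering step such as k-means), the prototype words, and all function names are assumptions made up for the example.

```python
import math

# Toy 3-dimensional embeddings; the values are invented for illustration.
EMBEDDINGS = {
    "london": [0.9, 0.1, 0.0],
    "paris": [0.8, 0.2, 0.1],
    "monday": [0.1, 0.9, 0.0],
    "friday": [0.0, 0.8, 0.2],
}

# Assumed cluster centroids, as if produced earlier by a clustering step.
CENTROIDS = {"c0": [0.85, 0.15, 0.05], "c1": [0.05, 0.85, 0.10]}

# Hypothetical prototypes: one hand-picked example word per entity type.
PROTOTYPES = {"LOCATION": "london", "DAY": "monday"}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def direct_features(word):
    """Direct approach: every vector component is a real-valued feature."""
    return {f"emb_{i}": x for i, x in enumerate(EMBEDDINGS[word])}


def cluster_feature(word):
    """Indirect approach 1: the id of the most similar centroid."""
    vec = EMBEDDINGS[word]
    best = max(CENTROIDS, key=lambda c: cosine(vec, CENTROIDS[c]))
    return {"cluster": best}


def prototype_features(word):
    """Indirect approach 2: similarity to each type's prototype word."""
    vec = EMBEDDINGS[word]
    return {
        f"sim_{label}": cosine(vec, EMBEDDINGS[proto])
        for label, proto in PROTOTYPES.items()
    }


if __name__ == "__main__":
    for w in ("paris", "friday"):
        print(w, direct_features(w), cluster_feature(w), prototype_features(w))
```

The cluster feature is discrete and the other two are real-valued, so a feature-based tagger would typically treat them differently; how best to do that is exactly the open question the description raises.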

2016-04-01 Tags: nlp, word embedding, word2vec, wordnet, clustering by klotz

DIY-Data-Science/gensim.md at master · jxieeducation/DIY-Data-Science · GitHub

2016-05-03 Tags: word2vec, howto, gensim, python, nlp by klotz

Making sense of word2vec | RaRe Technologies

2016-05-03 Tags: word2vec, nlp by klotz

Exploring Word Embedding for Drug Name Recognition

Isabel Segura-Bedmar, Víctor Suárez-Paniagua, Paloma Martínez
Computer Science Department
University Carlos III of Madrid, Spain

This paper describes a machine learning-based approach that uses word embedding features to recognize drug names from biomedical texts. As a starting point, we developed a baseline system based on Conditional Random Field (CRF) trained with standard features used in current Named Entity Recognition (NER) systems. Then, the system was extended to incorporate new features, such as word vectors and word clusters generated by the Word2Vec tool and a lexicon feature from the DINTO ontology. We trained the Word2Vec tool over two different corpora: Wikipedia and MedLine. Our main goal is to study the effectiveness of using word embeddings as features to improve performance on our baseline system, as well as to analyze whether the DINTO ontology could be a valuable complementary data source integrated in a machine learning NER system. To evaluate our approach and compare it with previous work, we conducted a series of experiments on the dataset of SemEval-2013 Task 9.1 Drug Name Recognition.
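
As a rough illustration of the feature setup the abstract describes (standard NER features extended with word vectors, word clusters, and a lexicon flag), the sketch below builds one feature dictionary per token. It is not the authors' system: the tiny vector table, cluster ids, and lexicon are invented stand-ins for Word2Vec output and the DINTO ontology, and the feature names are hypothetical. A list of dictionaries per sentence is a common input layout for CRF toolkits, which is assumed here rather than taken from the paper.

```python
# Invented stand-ins for resources the paper obtains from Word2Vec and DINTO.
WORD_VECTORS = {
    "aspirin": [0.7, 0.2],
    "ibuprofen": [0.6, 0.3],
    "patient": [0.1, 0.9],
}
WORD_CLUSTERS = {"aspirin": 3, "ibuprofen": 3, "patient": 7}
DRUG_LEXICON = {"aspirin", "ibuprofen"}


def token_features(tokens, i):
    """Build the feature dictionary for the token at position i."""
    word = tokens[i]
    key = word.lower()
    feats = {
        # Standard surface features of the kind used in baseline NER systems.
        "lower": key,
        "is_title": word.istitle(),
        "suffix3": key[-3:],
        "prev": tokens[i - 1].lower() if i > 0 else "<BOS>",
        # Lexicon feature: is the token listed as a drug name?
        "in_lexicon": key in DRUG_LEXICON,
    }
    # Cluster feature: a discrete id, emitted only for known words.
    if key in WORD_CLUSTERS:
        feats["cluster"] = str(WORD_CLUSTERS[key])
    # Word-vector features: one real-valued feature per dimension.
    for d, x in enumerate(WORD_VECTORS.get(key, [])):
        feats[f"vec_{d}"] = x
    return feats


def sentence_features(tokens):
    """Feature dictionaries for every token in a sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


if __name__ == "__main__":
    for feats in sentence_features(["Aspirin", "helped", "the", "patient"]):
        print(feats)
```

Out-of-vocabulary tokens simply receive no cluster or vector features here; a real system would need an explicit policy for them.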

2016-05-18 Tags: word embedding, word2vec, papers by klotz
