A. Features
- 200-dimension vector representation.
- 213,118 english sentences in total.
- Access via this Link and will be continuously updated.
B. Case: To find similar word
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 |
>>> import gensim >>> model = gensim.models.Word2Vec.load("/home/tong/Desktop/w2v/janus-embedding-model") >>> for v in model.most_similar(positive=[u'scam'], topn=30): ... print v[0] fake idiots bs joke garbage payed trash lie legit Scam banned trust stupid shame rubbish waste rip obviously advertise steal bogus huh cheating fraud charged company charge con 😡 jelly >>> |