EduNLP.I2V¶
It just a api, so you shouldn’t use it directly. If you want to get vector from item, you can use other model like D2V and W2V.
- param tokenizer
the tokenizer name
- type tokenizer
str
- param t2v
the name of token2vector model
- type t2v
str
- param args
the parameters passed to t2v
- param tokenizer_kwargs
the parameters passed to tokenizer
- type tokenizer_kwargs
dict
- param pretrained_t2v
- type pretrained_t2v
bool
- param kwargs
the parameters passed to t2v
Examples
>>> item = {"如图来自古希腊数学家希波克拉底所研究的几何图形.此图由三个半圆构成,三个半圆的直径分别为直角三角形$ABC$的斜边$BC$, ... 直角边$AB$, $AC$.$\bigtriangleup ABC$的三边所围成的区域记为$I$,黑色部分记为$II$, 其余部分记为$III$.在整个图形中随机取一点, ... 此点取自$I,II,III$的概率分别记为$p_1,p_2,p_3$,则$\SIFChoice$$\FigureID{1}$"}
>>> model_path = "examples/test_model/test_gensim_luna_stem_tf_d2v_256.bin"
>>> i2v = D2V("text","d2v",filepath=model_path, pretrained_t2v = False)
>>> i2v(item)
([array([ ...dtype=float32)], None)
- returns
i2v model
- rtype
I2V
- EduNLP.I2V.i2v.I2V.tokenize(self, items, indexing=True, padding=False, key=<function I2V.<lambda>>, *args, **kwargs) list¶
tokenize item
EduNLP.I2V.D2V¶
Examples
>>> item = {"如图来自古希腊数学家希波克拉底所研究的几何图形.此图由三个半圆构成,三个半圆的直径分别为直角三角形$ABC$的斜边$BC$, ... 直角边$AB$, $AC$.$\bigtriangleup ABC$的三边所围成的区域记为$I$,黑色部分记为$II$, 其余部分记为$III$.在整个图形中随机取一点, ... 此点取自$I,II,III$的概率分别记为$p_1,p_2,p_3$,则$\SIFChoice$$\FigureID{1}$"}
>>> model_path = "examples/test_model/test_gensim_luna_stem_tf_d2v_256.bin"
>>> i2v = D2V("text","d2v",filepath=model_path, pretrained_t2v = False)
>>> i2v(item)
([array([ ...dtype=float32)], None)
- returns
i2v model
- rtype
I2V
- EduNLP.I2V.i2v.D2V.infer_vector(self, items, tokenize=True, indexing=False, padding=False, key=<function D2V.<lambda>>, *args, **kwargs) tuple¶
- Parameters
items (str) –
tokenize –
indexing –
padding –
key –
args –
kwargs –
- Returns
- Return type
vector
- EduNLP.I2V.i2v.D2V.tokenize(self, items, indexing=True, padding=False, key=<function I2V.<lambda>>, *args, **kwargs) list¶
tokenize item
EduNLP.I2V.W2V¶
Examples
>>> i2v = get_pretrained_i2v("test_w2v", "examples/test_model/data/w2v")
>>> item_vector, token_vector = i2v(["有学者认为:‘学习’,必须适应实际"])
>>> item_vector
array([[...]], dtype=float32)
- returns
i2v model
- rtype
W2V
- EduNLP.I2V.i2v.W2V.tokenize(self, items, indexing=True, padding=False, key=<function I2V.<lambda>>, *args, **kwargs) list¶
tokenize item
EduNLP.I2V.get_pretrained_i2v¶
- param name
- param model_dir
- returns
i2v model
- rtype
I2V
Examples
>>> item = {"如图来自古希腊数学家希波克拉底所研究的几何图形.此图由三个半圆构成,三个半圆的直径分别为直角三角形$ABC$的斜边$BC$, ... 直角边$AB$, $AC$.$\bigtriangleup ABC$的三边所围成的区域记为$I$,黑色部分记为$II$, 其余部分记为$III$.在整个图形中随机取一点, ... 此点取自$I,II,III$的概率分别记为$p_1,p_2,p_3$,则$\SIFChoice$$\FigureID{1}$"}
>>> i2v = get_pretrained_i2v("test_d2v", "examples/test_model/data/d2v")
>>> print(i2v(item))
([array([ ...dtype=float32)], None)