get_pretrained_i2v¶
概述¶
使用 EduNLP 项目组给定的预训练模型将给定的题目文本转成向量。
优点:简单方便。
缺点:只能使用项目中给定的模型,局限性较大。
导入功能块¶
[1]:
from EduNLP import get_pretrained_i2v
输入¶
类型:str
内容:题目文本 (text)
[2]:
item = {
"如图来自古希腊数学家希波克拉底所研究的几何图形.此图由三个半圆构成,三个半圆的直径分别为直角三角形$ABC$的斜边$BC$, 直角边$AB$, $AC$.$\bigtriangleup ABC$的三边所围成的区域记为$I$,黑色部分记为$II$, 其余部分记为$III$.在整个图形中随机取一点,此点取自$I,II,III$的概率分别记为$p_1,p_2,p_3$,则$\SIFChoice$$\FigureID{1}$"
}
模型选择与使用¶
根据题目所属学科选择预训练模型:
预训练模型名称 |
模型训练数据的所属学科 |
|---|---|
d2v_all_256 |
全学科 |
d2v_sci_256 |
理科 |
d2v_eng_256 |
英语 |
d2v_lit_256 |
文科 |
[3]:
i2v = get_pretrained_i2v("d2v_sci_256")
EduNLP, INFO Use pretrained t2v model d2v_sci_256
downloader, INFO http://base.ustc.edu.cn/data/model_zoo/EduNLP/d2v/general_science_256.zip is saved as /home/lvrui/.EduNLP/model/general_science_256.zip
downloader, INFO file existed, skipped
注意: 默认的 EduNLP 项目存储地址为根目录(
~/.EduNLP),模型存储地址为项目存储地址下的model文件夹。您可以通过修改下面的环境变量来修改模型存储地址:EduNLP 项目存储地址:
EDUNLPPATH = xx/xx/xx模型存储地址:
EDUNLPMODELPATH = xx/xx/xx
[4]:
print(i2v(item))
([array([-2.38860980e-01, 7.09681511e-02, -2.71706015e-01, 1.64714813e-01,
2.81243492e-02, -1.82386801e-01, 9.22331214e-02, 1.31783364e-02,
9.15176645e-02, 3.14464062e-01, 9.37800854e-02, -2.28523940e-01,
-2.60597020e-01, 6.49375990e-02, 9.75619778e-02, -1.97933778e-01,
8.29798505e-02, -2.26491719e-01, -1.77030653e-01, -3.56038064e-02,
6.22844934e-01, -2.66110301e-01, 8.00080523e-02, -1.60827965e-01,
-1.78654417e-01, -1.33000776e-01, 2.76004016e-01, 1.79546073e-01,
8.71006995e-02, 2.33958483e-01, 1.76031828e-01, 1.55402005e-01,
-1.38987333e-01, -1.92975491e-01, -1.09528497e-01, 1.12305783e-01,
2.32549626e-02, 7.75609687e-02, -2.43636876e-01, 6.35311157e-02,
-4.82399836e-02, -2.24204548e-02, 7.49862418e-02, -1.91449642e-01,
9.72701237e-02, 4.00750965e-01, 2.81992704e-01, 3.07581365e-01,
-4.68867749e-01, -3.03025767e-02, -1.95257351e-01, 1.79073047e-02,
-2.15334237e-01, 9.98005569e-02, -2.62755096e-01, -2.39337608e-01,
3.44270498e-01, 1.50241479e-01, -2.96006531e-01, -3.81666899e-01,
-1.19041964e-01, 6.18071109e-02, 6.49120063e-02, 9.94637012e-02,
1.23297565e-01, 1.29930690e-01, 1.27305657e-01, -1.53804764e-01,
7.04720244e-03, -1.33500487e-01, -1.51161134e-01, 1.13862932e-01,
-2.44814962e-01, -8.95622373e-02, 4.76458520e-02, -5.92206642e-02,
2.88407020e-02, -5.88610955e-02, -4.25557904e-02, 3.20446432e-01,
-2.61463765e-02, 7.19539896e-02, -1.32161498e-01, 1.62227061e-02,
1.20197656e-03, -2.03355268e-01, -6.83294982e-03, -2.82588631e-01,
-1.61395460e-01, -5.05547188e-02, -2.27462381e-01, -1.70932785e-01,
1.41351461e-01, -1.30069017e-01, -1.83039993e-01, -6.79691881e-02,
-2.15642393e-01, -7.84436688e-02, 1.77202985e-01, 4.50607650e-02,
7.02605024e-02, 8.01992565e-02, -1.55584306e-01, -2.00563252e-01,
1.17082551e-01, 9.73844752e-02, -1.10356934e-01, -1.37866074e-02,
-8.57235789e-02, -5.56467362e-02, -9.36827138e-02, 6.82030804e-03,
6.92379624e-02, -2.28701755e-01, 6.70390204e-02, 1.34586483e-01,
2.25231394e-01, 1.33322045e-01, -8.82911906e-02, 1.42205298e-01,
2.41012901e-01, 7.94170424e-03, -7.02124536e-02, 2.51370400e-01,
1.04983136e-01, -6.39194548e-02, 5.24720028e-02, 7.16757867e-03,
-1.08169973e-01, -1.08731678e-02, 1.69618204e-02, 7.87692815e-02,
-2.26539060e-01, 3.29003595e-02, 1.91522852e-01, 2.75921494e-01,
-1.64055750e-01, 5.83723187e-02, 9.84422341e-02, 3.21688712e-01,
-2.62310840e-02, -2.08140060e-01, 1.14425711e-01, 1.23823956e-01,
-8.62085819e-03, -4.14005108e-02, -3.41566652e-02, 1.34680912e-01,
4.27634180e-01, 1.42883554e-01, -1.54787973e-01, 7.96157196e-02,
1.40678003e-01, 1.39171826e-02, 1.66003749e-01, -4.85638082e-02,
5.88261709e-02, 9.51106697e-02, 1.81014258e-02, 1.44485429e-01,
4.01205927e-01, 6.77596256e-02, -5.52676022e-01, -1.87850371e-01,
1.12366609e-01, -6.84190989e-02, 9.48949978e-02, 2.23454669e-01,
-1.69843137e-01, 2.09085494e-01, 4.29946512e-01, -3.36349100e-01,
6.12608856e-03, -1.46142125e-01, -5.11092655e-02, 8.06671828e-02,
1.81744993e-01, -6.78945482e-02, -5.77093139e-02, 1.52337164e-01,
2.21259117e-01, 3.35705757e-01, -2.51778495e-02, 1.03662543e-01,
-4.21361588e-02, 1.43061429e-01, -3.92947495e-01, -4.89463992e-02,
-9.15660262e-02, -1.00108273e-01, 3.86523217e-01, -4.25569601e-02,
4.10154127e-02, -3.41399819e-01, 2.13903114e-02, 8.09015241e-03,
9.56344381e-02, 1.12729572e-01, 7.25207478e-02, -6.64384067e-02,
-2.73666024e-01, -2.79651750e-02, 1.18422434e-01, -5.22459708e-02,
-2.47057881e-02, 2.84700710e-02, 2.07451075e-01, -9.74238589e-02,
8.08936954e-02, 4.07307222e-02, -1.35277033e-01, 2.18436554e-01,
1.28792310e-02, -1.20433331e-01, 2.41929386e-02, 1.28128864e-02,
-7.39881098e-02, -1.12995692e-01, 7.69245178e-02, -2.87000872e-02,
1.64782573e-02, -2.78794408e-01, -2.64403820e-01, -2.43874848e-01,
1.77457914e-01, 4.11631197e-01, -6.09753132e-02, 2.84967333e-01,
9.81074646e-02, -2.68213183e-01, 1.52153388e-01, 2.42148209e-02,
1.24371536e-01, 6.02926640e-03, 8.22689310e-02, 2.82294262e-04,
-1.40584474e-02, 4.09389734e-02, -2.58334547e-01, -9.83026102e-02,
-1.91695184e-01, -2.61005852e-02, -2.21736208e-01, -4.36628833e-02,
9.49840024e-02, -5.16017936e-02, 2.17577979e-01, 2.58604765e-01,
6.33814484e-02, -7.10158283e-03, 9.87893157e-03, -2.26405971e-02,
1.67435139e-01, 2.90897069e-03, 2.35914681e-02, 5.43428905e-06],
dtype=float32)], None)