GitHub - svjack/Sbert-ChineseExample: Sentence-Transformers Information Retrieval example on Chinese

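* 1 This project uses a custom overloaded es-pandas interface (with vector storage support) to perform simple Elasticsearch operations through pandas.
* 2 The hard-sample extraction in try_sbert_neg_sampler.py (hard samples are those the model struggles to classify) comes from https://guzpenha.github.io/transformer_rankers/. Elasticsearch can also be used to generate hard samples; that functionality is defined in valid_cross_encoder_on_bi_encoder.py.
* 3 The cross_encoder training above requires first checking how much the sentences differ semantically; pairing samples with similar semantics helps model training.
* 4 Added some tools for comparing multi-class results from Sentence-Transformers.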
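
The hard-sample idea in item 2 can be illustrated with a minimal, dependency-free sketch. This is not the code from try_sbert_neg_sampler.py or valid_cross_encoder_on_bi_encoder.py. The function names `cosine` and `mine_hard_negatives` and the toy vectors are assumptions made for illustration. In the project the vectors would come from a bi-encoder or from Elasticsearch retrieval.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def mine_hard_negatives(query_vec, candidates, positive_ids, k=2):
    """Return the ids of the k non-positive candidates most similar to the query.

    candidates: dict mapping a candidate id to its embedding vector (hypothetical layout).
    positive_ids: ids labelled as correct answers, excluded from the result.
    """
    scored = [
        (cosine(query_vec, vec), cid)
        for cid, vec in candidates.items()
        if cid not in positive_ids
    ]
    # Highest similarity first: these are the negatives a model is most likely to confuse.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [cid for _, cid in scored[:k]]


# Toy 2-dimensional "embeddings", invented for this example only.
query = [1.0, 0.0]
candidates = {
    "pos": [0.9, 0.1],
    "near_miss": [0.8, 0.3],
    "unrelated": [0.0, 1.0],
    "opposite": [-1.0, 0.0],
}
hard = mine_hard_negatives(query, candidates, positive_ids={"pos"}, k=2)
print(hard)
```

Negatives selected this way could then be paired with the query as training examples for a cross-encoder, which is presumably the role the hard samples play in item 3.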

Contributing

License

Distributed under the MIT License. See LICENSE for more information.

Contact

svjack - svjackbt@gmail.com ehangzhou@outlook.com

Project Link: https://github.com/svjack/Sbert-ChineseExample

Folders and files (9 commits):

* .spyproject
* script
* README.md
* README_EN.md
* requirements.txt
