calcTranslationSimilarity

Table of Contents

calcTranslationSimilarity
Package
Prerequisite
Setup
Usage
- Further options
License
Contributor
Reference

calcTranslationSimilarity

Calculates the similarity between pairs of Japanese sentences. The score can be used to check whether a sentence is likely to have been machine translated.

Package

calcTranslationSimilarity.bat
batch file for demo
calcTranslationSimilarity.py
main program
sentences.csv
input file for the batch file
sentences_out.csv
output file with sentence similarity.

Prerequisite

Windows 10 x64
Anaconda 5.2.0 (conda 4.9.2)
Python 3.8.5

Setup

Install the MeCab library first. For installation with Anaconda on Windows 10, refer to: https://emotionexplorer.blog.fc2.com/blog-entry-349.html

$ conda install -c mzh mecab-python3

$ conda install -c conda-forge unidic-lite
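After the two installs, a quick check that MeCab can be imported and can find a dictionary may save some debugging. The sketch below is not part of this repository; `wakati_or_none` is a hypothetical helper name, and it assumes the installed `mecab-python3` build picks up `unidic-lite` without extra arguments, which may not hold for every build.

```python
import importlib.util


def wakati_or_none(text):
    """Return space-separated tokens, or None if MeCab is not usable."""
    if importlib.util.find_spec("MeCab") is None:
        return None  # mecab-python3 is not installed in this environment
    import MeCab
    try:
        # "-Owakati" asks MeCab for space-separated word segmentation.
        tagger = MeCab.Tagger("-Owakati")
    except RuntimeError:
        return None  # MeCab is installed but no dictionary was found
    return tagger.parse(text).strip()


if __name__ == "__main__":
    print(wakati_or_none("今日は良い天気です"))
```

If this prints `None`, the MeCab binding or its dictionary is probably not set up correctly yet.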

Usage

Run calcTranslationSimilarity.bat for demo.

Further options

You can switch between Normal mode and Important mode. Normal mode compares every token produced by MeCab's plain 'wakati' word segmentation, while Important mode compares only the important components (verbs, nouns, adjectives, etc.). The default is Important mode. To switch modes, comment out one of the two lines below and uncomment the other.
```
# similarity_score = o_mecab.calcTranslationSimilarity_normal(original_translation, other_translations)
similarity_score = o_mecab.calcTranslationSimilarity_important(original_translation, other_translations)          
```
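To illustrate how the two modes can lead to different scores, here is a minimal sketch that works on already-tokenized sentences. It uses Jaccard overlap of token sets as a stand-in metric; this is an assumption for illustration and may differ from what `calcTranslationSimilarity.py` actually computes. The function names `jaccard` and `mean_similarity` and the hand-written token lists are hypothetical.

```python
def jaccard(tokens_a, tokens_b):
    """Jaccard overlap between two token lists, in [0.0, 1.0]."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    union = set_a | set_b
    if not union:
        return 0.0  # no tokens at all: treat as no evidence of similarity
    return len(set_a & set_b) / len(union)


def mean_similarity(original_tokens, other_token_lists):
    """Average Jaccard score of the original against each other translation."""
    if not other_token_lists:
        return 0.0
    scores = [jaccard(original_tokens, other) for other in other_token_lists]
    return sum(scores) / len(scores)


# Normal mode: every token from wakati segmentation is compared.
normal_a = ["猫", "が", "魚", "を", "食べる"]
normal_b = ["犬", "が", "魚", "を", "食べる"]

# Important mode: only content words are compared (particles dropped).
important_a = ["猫", "魚", "食べる"]
important_b = ["犬", "魚", "食べる"]

normal_score = mean_similarity(normal_a, [normal_b])
important_score = mean_similarity(important_a, [important_b])
```

In this toy example the shared particles raise the Normal score above the Important score, which suggests why filtering to content words can make differences between translations easier to see.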

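To change which parts of speech count as important in Important mode, edit the condition below. By default it keeps nouns (名詞), verbs (動詞), i-adjectives (形容詞), and na-adjectives (形容動詞).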

if node.feature.split(",")[0] == "名詞" or node.feature.split(",")[0] == "動詞" or node.feature.split(",")[0] == "形容詞" or node.feature.split(",")[0] == "形容動詞":
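The same part-of-speech check can be expressed with a set, which makes the list of tags easier to edit. The sketch below is a hypothetical rewrite, not the code in the repository: `IMPORTANT_POS` and `keep_important` are made-up names, and the `(surface, feature)` pairs are hand-written stand-ins for MeCab nodes so that the example runs without MeCab.

```python
# Tags to keep; the first comma-separated field of a MeCab feature
# string is the top-level part of speech.
IMPORTANT_POS = {"名詞", "動詞", "形容詞", "形容動詞"}


def keep_important(nodes, pos_tags=IMPORTANT_POS):
    """Return the surfaces whose top-level part of speech is in pos_tags."""
    return [
        surface
        for surface, feature in nodes
        if feature.split(",")[0] in pos_tags
    ]


# Hand-written (surface, feature) pairs standing in for MeCab output.
sample_nodes = [
    ("猫", "名詞,普通名詞,一般"),
    ("が", "助詞,格助詞"),
    ("速く", "形容詞,一般"),
    ("走る", "動詞,一般"),
]

important_tokens = keep_important(sample_nodes)
nouns_only = keep_important(sample_nodes, pos_tags={"名詞"})
```

The tag names depend on the dictionary in use. UniDic-based dictionaries appear to label na-adjectives as 形状詞 rather than 形容動詞, so it may be worth checking the actual feature strings your dictionary produces before editing the set.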

License

This software is released under the MIT License; see LICENSE.

Contributor

d_paopao9913

Reference

https://d-paopao.com/calculation_sentense_similarity/
