Crate cld2 [−] [src]
Detect the language of a string using the cld2 library from the Chromium project.
use cld2::{detect_language, Format, Reliable, Lang}; let text = "It is an ancient Mariner, And he stoppeth one of three. 'By thy long grey beard and glittering eye, Now wherefore stopp'st thou me?"; assert_eq!((Some(Lang("en")), Reliable), detect_language(text, Format::Text));
This library wraps the cld2-sys
library, which provides a raw
interface to cld2. The only major feature which isn't yet wrapped is
the ResultChunk
interface, because it tends to give fairly imprecise
answers—it wouldn't make a very good multi-lingual spellchecker
component, for example. As always, pull requests are eagerly welcome!
WARNING: We assume that nobody tries to change the loaded cld2
data
tables or calls the C++ function CLD2::DetectLanguageVersion
behind
our backs. These configuration and debugging APIs in cld2
are not
thread safe.
For more information, see the GitHub project for this library.
Reexports
pub use self::Reliability::{Reliable, Unreliable}; |
Structs
DetectionResult |
Detailed language detection results. |
Hints |
Hints to the decoder, which it will use to make better guesses. |
Lang |
A language code, normally two letters for common languages. |
LanguageScore |
Detailed information about how well the input text matched a specific language. |
Enums
Format |
Possible data formats. |
Reliability |
Is the output of the language decoder reliable? |
Functions
detect_language |
Detect the language of the input text. |
detect_language_ext |
Detect the language of the input text, using optional hints, and return detailed statistics. |
detector_version |
Get the version of cld2 and its embedded data files as a string. |