Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use the decomposer ? #3

Open
cyclomarc opened this issue Mar 26, 2015 · 1 comment
Open

How to use the decomposer ? #3

cyclomarc opened this issue Mar 26, 2015 · 1 comment

Comments

@cyclomarc
Copy link

I am looking for an example mapping file that shows how to use the decompounder ? I am familiar with the ES dictionary_decompounder, but if I understand well, the plugin provides a decompounder that does not require a word list. My question is: what is the syntax to be used in the mapping file (filter, analyzer, tokenizer) so that the decompounder is used during analysis ?

Hope you can help
Marc

@jprante
Copy link
Owner

jprante commented Mar 26, 2015

I have added example configurations at the README

https://github.com/jprante/elasticsearch-plugin-bundle/blob/master/README.md

It is not much but at least a start.

The decompounder is based on the ASV toolbox

http://wortschatz.uni-leipzig.de/~cbiemann/software/toolbox/#_Baseforms

so it should be somehow possible to ramp up a "training environment" to create new parameter files for decompounding. The binary prebuilt files for german decompounding in the plugin are simply copied from ASV toolbox.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants