Lord, decide the following problem: there are website development, is desirable key words and a set of already written of the individual texts.
Now making mechanism in order to identify the intersection of how the current text content corresponds to a set of goals.
Finding the key words in the text from a given list, the situation is clear:
When adding keywords we
use the implementation of the mechanism of Stemming ruby
The question is to rate other: how the text corresponds to this request, and it doesn't others. That is, I need a mechanism that will parse text, find the most often repeated words, and give their per cent content in the text.
The question is, do you know someone (maybe even described in the Network?) turnkey solution for rails? For example, the algorithm
TF-IDF?
It is clear that you can write everything. But there are such an important resource as time. So I'm asking: is there already some ready solutions to this issue?
Thank you.