Hi
I was wondering if there is some documentation available for the package Moose-Algos-InformationRetrieval or if someone ever used it?
I don’t understand how the MalLogLikelihoodRatio is working (it’s a kind of similarity metric between two sets of terms).
What I have right know is a corpus of documents with their contents and I’d like to compare a request to my corpus.
(it requires that you have executed some scripts in a playground before)
data := FileSystem workingDirectory allFiles select:[:e| e extension = 'ph'] thenCollect:#contents.
corpus := MalCorpus new.
'Initiating corpus...' displayProgressFrom: 1 to: data size during: [ :bar|
data do:[:e|
corpus addDocument: e with: (MalTerms fromString: e contents).
bar increment.
].
corpus removeStopwords .
].
stringRequest := 'toto tata'.
request := MalTerms fromString: stringRequest.
likelihood := MalLogLikelihoodRatio new.
likelihood setTerms1: request.
requestAnswer := Dictionary keys: data values: (data collect:[:e|
likelihood setTerms2: (corpus atDocument: e); computeAll
]).
I don’t get what are the values that I have in the variable requestAnswer.
Any hints?
Thanks guys (and girls)
Regards,
--------------
Brice Govin
PhD student in RMoD research team at INRIA Lille
Software Engineer at THALES AIR SYSTEMS Rungis
ENSTA-Bretagne ENSI2014
22 Avenue du General Leclerc 92340 BOURG-LA-REINE
_______________________________________________
Moose-dev mailing list
[hidden email]
https://www.list.inf.unibe.ch/listinfo/moose-dev