Erik de Vries
|
ef51ce60a9
|
Fixed dupe_detect error on documents with one sentence or less, and a maximum # of words in dfm_gen
|
6 years ago |
Erik de Vries
|
085252abda
|
documentation: updated dupe_detect and merger
|
6 years ago |
Erik de Vries
|
4cd46d1a5e
|
dupe_detect: added support for both lower and upper cutoff point
|
6 years ago |
Erik de Vries
|
adc4b3c639
|
Updated feature selection in modelizer function (see comment on lines 166/167)
|
6 years ago |
Erik de Vries
|
65f8c26ec6
|
Renamed dupe_detect, and added return output
|
6 years ago |
Erik de Vries
|
c815dc7f2b
|
Duplicate detection first commit
|
6 years ago |