mamlr/R
Latest commit: c32c9e5ad3 by Erik de Vries, "ud_update: fix to deal with non-existing column names" (6 years ago)
actorizer.R | actorizer, ud_update: implemented 'ver' variable for keeping track of updates | 6 years ago
bulk_writer.R | actorizer, ud_update: Updated merging of document fields to properly deal with missing punctuation at the end of fields (e.g. a title without punctuation at the end of the string) | 6 years ago
class_update.R | class_update; dfm_gen; merger: updated functions to accept text parameter for both old style 'lemmas' and new style 'ud' | 6 years ago
dfm_gen.R | class_update; dfm_gen; merger: updated functions to accept text parameter for both old style 'lemmas' and new style 'ud' | 6 years ago
dupe_detect.R | Fixed dupe_detect error on documents with one sentence or less, and a maximum # of words in dfm_gen | 6 years ago
elastic_update.R | actorizer: major fix to ud parsing, changed regex to remove html tags to only include tags with a maximum of 20 characters in them | 6 years ago
elasticizer.R | actorizer: major fix to ud parsing, changed regex to remove html tags to only include tags with a maximum of 20 characters in them | 6 years ago
merger.R | class_update; dfm_gen; merger: updated functions to accept text parameter for both old style 'lemmas' and new style 'ud' | 6 years ago
modelizer.R | actorizer, ud_update: Updated merging of document fields to properly deal with missing punctuation at the end of fields (e.g. a title without punctuation at the end of the string) | 6 years ago
query_gen_actors.R | elasticizer: Updated bulk size to 1024 (a power of 2) and set a timeout of 900s every 500000 updates | 6 years ago
query_string.R | Add query_string function for generating query_string queries | 6 years ago
ud_update.R | ud_update: fix to deal with non-existing column names | 6 years ago