Your Name
9eae486a80
separated data preprocessing routines
class_update: check if there are idf values associated with model, before applying weights
estimator: make use of preproc() function for data preprocessing
preproc: function containing all logic for text data preprocessing and weighting
5 years ago
actor_fetcher.Rd
actor_fetcher: elasticizer batch function to fetch actorsDetail fields from all relevant documents
6 years ago
actorizer.Rd
lemma_writer: updated to provide support for writing raw documents to individual files using utf-8 encoding
5 years ago
bulk_writer.Rd
actorizer, ud_update: Updated merging of document fields to properly deal with missing punctuation at the end of fields (e.g. a title without punctuation at the end of the string)
6 years ago
class_update.Rd
revised modeling pipeline:
5 years ago
cv_generator.Rd
revised modeling pipeline:
5 years ago
dfm_gen.Rd
revised modeling pipeline:
5 years ago
dupe_detect.Rd
elasticizer: renamed size parameter to batch_size, created max_batch parameter to limit the number of results returned
6 years ago
elastic_update.Rd
Major overhaul to ES bulk update integration. Added support for both setting and appending to variables
6 years ago
elasticizer.Rd
actorizer, dfm_gen, modelizer, out_parser: replaced all instances of detectCores by cores parameter (which defaults to detectCores)
6 years ago
estimator.Rd
revised modeling pipeline:
5 years ago
feat_select.Rd
revised modeling pipeline:
5 years ago
lemma_writer.Rd
revised modeling pipeline:
5 years ago
merger.Rd
dfm_gen, merger: Added option for generating lemma_upos hybrids for merged field
6 years ago
metric_gen.Rd
revised modeling pipeline:
5 years ago
modelizer.Rd
revised modeling pipeline:
5 years ago
modelizer_old.Rd
revised modeling pipeline:
5 years ago
out_parser.Rd
revised modeling pipeline:
5 years ago
preproc.Rd
separated data preprocessing routines
5 years ago
query_gen_actors.Rd
actorizer, query_gen_actors: revamped actor searches entirely
6 years ago
query_string.Rd
actor_aggregation: Added function to generate aggregate actor measures at daily, weekly, monthly and yearly level
6 years ago
ud_update.Rd
actorizer, ud_update: implemented 'ver' variable for keeping track of updates
6 years ago