11 Commits (74909ca3a0406cef78ca187cb41e47bad3aea789)

Author SHA1 Message Date
Your Name c99ac23bb5 actor_merger: fixed absence of publication_date in some cases
4 years ago
Your Name cc7fa5bffa actor_merger: added aggregations of all individual actors and all party mentions in an article
4 years ago
Your Name d9d578c06a actor_merger: mult fix
4 years ago
Your Name 771145faf7 actor_merger: added mult='first' to metadata join for parties_actors to deal with duplicate partyIds (see 50Plus, Conservatives and Labour)
4 years ago
Your Name 1c14646e8f actor_merger: dont deselect sent_words and sent_sum columns
4 years ago
Your Name 9bd382f955 actor_merger: fix to generate bogus sentiment columns
4 years ago
Your Name b7f1afddd1 actor_merger: total rewrite based on data.table for performance reasons. Added some exceptions due to non-existing partyIds that some individual actors have in the actor database
4 years ago
Your Name 2c8a88f9a0 elasticizer: switched from bind_rows to rbindlist for composing result
4 years ago
Your Name e499d70671 actor_merger: added ungroup() calls at the start and end of function, to speed up processing
4 years ago
Your Name 61e0581595 actor_merger: removed debug line
4 years ago
Your Name f022312485 actor_merger: added function for generating actor-document data frames
4 years ago