Archives for April, 2017

A python gist for balancing (re-sampling) a training dataset to match a reported positive-negative class ratio, in the unseen dataset When we know the unseen’s pos-neg class ratio (or “guess” it from the LB..) we should give a try at balancing the training dataset, to reflect it. I wrote a python gist for it, using […]

mor krispil

Archives for April, 2017

Balancing the training dataset to a reported positive-negative class ratio, in the unseen dataset

TFIDF and Machine Learning – efficient hybrid processing

Meta