Fitting models from noisy heuristic labels - Emir's blog
Summary
I present a weak supervision paradigm called “data programming” which uses
maximum likelihood estimation to produce soft labels from heuristics. These
soft labels can then be used to train other models, without true labels
being required at any stage. I’ve included a simple example from first
principles to show that the methods work. The original authors have a fully
featured package called Snorkel which
provides sophisticated data programming and related features.
Introduction
There’s a...
Read more at emiruz.com