Active Learning Using Density as Well
Previously I posted an active learning example and said I was going to try it with the density label, so here it is.
It took a while, with all sorts of problems along the way, but I got it working in the end.
I plotted what happens with the Ames dataset.
There is a steep rise at first (y-axis is accuracy, each step on the x-axis is one round of adding 10 samples), then a linear rise up to 100% accuracy (well, overfit...).
So it looks like around 0.80, or maybe a bit more, is the target for actual model usage. I should add a validation dataset so I can measure the true accuracy, but this dataset is difficult in the sense that it is actually quite sparse. A real-life dataset will definitely be interesting, and it is something to try at work.
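To make the validation idea concrete, here is a minimal sketch in plain Python. Everything in it is an assumption for illustration: the toy 2D data, the 1-nearest-neighbour model, the random sample selection (standing in for the real density/uncertainty selection), and the helper names `make_toy`, `nearest_label`, `accuracy` and `learning_curve`. It only shows why accuracy measured on the pool itself climbs to 100% while a held-out set gives a more honest number.

```python
import random

def make_toy(n, rng, noise=0.2):
    """Toy 2D points; label is 1 above the diagonal, flipped with some noise."""
    pts = [(rng.random(), rng.random()) for _ in range(n)]
    ys = [int(x + y > 1.0) ^ int(rng.random() < noise) for x, y in pts]
    return pts, ys

def nearest_label(x, labeled):
    """1-nearest-neighbour prediction from a list of (point, label) pairs."""
    best = min(labeled, key=lambda pl: (pl[0][0] - x[0]) ** 2 + (pl[0][1] - x[1]) ** 2)
    return best[1]

def accuracy(points, labels, labeled):
    """Fraction of points whose nearest labeled neighbour has the right label."""
    hits = sum(nearest_label(p, labeled) == y for p, y in zip(points, labels))
    return hits / len(points)

def learning_curve(pool, pool_y, val, val_y, batch=10, rounds=5, seed=0):
    """Add `batch` samples per round; record pool and validation accuracy."""
    order = list(range(len(pool)))
    # Assumption: random order stands in for the real query strategy.
    random.Random(seed).shuffle(order)
    labeled, curve = [], []
    for r in range(rounds):
        for i in order[r * batch:(r + 1) * batch]:
            labeled.append((pool[i], pool_y[i]))
        curve.append((len(labeled),
                      accuracy(pool, pool_y, labeled),
                      accuracy(val, val_y, labeled)))
    return curve

rng = random.Random(42)
pool, pool_y = make_toy(50, rng)
val, val_y = make_toy(50, rng)   # held-out set, never added to `labeled`
curve = learning_curve(pool, pool_y, val, val_y, batch=10, rounds=5)
for n_labeled, pool_acc, val_acc in curve:
    print(n_labeled, round(pool_acc, 2), round(val_acc, 2))
```

Once every pool sample is labeled, the pool accuracy is 100% by construction, because each point is its own nearest neighbour. The validation column is the one to watch when deciding where to stop.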
I also really like the Exploration/Exploitation Score combiner: its built-in (1-x) for exploration does away with a Math Formula node! Haha!
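For anyone doing this step by hand, the combination can be sketched as a simple weighted sum. This is my assumption about how the weights are applied, not the node's documented formula: `x` weights the exploitation score and `(1 - x)` weights the exploration score. The function names `combine_scores` and `pick_top` and the example numbers are made up for illustration.

```python
def combine_scores(exploration, exploitation, x=0.5):
    """Assumed weighted sum: (1 - x) * exploration + x * exploitation."""
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must be in [0, 1]")
    return [(1.0 - x) * d + x * u for d, u in zip(exploration, exploitation)]

def pick_top(scores, k):
    """Indices of the k highest combined scores (candidates to label next)."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]

# Hypothetical per-sample scores, both scaled to [0, 1].
density = [0.9, 0.2, 0.5]       # exploration: how dense the neighbourhood is
uncertainty = [0.1, 0.8, 0.5]   # exploitation: how unsure the model is

explore_first = pick_top(combine_scores(density, uncertainty, x=0.0), 1)
exploit_first = pick_top(combine_scores(density, uncertainty, x=1.0), 1)
print(explore_first, exploit_first)
```

With `x=0.0` only density counts, so the densest sample is picked. With `x=1.0` only uncertainty counts. Anything in between trades the two off.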
As for clustering lots of data, unfortunately I am hitting the limit here, so I am trying a GPU-assisted approach and other options using Python. I will probably post how that goes too.
Data science/chemistry is so much fun! But then I do think molecular simulation is fun in a whole other dimension too!!