Active Learn using Density as well

Previously I posted active learning example.

hateknime.hatenablog.com

 

and did say I'm going to try with density label so here it is. Took a while with all sorts of  problems but I had it in the end. 

f:id:hateknime:20200412175535p:plain

 

I plotted what happens with AMES dataset.

f:id:hateknime:20200412175620p:plain

 

steep rize (y-axis is accuracy, x-axis is adding 10 samples each round) then linear rize upto 100% accuracy (well, overfit...)
so looks like around 0.80 is the target or maybe more for the actual model usage. I should add validation dataset so I can actually measrure the true accuracy but this dataset is difficult in a sense that it is actually quite sparse.  Real life dataset will definitely be interesting and things to try at work.

Also really like the Exploration/Exploitation Score combiner (1-x) for exploration reduces math formula node away! haha!

 

As for clustering lots of data, unfortunately seeing the limit here and trying with GPU assisted way and others using python. I will probably post how that goes too.

 

Data science/chemistry is so much fun! but then do think that molecular simulation is another dimension fun too!!