So should I be checking something besides the distribution? To be sure of my results I ran a training set using the buildClassifier (data) call then distributionForInstance to use the model. From what you are saying I must be doing something wrong. Do you know where their is some sample code that I can check out?
Yes, you should get the same result, regardless of whether you use the API or one of WEKA’s user interfaces, as long as the training and test sets are the same.
> On 9/03/2016, at 2:51 AM, Jerry Thorpe <email@example.com> wrote:
> I am using J48 on a data set with 9,000 records. During the training I get a 75% correct rate but when i use the same data in my Java code the distribution rate only finds 120 out of the 9,000 which is about 1.3%.Shouldn't the values match since the data set is the same?
> Wekalist mailing list
> Send posts to: Wekalist@list.waikato.ac.nz
> List info and subscription status: http://list.waikato.ac.nz/mailman/listinfo/wekalist
> List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html
Wekalist mailing list
Send posts to: Wekalist@list.waikato.ac.nz
List info and subscription status: http://list.waikato.ac.nz/mailman/listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html