Hello Eibe,

 

Thanks for your reply. Yes, the three subsets are selected at random from the overall dataset.

 

Regards,

 

Ronan

 

Message: 3

Date: Sat, 16 Dec 2017 18:42:36 +1300

From: Eibe Frank <eibe@waikato.ac.nz>

To: Weka machine learning workbench list.

                <wekalist@list.waikato.ac.nz>

Subject: Re: [Wekalist] Select attributes - why are the rankings

                different forsubsets of the same dataset?

Message-ID: <5a34b259.488f630a.96b50.47a4@mx.google.com>

Content-Type: text/plain; charset="utf-8"

 

Have you shuffled the data before you created the three subsets? The Randomize filter in WEKA can be used for that. Alternatively, you can use the RemoveFolds filter (configuring it for a three-fold cross-validation).

 

Cheers,

Eibe

 

From: Ronan Flynn

Sent: Saturday, 16 December 2017 12:50 AM

To: wekalist@list.waikato.ac.nz

Subject: [Wekalist] Select attributes - why are the rankings different forsubsets of the same dataset?

Hello All,

 

I have a speech dataset that is divided into three subsets. There are approximately 90 attributes and the target is a numerical correlation value. I want to rank the attributes and have used the following:

 

Evaluator:??? weka.attributeSelection.WrapperSubsetEval -B weka.classifiers.functions.SMOreg -F 5 -T 0.01 -R 1 -E CORR-COEFF -- -C 0.0302 -N 0 -I "weka.classifiers.functions.supportVector.RegSMOImproved -T 0.001 -V -P 1.0E-12 -L 0.001 -W 1" -K "weka.classifiers.functions.supportVector.PolyKernel -E 1.0 -C 250007"

Search:?????? weka.attributeSelection.GreedyStepwise -R -T -1.7976931348623157E308 -N -1 -num-slots 1

 

When I run the attribute selection on each of the three speech subsets I get three very different ranked lists. I would have expected the rankings for the three subsets to be similar given that they are taken from the same overall speech dataset. Can anyone suggest possible reasons as to why the rankings are so different for each of the three speech subsets?

 

Also, is it possible when doing the ranking to output the correlation for each attribute individually? I would like to see the correlation for the individual attributes.

 

Regards and thanks,

 

Ronan Flynn

 

 

 

Tá an t-eolas atá le fáil sa ríomhphost seo faoi iontaoibh agus tá sé ceaptha le haghaidh aird an fhaighteora bheartaithe/na bhfaighteoirí beartaithe amháin. Más rud é go bhfuair tú an ríomhphost seo go hearráideach, ná húsáid agus ná tarchuir é ar mhaithe le haon chuspóir, le do thoil; ina áit sin cuir ar an eolas muid láithreach agus scrios gach cóip den ríomhphost seo ó do chóra(i)s ríomhaireachta. Ach amháin sa chás gur comhaontaíodh a leithéid go sonrach ag ár n-ionadaí údaraithe, is le húdar an ríomhphoist amháin na tuairimí a chuirtear in iúl ann, agus ní léiríonn siad tuairim ná ní chuireann siad ceangal ar aon chaoi eile ar Institiúid Teicneolaíochta Bhaile Átha Luain. Déan teagmháil le administrator@ait.ie nó cuir glao ar 090 6468000. The information contained in this email is confidential and is designated solely for the attention of the intended recipient(s). If you have received this email in error, please do not use or transmit it for any purpose but rather notify us immediately and delete all copies of this email from your computer system(s). Unless otherwise specifically agreed by our authorised representative, the views expressed in this email are those of the author only and shall not represent the view of or otherwise bind Athlone Institute of Technology. Contact administrator@ait.ie or telephone 090 6468000.