Averaging data according to categories

I Assume "Category" and "Measurement" are both 1D-waves...
I would recommend the use of several waves, 2D-Waves or maybe even data folders for the different categories.

If the number of measurements is constant, you might read your data as a 1D wave and redimension it to get a 2D wave. In this case you might use the imagestats function to get your statistics.

If this is not possible you could run a loop over your categories (either you know them or extract them: a loop over you first column adding a new value to the category wave) and mask out all the wrong categories:

function testit()
    variable CatValue
    wave Measurement, Category, Dummy
    duplicate /O Measurement, Dummy
    for (CatValue=10;CatValue<21;CatValue+=10)
        Dummy= (Category==CatValue) ? Measurement : NaN
        print "Category: "+num2str(CatValue)
        wavestats dummy
    endfor
end

The "Dummy= " assignment might even be used in combination with multithread in a procedure to speed it up for large(!) data sets.

Good data management is half the data treatment -- in my opinion.
If you are unfamiliar with the use of the commands please have a look at the manual.

Looking forward to other solutions,
HJ

January 22, 2015 at 12:11 pm - Permalink

jcor

Here's my contribution:

http://www.igorexchange.com/node/6258

... conceptually similar to HJDrescher's, but allows the input of an unspecified number of categories.

I tried an alternative algorithm, which tried to avoid memory overflow errors, but it was an order slower than this one. So this one only includes a simple check for memory overflow problems, and otherwise the user will have to use a smaller number of data or keys (categories).

January 23, 2015 at 04:43 am - Permalink