[ACT-R-users] experiments to validate the behavior of a model
Rhiannon L Weaver
rlweaver at stat.cmu.edu
Tue Sep 16 13:55:55 EDT 2008
Hi Florian,
Are you talking about how many model simulations to run, or how much human
data to collect to compare to simulations? If it is a question of model
simulations, it depends on how you want to characterize the behavior. In
essence, each time the model runs, you can record any number of
characteristics of that run in order to get a profile of the behavior. Say
you record J items for each model run. So eg, total time, number of
turns, number of corrections, when they see the cloud, whether or not they
avoid it, etc...
If you are interested in averaging this behavior across multiple runs, and
characterizing the model by this average, then there is some standard
multivariate statistical analysis that you can apply to do a "sample size"
calculation-- in this sense, it would be the number of model runs you need
to do in order to estimate the mean value of your J items, and how they
interact (eg, the full covariance matrix), within a certain level of
statistical confidence. See for example, Johnson and Wichern, "Applied
Multivariate Statistical Analysis" and the chapters on estimating
multivariate means and profile analysis assuming Gaussian distributions.
This is the book I'm familiar with, but it can be rather dense. A
less-theoretical approach might be offered in Afifi, Clark, and May,
"Computer-aided multivariate analysis", though I'm just going by reviews
and a perusal of the table of contents.
In any case I recommend looking at the full distribution of the data
points across model runs in addition to the mean values, if only to see
whether you can assume that they are distributed as Multivariate Normal or
not.
The methods are similar for compiling human data, but because you have
less control over the parameters, you might need to think of sample size
within subjects. Ie, if you suspect, for example, that you have people of
different baseline skill levels, you might want to get several repetitions
of the task per person to take that into account, if you are interested in
a within-subjects characterization of behavior.
Best,
Rhiannon
On Tue, 16 Sep 2008, act-r-users-request at act-r.psy.cmu.edu wrote:
> Send ACT-R-users mailing list submissions to
> act-r-users at act-r.psy.cmu.edu
>
> To subscribe or unsubscribe via the World Wide Web, visit
> http://act-r.psy.cmu.edu/mailman/listinfo/act-r-users
> or, via email, send a message with subject or body 'help' to
> act-r-users-request at act-r.psy.cmu.edu
>
> You can reach the person managing the list at
> act-r-users-owner at act-r.psy.cmu.edu
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of ACT-R-users digest..."
>
>
> Today's Topics:
>
> 1. experiments to validate behaviour of a cognitive model
> (Florian Frische)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Tue, 16 Sep 2008 13:36:24 +0200
> From: Florian Frische <Florian.Frische at offis.de>
> Subject: [ACT-R-users] experiments to validate behaviour of a
> cognitive model
> To: act-r-users at act-r.psy.cmu.edu
> Message-ID:
> <OFDCE8D3F3.6C3ABEE3-ONC12574C6.003F7D42-C12574C6.003FE6BA at offis.uni-oldenburg.de>
>
> Content-Type: text/plain; charset="US-ASCII"
>
> Hi all,
>
> I would like to ask you about the amount of experimental data in order to
> validate a model's behaviour.
> I think there are several parameters that have an effect on the amount of
> data (suspects, trials) that is needed.
> The following 2 examples should help to describe what this question is
> about:
>
> Example 1: Let's say we have a 2-armed bandit that always returns -1 on
> the one arm and 1 on the other arm. We Would like
> to validate the behaviour of our model towards the behaviour of gamblers
> (The strategy that they use to maximize the overall outcome)
>
> Example 2: We have a complex flight task where a pilot has to avoid a
> thunderstorm. We would like to validate the behaviour
> of our pilot model in this task towards real pilots behaviour.
>
> It is obvious that example 1 is less complex than example 2 and I think
> that we need much more experimental data to get reliable results
> for the second example. I suppose there is a relationship beetween task
> complexity (e.g. independent variables) and the number of suspects/number
> of trials.
> But how can I assess how many suspects/trials I need (How many
> participants and how many repeats)?
>
> Thanks a lot for participating in this discussion,
>
> Florian Frische
>
> OFFIS
> FuE Bereich Verkehr | R&D Division Transportation
> Escherweg 2 - 26121 Oldenburg - Germany
> Phone.: +49 441 9722-523
> E-Mail: florian.frische at offis.de
> URL: http://www.offis.de
>
>
>
> ------------------------------
>
> _______________________________________________
> ACT-R-users mailing list
> ACT-R-users at act-r.psy.cmu.edu
> http://act-r.psy.cmu.edu/mailman/listinfo/act-r-users
>
>
> End of ACT-R-users Digest, Vol 38, Issue 11
> *******************************************
>
More information about the ACT-R-users
mailing list