[ACT-R-users] similarity

Roman Belavkin R.Belavkin at mdx.ac.uk
Sat Jun 18 03:55:10 EDT 2005


Hello Ayman,
 
Indeed, Steve Jobs may have a bit different associations than myself, which begs for some explanation, and I think it is beyond a simple difference in the corpus we use. Perhaps term apple, obviously ambiguous, meaning two very different objects, corresponds to two different representations in the mind (i.e. chunks).  And people are able to make this distinction very clearly, whereas a purely text-based technique, such as GLSA (or ICA, LSA, etc) has no means to do it.  Or does it? I wonder if there has been any research on trying to capture this difference with LSA?  Obviously, apple meaning fruit is `similar' to terms such as eat, orange, sweet and red, etc.  While apple meaning mac would have associations with terms such as Steve Jobs, OS X, etc.  The same term belongs to quite different clusters, therefore the method should be able to figure this out.
 
cheers,
Roman

	-----Original Message----- 
	From: act-r-users-admin at act-r.psy.cmu.edu on behalf of Ayman Farahat 
	Sent: Fri 6/17/2005 18:18 
	To: Peter Pirolli; Roy Wilson; act-r-users at act-r.psy.cmu.edu 
	Cc: Raluca.Budiu at parc.com; royer at parc.com 
	Subject: Re: [ACT-R-users] similarity
	
	

	Hello
	The apple example is interesting because it illustrates how the corpus (or
	domain knowledge) can influence similarity.  My feeling is that if we asked
	Steve Jobs (or for that matter a random person in the bay area) about the
	most similar term to apple, MAC would come up very high.
	An interesting example to try is "entropy". You will see that the similarity
	server captures the two senses of the term; the information theory and
	thermodynamics.
	The reason why we get better results with entropy has more to do with the
	corpus than with technique.
	Ayman
	
	> From: Peter Pirolli <pirolli at parc.com>
	> Date: Fri, 17 Jun 2005 09:50:05 PDT
	> To: Roy Wilson <rwilson+ at pitt.edu>, act-r-users at act-r.psy.cmu.edu
	> Cc: Ayman Farahat <farahat at parc.com>, Raluca.Budiu at parc.com, royer at parc.com
	> Subject: Re: [ACT-R-users] similarity
	>
	> As I point out
	> in the Pirolli (2005) Cognitive Science paper, PMI is approximately the
	> same thing as association strength in ACT-R. PMI (like LSA) is very good at
	> generating scores that correlate well with synonym judgements (e.g., as
	> tested by the TOEFL test), which might be one way to define
	
	
	_______________________________________________
	ACT-R-users mailing list
	ACT-R-users at act-r.psy.cmu.edu
	http://act-r.psy.cmu.edu/mailman/listinfo/act-r-users
	





More information about the ACT-R-users mailing list