[TeamTalk 308]: [844] trunk/TeamTalk/Resources/Grammar: Tokenize with underscores since that's what phoenix understands.
tk@edam.speech.cs.cmu.edu
tk at edam.speech.cs.cmu.edu
Wed Oct 17 22:35:42 EDT 2007
An HTML attachment was scrubbed...
URL: http://mailman.srv.cs.cmu.edu/pipermail/teamtalk-developers/attachments/20071017/d6856c8b/attachment.html
-------------- next part --------------
Modified: trunk/TeamTalk/Resources/Grammar/GRAMMAR/TeamTalkTask.gra
===================================================================
--- trunk/TeamTalk/Resources/Grammar/GRAMMAR/TeamTalkTask.gra 2007-10-17 18:56:22 UTC (rev 843)
+++ trunk/TeamTalk/Resources/Grammar/GRAMMAR/TeamTalkTask.gra 2007-10-18 02:35:42 UTC (rev 844)
@@ -267,10 +267,10 @@
;
[Side]
- (the=north=end=of)
- (the=east=end=of)
- (the=south=end=of)
- (the=west=end=of)
+ (the_north_end_of)
+ (the_east_end_of)
+ (the_south_end_of)
+ (the_west_end_of)
[Goal]
([Home])
@@ -371,18 +371,10 @@
(do not)
MOD
(thanks)
- (thank=you) # should be a lexeme
+ (thank_you) # should be a lexeme
(not really)
(it's not)
(i'm not)
-#NO
-# (no)
-# (not)
-#GOOD
-# (right)
-# (correct)
-# (good)
-# (okay)
;
Modified: trunk/TeamTalk/Resources/Grammar/tokenize.pl
===================================================================
--- trunk/TeamTalk/Resources/Grammar/tokenize.pl 2007-10-17 18:56:22 UTC (rev 843)
+++ trunk/TeamTalk/Resources/Grammar/tokenize.pl 2007-10-18 02:35:42 UTC (rev 844)
@@ -81,7 +81,7 @@
$text = $line; $prob = undef;
}
$text =~ s/^\s*\((.+?)\)\s*$/$1/; # trim spaces from ends, strip ()'s
- $text =~ s/\s+/=/g; # tokenize the text by substituting spaces
+ $text =~ s/\s+/_/g; # tokenize the text by substituting spaces
$tokens{"$text:$classid"}++;
$lexset{"$text:$classid"} = $prob;
}
More information about the TeamTalk-developers
mailing list