From jaime at cs.cmu.edu Tue Oct 2 10:41:41 2007 From: jaime at cs.cmu.edu (Jaime Arguello) Date: Tue, 02 Oct 2007 10:41:41 -0400 Subject: IR Series - Kevyn Collins-Thompson - Friday, October 5 - 12:00 (noon), NSH 3002 Message-ID: <470258A5.9000103@cs.cmu.edu> Greetings, Please join us for our first IR Series talk this fall! Lunch will be provided by Yahoo! Time and Location: Friday, October 5, 12:00 - 1:00 pm, Newell-Simon Hall (NSH) 3002 Speaker: Kevyn Collins-Thompson Title: Estimating and Exploiting Uncertainty in Pseudo-Relevance Feedback Abstract: We give an overview of our recent research on methods for quantifying uncertainty in retrieval models and exploiting this new information for improved retrieval performance. We focus on the case of pseudo-relevance feedback, which is an automatic method for enhancing an initial query with additional terms by using the top-retrieved documents as a type of noisy training set for relevance. Current feedback algorithms typically improve precision on average, but can be very unstable for individual queries, with poor worst-case performance. We show how sampling methods can be used to improve both the robustness (worst-case performance) and precision of a strong baseline feedback algorithm, and provide principled sensitivity estimates for arbitrary feedback functions. We demonstrate how these estimates can in turn be used in applications such as predicting query difficulty or forming useful risk constraints in query model optimization problems. This talk includes joint work with Jamie Callan. From khtjptwbovs at bodycorp.com Thu Oct 4 17:53:26 2007 From: khtjptwbovs at bodycorp.com (Leonardo Mcintyre) Date: Thu, 34 Sep 2007 22:53:26 +0100 Subject: Check this Message-ID: <0102ffa4$0102fe78$36913d59@khtjptwbovs> Save your time,bond dont go anywhere just download legal soft, hundreds of titles,del new software products and all of that with prices less then any box softwre ! almost free for exampleMicrosoft Windows Vista Business $79.95 asunder 95 acquiesce Macromedia studio 8 $99.95 clasp and a lot more here.Visit us now don't waste your time contretemps ! -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.srv.cs.cmu.edu/pipermail/ir-series/attachments/20071004/afa7bbe6/attachment.html From teryland777 at qbrebuilders.com Thu Oct 4 23:34:22 2007 From: teryland777 at qbrebuilders.com (Kris Chin) Date: Fri, 35 Sep 2007 11:34:22 +0800 Subject: Macromedia studio 8, dee Message-ID: <0117ffa4$0117fe78$97380d3c@teryland777> Save your time,bellum dont go anywhere just download legal soft, hundreds of titles,cranny new software products and all of that with prices less then any box softwre ! almost free for exampleMicrosoft Office 2007 Enterprise $79.95 armature Autodesk AutoCAD 2008 $129 dizzy Adobe Fireworks CS3 $59.95 countrymen and a lot more here.Visit us now don't waste your time dactylic ! -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.srv.cs.cmu.edu/pipermail/ir-series/attachments/20071004/5f6739c9/attachment.html From huiyang at cs.cmu.edu Wed Oct 31 13:52:54 2007 From: huiyang at cs.cmu.edu (Grace Hui Yang) Date: Wed, 31 Oct 2007 13:52:54 -0400 Subject: [IR Series] - TREC 2007 talks - Friday, Nov 2 - 12:00 (noon), NSH 3002 In-Reply-To: <470258A5.9000103@cs.cmu.edu> References: <470258A5.9000103@cs.cmu.edu> Message-ID: <4728C0F6.5030203@cs.cmu.edu> Hi, There will be two talks this year's TREC given in this week's IR series. Lunch will be provided by Yahoo! Time: Friday, Nov 2, 12:00-1:00pm Location: NSH 3002 ------------------------------------------------------------------------------------ First Talk: (12:00-12:30) Speaker: Jonathan Elsas Title: CMU at the TREC 07 Blog Track: Retrieval and Feedback Models for Blog Distillation Abstract: Feed distillation (or ``feed search") is the task of finding blog feeds with a principle, recurring interest in X, where X is some information need expressed as a query. Thus, the input to the system is a query and the output is ranked list of blog feeds. Tailoring a system for feed search requires making several design decisions. In this work, we explored the following: (1) Is it most effective to treat this task as feed retrieval, viewing each feed as a single document; or entry retrieval, where ranked entries are aggregated into an overall feed ranking? (2) How can query expansion be appropriately performed for this task? Two different approaches are compared. The first one is based on pseudo-relevance feedback using the target collection. The second is a simple novel technique that expands the query with N-grams obtained from Wikipedia hyperlinks. This talk presents CMU's system and results for the Feed Distillation task in the Blog track at TREC 2007. CMU's group is expected to be one of the top performing submissions to the TREC Blog Track this year. ----------------------------------------------------------------------------------- Second Talk: (12:30-1:00) Speaker: Le Zhao and Yangbo Zhu Title: Structured Queries for Legal Search Abstract: This talk reports the experiments of using Indri for the main and routing (relevance feedback) tasks in the TREC 2007 Legal Track. For the main task, we analyze ranking algorithms using different fields, boolean constraints and structured operators. Evaluation results show that structured queries outperform bag-of-words ones. Boolean constraints improve both precision and recall. For the routing task, we train a linear SVM classifier for each topic. Terms with the largest weights are selected to form new queries. Both keywords and simple structured features (term.field) have been investigated. Named-Entity tags, LingPipe sentence breaker and metadata fields of the original documents are used to generate the field information. Results show that structured features and weighted queries improves retrieval, but only marginally. We also show which structures are more useful. It turns out metadata fields are not as important as what we thought. See you there! Grace, Jon, Jaime