出版時間:2011-3 出版社:機械工業(yè)出版社 作者:(西班牙) Ricardo Baeza-Yates,(巴西)Berthier Ribeiro-Neto 頁數(shù):913
Tag標(biāo)簽:無
內(nèi)容概要
本書詳細(xì)介紹了信息檢索的所有主要概念和技術(shù),以及有關(guān)信息檢索方面的所有新變化,使讀者既可以對現(xiàn)代信息檢索有一個全面的了解,又可以獲取現(xiàn)代信息檢索所有關(guān)鍵主題的詳細(xì)知識?!冬F(xiàn)代信息融合技術(shù)在組合導(dǎo)航中的應(yīng)用》的主要內(nèi)容由信息檢索領(lǐng)域的代表人物Baeza-Yares和]Ribeiro-Neto編著,對于那些希望深入研究關(guān)鍵領(lǐng)域的讀者,書中還提供了由其他主要研究人員編寫的關(guān)于特殊主題的發(fā)展現(xiàn)狀。與上一版相比,本版在內(nèi)容和結(jié)構(gòu)上都有大量調(diào)整、更新和充實,其中新增內(nèi)容在60%到70%左右。具體更新情況如下新增了文本分類、網(wǎng)絡(luò)信息爬取、結(jié)構(gòu)化文本檢索和企業(yè)搜索等章節(jié),以及關(guān)于開源搜索的一個附錄。·全面改寫了用戶界面、多媒體檢索和數(shù)字圖書館等內(nèi)容。拓展了一些章節(jié),介紹了信息檢索方面的新的重要進(jìn)展,如語言模型、新的評價方法、查詢的特點、基于聚類和分布式信息檢索等。
作者簡介
Ricardo Baeza-Yates,于加拿大滑鐵盧大學(xué)獲得計算機科學(xué)博士學(xué)位,現(xiàn)為雅虎歐洲和拉丁美洲研究院副總裁,主管雅虎在巴塞羅納(西班牙)和圣地亞哥(智利)(的研究中心,并監(jiān)管海法研究中心。他曾擔(dān)任智利計算機科學(xué)學(xué)會主席、智利大學(xué)計算機科學(xué)系Web研究中心主任、ICREA教授,并且他還在巴塞羅納法布拉大學(xué)創(chuàng)立了信息與通信技術(shù)系Web研究組?,F(xiàn)在他仍是智利大學(xué)和法布拉大學(xué)的兼職教授。他的主要研究方向為算法與數(shù)據(jù)結(jié)構(gòu)、信息檢索、用戶界面以及可視化在數(shù)據(jù)庫中的應(yīng)用等。
書籍目錄
Preface to the Second EditionPreface to the First EditionAuthors' Acknowledgements to the Second EditionAuthors' Acknowledgements to the First EditionPublishers' Acknowledgements1 Introduction1.1 Information Retrieval1.1.1 Early Developments1.1.2 Information Retrieval in Libraries and Digital Libraries1.1.3 IR at the Center of the Stage1.2 The IR Problem1.2.1 The User's Task1.2.2 Information versus Data Retrieval1.3 The IR System1.3.1 Software Architecture of the IR System1.3.2 The Retrieval and Ranking Processes1.4 The Web1.4.1 A Brief History1.4.2 The e-Publishing Era1.4.3 How the Web Changed Search1.4.4 Practical Issues on the Web1.5 Organization of the Book1.5.1 Focus of the Book1.5.2 Book Contents1.6 The Book Web Site: A Teaching Resource1.7 Bibliographic DiscussionUser Interfaces for Searchby Marti Hearst2.1 Introduction2.2 How People Search2.2.1 Information Lookup versus Exploratory Search2.2.2 Classic versus Dynamic Model of Information Seeking . 2.2.3 Navigation versus Search2.2.4 Observations cf the Search Process2.3 Search Interfaces Today2.3.1 Getting Started2.3.2 Query Specification2.3.3 Query Specification Interfaces2.3.4 Retrieval Results Display2.3.5 Query Reformulation2.3.6 Organizing Search Results2.4 Visualization in Search Interfaces2.4.1 Visualizing Bcolesn Syntax2.4.2 Visualizing Query Terms within Retrieval Results2.4.3 Visualizing Relationships Among Words and Documents 2.4.4 Visualization for Text Mining2.5 Design and Evaluation of Search Interfaces 2.6 Trends and Research Issues2.7 Bibliographic DiscussionModeling3.1 IR Models3.1.1 Modeling and Rankirg3.1.2 Characterization cf an IR Model3.1.3 A Taxonomy of IR Models3.2 Classic Information Retrieval3.2.1 Basic Concepts3.2.2 The Boolean Model3.2.3 Term Weighting 3.2A TF-IDF Weights3.2.5 Document Length Normalization3.2.6 The Vector Model3.2.7 The Probabilistic Mcdel3.2.8 Brief Comparison of Classic Models3.3 Alternative Set Theoretic Models3.3.1 Set-Based Model3.3.2 Extended Boolean Model3.3.3 Fuzzy Set Model 3.4 Alternative Algebraic Models 3.4.1 Generalized Vector Space Model 3.4.2 Latent Semantic Indexing Moo'el3.4.3 Neural Netwozk Model3.5 Alternative Probabilistic Mcdels3.5.1 BM253.5.2 Language Models3.5.3 Divergence from Randomness 3.5.4 Bayesian Network Models3.6 Other Models……4 Retrieval Evaluation5 Relevance Feedback and Query Expansion6 Documents:Languages &Properties7 Queries:Languages &Properties8 Text Classiftcation9 Indexiong and Searching 10 Parallel and Distributed IR11 Web Retrieval12 Web Crawling 13 Structured Text Retrieval 14 Multimedia Information retrieval15 Enterprise Search16 Library Systems17 Digital Libraries
章節(jié)摘錄
Libraries were among the first institutions to adopt IR systems for retrieving information. Usually, library systems were initially developed by academic institutions and later by commercial vendors. In the first generation, such systems consisted of anautomation of existing processes such as card catalogs searching, restricted to authornames and titles. In the second generation, increased search functionality was added to include subject headings, keywords, and query operators. In the third generation,which is currently being deployed, the focus has been on improved graphical in terfaces,electronic f rms, hypertext features, and open system architectures. Traditional library management system vendors include Endeavor InformationSystems Inc., Innovative Interfaces Inc., and EOS International. Among systems developed with a research focus, we distinguish MELVYL developed by the California Digital Library at University of California, and the Cheshire system developed originally at UC Berkeley and lately in cooperation with the University of Liverpool.Further details on these library systems can be found in Chapter 16.1.1.3 IR at the Center of the Stage Despite its maturity, until recently, IR was seen as a narrow area of interest restrictedmainly to librarians and infrmation experts. Such a tendentious vision prevailed for many years, despite the rapid dissemination, among users of modern personalcomputers, of IR tools for multimedia and hypertext applications. In the beginning of the 1990s, a single fact changed once and for all these perceptions the in troductionof the World Wide Web. The Web, invented in 1989 by Tim Berners-Lee, has become a universal repository of human knowedge and culture. Its success is based on the conception of a standarduser interface which is always the same, no matter the computational environmentused to run the interface, and which allows any user to create their own documents.As a result, millions of users have created billions of documents that compose the largest human repository of knowledge in history. An immediate consequence is that finding useful information on the Web is not always a simple task and usually requiresposing a query to a search engine, i.e., running a search. And search is all aboutIR and its technologies. Thus, a hnost overnight, IR has gained a place with other technologies at the center of the stage. ……
圖書封面
圖書標(biāo)簽Tags
無
評論、評分、閱讀與下載