XML retrieval

Documents usually have a content and a structure. The content refers to the text of the document, whereas the structure refers to how a document is logically organized. An increasingly common way to encode the structure is through the use of a mark-up language. Nowadays, the most widely used mark-up...

Full description

Bibliographic Details
Main Author: Lalmas, Mounia.
Format: Electronic
Language:English
Published: San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool Publishers, c2009.
Series:Synthesis lectures on information concepts, retrieval, and services (Online), # 7.
Subjects:
Online Access:Abstract with links to full text
LEADER 05077nam a2200577 a 4500
001 3428
005 20090709121016.0
006 m e d
007 cr cn |||m|||a
008 090708s2009 caua fsab 000 0 eng d
020 # # |a 9781598297874 (electronic bk.) 
020 # # |z 9781598297867 (pbk.) 
024 7 # |a 10.2200/S00203ED1V01Y200907ICR007  |2 doi 
035 # # |a (CaBNvSL)gtp00534965 
040 # # |a CaBNvSL  |c CaBNvSL  |d CaBNvSL 
050 # 4 |a QA76.76.H94  |b L258 2009 
082 0 4 |a 005.72  |2 22 
100 1 # |a Lalmas, Mounia. 
245 1 0 |a XML retrieval  |c Mounia Lalmas.  |h [electronic resource] / 
260 # # |a San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) :  |b Morgan & Claypool Publishers,  |c c2009. 
300 # # |a 1 electronic text (ix, 99 p. : ill.) :  |b digital file. 
490 1 # |a Synthesis lectures on information concepts, retrieval, and services,  |v # 7  |x 1947-9468 ; 
500 # # |a Part of: Synthesis digital library of engineering and computer science. 
500 # # |a Title from PDF t.p. (viewed on July 8, 2009). 
500 # # |a Series from website. 
504 # # |a Includes bibliographical references (p. 81-98). 
505 0 # |a Introduction -- Basic XML concepts -- Element -- Well-formed XML document -- Document type declaration -- XML schema -- XML documents as trees -- Historical perspectives -- Structured document retrieval -- Passage retrieval -- Structured text retrieval -- Hypertext and web retrieval -- Data- vs document-centric XML documents -- Querying semi-structured data -- Content-oriented XML retrieval -- INitiative for the evaluation of XML retrieval (INEX) -- Focused retrieval -- Query languages -- Structural constraints -- Classification of XML query languages -- Content-only -- Content-and-structure -- Examples of XML query languages -- XPath -- NEXI -- XQuery -- XQuery full-text -- Discussion -- Indexing strategies -- Element-based indexing -- Leaf-only indexing -- Aggregation-based indexing -- Selective indexing -- Distributed indexing -- Structure indexing -- Discussion -- Ranking strategies -- Element scoring -- Contextualization -- Propagation -- Aggregation -- Merging -- Processing structural constraints -- Discussion -- Presentation strategies -- Dealing with overlaps -- Presenting elements in context -- Entry points -- Discussion -- Evaluating XML retrieval effectiveness -- Document collections -- Topics -- Relevance assessments -- Retrieval tasks -- Measures -- Discussion -- Conclusions -- XML element retrieval -- Beyond XML element retrieval -- Beyond XML retrieval -- Bibliography -- Biography. 
506 # # |a Abstract freely available; full-text restricted to subscribers or individual document purchasers. 
510 0 # |a Compendex 
510 0 # |a INSPEC 
510 0 # |a Google scholar 
510 0 # |a Google book search 
520 3 # |a Documents usually have a content and a structure. The content refers to the text of the document, whereas the structure refers to how a document is logically organized. An increasingly common way to encode the structure is through the use of a mark-up language. Nowadays, the most widely used mark-up language for representing structure is the eXtensible Mark-up Language (XML). XML can be used to provide a focused access to documents, i.e. returning XML elements, such as sections and paragraphs, instead of whole documents in response to a query. Such focused strategies are of particular benefit for information repositories containing long documents, or documents covering a wide variety of topics, where users are directed to the most relevant content within a document.The increased adoption of XML to represent a document structure requires the development of tools to effectively access documents marked-up in XML. This book provides a detailed description of query languages, indexing strategies, ranking algorithms, presentation scenarios developed to access XML documents. Major advances in XML retrieval were seen from 2002 as a result of INEX, the Initiative for Evaluation of XML Retrieval. INEX, also described in this book, provided test sets for evaluating XML retrieval effectiveness. Many of the developments and results described in this book were investigated within INEX. 
530 # # |a Also available in print. 
538 # # |a Mode of access: World Wide Web. 
538 # # |a System requirements: Adobe Acrobat reader. 
650 # 0 |a XML (Document markup language) 
690 # # |a Element 
690 # # |a Structure 
690 # # |a Structured document retrieval 
690 # # |a Focused retrieval 
690 # # |a Element retrieval 
690 # # |a XML 
690 # # |a Query languages 
690 # # |a Indexing strategies 
690 # # |a Ranking algorithms 
690 # # |a Presentation scenarios 
690 # # |a INEX 
730 0 # |a Synthesis digital library of engineering and computer science. 
830 # 0 |a Synthesis lectures on information concepts, retrieval, and services (Online),  |v # 7.  |x 1947-9468 ; 
856 4 2 |u https://ezaccess.library.uitm.edu.my/login?url=http://dx.doi.org/10.2200/S00203ED1V01Y200907ICR007  |3 Abstract with links to full text