Using sgrep for querying structured text filesJani Jaakkola, and Pekka Kilpeläinen: Using sgrep for querying structured text files. Report C-1996-83, Department of Computer Science, University of Helsinki, November 1996. 11 pages. <http://www.cs.helsinki.fi/TR/C-1996/83> Full paper: gzip'ed Postscript file AbstractSgrep is a Unix tool for searching the contents of text files. Sgrep implements an algebra of unrestricted text fragments called regions. The algebra allows the retrieval of document components, represented as regions, based on conditions on their relative containment and ordering. This simple yet powerful model is suitable for querying structured document formats like electronic mail, RTF, LaTeX, HTML, or SGML documents. We describe the sgrep query language and give examples of its use. Especially, we explain how sgrep can be used for querying and assembling SGML documents. Index Terms
Categories and Subject Descriptors:
General Terms: Design, Languages Additional Key Words and Phrases: text seach tools, structured documents, SGML |
Online Publications of Department of Computer Science, Anna Pienimäki