Using sgrep for querying structured text files
Jani Jaakkola, and Pekka Kilpeläinen: Using sgrep for querying structured text files. Report C-1996-83, Department of Computer Science, University of Helsinki, November 1996. 11 pages. <http://www.cs.helsinki.fi/TR/C-1996/83>
Full paper: gzip'ed Postscript file
Sgrep is a Unix tool for searching the contents of text files. Sgrep implements an algebra of unrestricted text fragments called regions. The algebra allows the retrieval of document components, represented as regions, based on conditions on their relative containment and ordering. This simple yet powerful model is suitable for querying structured document formats like electronic mail, RTF, LaTeX, HTML, or SGML documents. We describe the sgrep query language and give examples of its use. Especially, we explain how sgrep can be used for querying and assembling SGML documents.
Categories and Subject Descriptors:
General Terms: Design, Languages
Additional Key Words and Phrases: text seach tools, structured documents, SGML
Online Publications of Department of Computer Science, Anna Pienimäki