The 5th Forlì TEI Workshop, 7-9 June 2004
Course objectives
- An understanding of the purpose and utility of text
encoding and markup for language corpora
- An overview of the TEI Guidelines and their
recommendations for language corpora
- Practical experience in:
- using XML editing software to
- create new encoded texts
- standardize existing digital texts
- using XML retrieval tools to
- build a searchable corpus
- analyse the corpus
Course Prerequisites
The course uses a single page from Punch
magazine as an example document. This, and all
other sample and exercise files, are collected in a single zip archive which you need to download and
copy to your hard disk. Participants are also expected to provide
samples of their own materials.
As background reading, we recommend
The Programme
- Monday
- 1100: Digital texts with XML and the
TEI (Introductory talk on texts, textuality,
digitization and what they mean to you)
- Text Analysis exercise
- Brief technical introduction to XML concepts and a few words
about the TEI
- Exercise 1 from the Worksheet for Emacs Exercises
(Practical introducing features of XML, tagging the poem on the
Punch page)
- 1300: Lunch!
- 1430: Talk on TEI contents
(introduces some more basic TEI elements)
- Exercise 2 from the Worksheet for Emacs Exercises
(Practical using a real TEI schema for speech and drama)
- 1600: Coffee!
- 1630: Practical session continues. By the end of the day, you should
aim to have at least two texts in valid TEI XML.
- 1830: Close
- Tuesday
- 0900: Metadata matters (This
lecture discusses metadata, in particular how to use the TEI
Header to provide contextual information for a TEI corpus)
- Exercise 3 from the Worksheet for Emacs Exercises:
building a TEI header for the Punch materials
- 1030: Coffee!
- 1100: Taming the TEI Tiger (An
overview of the TEI and its customization using Roma)
- Roma pizza chef
exercise: building a simple schema for a web page with graphics and
sound
- 1300: Lunch
- 1430: Making silk purses from
sows' ears (This lecture discusses some issues in
converting from other formats into XML)
- Practical Session: make a corpus
header for your own material. Decide on your DTD. Start
converting your text and validating it
- 1800: Discussion and review
- Wednesday
- 0930: Introducing Xaira (Lecture on the design and features of
the Xaira system)
- Indexing a corpus with Xaira: exercise 1 from the Xaira Tutorial
- 1100: Coffee!
- Indexing your own corpus with Xaira: exercise 2 from the Xaira Tutorial
- 1300: Lunch!
- 1430: Exploring your own corpus and others with Xaira
- 1700: Discussion and review
- 1930: Dinner (Agroturismo Montecini)
The Participants
- Claudio Bendazzoli cbendazzoli@tiscali.it
- Cinzia Bevitori c.bevitori@cliro02.cliro.unibo.it
- Olivier Bremer olivier84@cheapnet.it
- Lou Burnard lou.burnard@oucs.ox.ac.uk
- Mariella D'Elia mariellad@sslmit.unibo.it
- Sabrina Fusari sabrinafusari@racine.ra.it
- Mery Martinelli mery.martinelli@libero.it
- Cristina Monti cristinamonti@libero.it
- Sara Piccioni spiccioni@sslmit.unibo.it
- Maria Chiara Russo
- Davide Smiraglio davidesmiraglio@libero.it
- Cinzia Valenti cinziavalenti74@yahoo.co.uk