12 Apr 2007, 3:29:51
29 Sep 2006, 10:29:05
09 Mar 2006, 8:59:14
09 Mar 2006, 8:37:24
09 Mar 2006, 9:04:40
TAPoR@UVic - Research
Map of Early Modern London Project
Principal Researcher:
Dr. Janelle Jenstad, Assistant Professor, Dept. of English, University of Victoria
Home:
http://mapoflondon.uvic.ca/
Description:

"The Map of Early Modern London" is a hyperlinked atlas of sixteenth- and seventeenth-century London based on the "Agas" woodcut map of the 1560s. Over 200 sites and streets are linked to pages that provide a full historical and archaeological survey, quotations from John Stow's Survey of London, and a bibliography of literary references. Over 200 additional sites and streets are identified.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.

Funding:
SCHRC
Timeline:
Ongoing

More arrow
Technologies:
PHP, eXist, XQuery
Technical Development:

The development of the Map of Early Modern London website was broken up into three distict sections: establishing a standardized encoding format, creating a queryable repository for the encoded documents and providing an infrastructure for users to access these documents in a variety of ways.

Text for the website was marked up in TEI P5 format and then stored in an eXist XML database. These two technologies work very well in tandem to provide an extensible, standardized and robust way of storing the text for this project. PHP along with XML-RPC was chosen to provide the infrastructure for the website due to it's ubiquity on TAPoR servers and its rapid prototying/development time.

TAPoR Outcomes:

This project has created an extractable infrastructure for providing rapid access to data stored in an eXist database. It also provides concepts useful for creating a web of links between disparate articles. This allows users of the website to traverse information in a way that is natural and intuitive. The final stage of this project will be the creation of a system for handling submission and review of proposals and will necessitate the addition of an administrative utility.

Robert Graves Diary (1935-39) Electronic Edition Project
Principal Researchers:
Dr. Elizabeth Grove-White, Professor, Dept. of English, University of Victoria
Co-investigators &; Collaborators:
Chris Petter, Special Collection, University of Victoria Library
Home:
Description:
Graves' diary (1935-39) manuscript includes 1546 pages including 117 enclosures: letters, clippings, photographs post cards, notes, games, etc. The projects' objective is to produce the first scholarly edition, in print and electronic form of this unpublished diary.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.
Funding:
SCHRC
Timeline:
March 2006

More arrow
Technologies:
XQuery, eXist, Cocoon, Tomcat
Technical Development:
The development history of the Graves' Project is documented in detail at the web site URL above. The present phase is to make the transcribed Graves Diary documents searchable online. To do this, the project team has deployed the open source native XML database system eXist, with general development focused on improving the speed and efficiency of information retrieval. Additional development is focused on stabilizing and standardizing a format for the rendering and placement of enclosures (some multi-page) alongside their digital images.
TAPoR Outcomes:
The primary outcome is a programming template based on standardized formats that will automate the process of rendering and placing enclosures accompanying digital images. This tool will be developed through Open source encoding, and made available for general purpose use.
Le mariage sous l'Ancien Régime: une anthologie critique
Principal Researcher:
Claire Carlin, Professor of French, University of Victoria
Co-investigators &; Collaborators:
Martin Holmes, Programmer, HCMC, University of Victoria
Home:
http://tapor.uvic.ca/cocoon/mariage/
General Description:
At the core of the project is the annotation of images of seventeenth-century French engravings, and the subsequent analysis and publication of the annotated images, linked to larger texts on the subject. The basic problems which we address are a) the question of how to allow for TEI- and SVG-encoded data to exist concurrently in the same document; b) what techniques or tools can be developed which allow for this method to be formalized; and, c) how textual analyses can be aided by these methodologies. This anthology and critical edition contains texts and images generated by the radical changes undergone by the institution of marriage in early modern France. Included are medical, legal, religious, satirical and literary documents and engravings, all heavily annotated. Work done so far focuses on polemical texts and images.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.
Funding:
University of Victoria
Timeline:
Ongoing

More arrow
Technologies:
TAPoR generated image mark up tool
Technical Development:
Dr. Carlin and Martin Holmes developed an XML model which allows for both the text of the annotations, and the image areas to which they correspond, to be encoded in a single document. The markup of text is based on a subset ot he TEI P5 guidelines. The TEI encourages customization of its guidelines to accommodate for a wide range of implementations, an approach this project demonstrates. The markup of the images has been achieved with the Image Markup tool, an application progam. This project has been the vehicle driving the development of the Image Markup tool (see full description of the mark-up tool). The project team and researcher will continue to work together creating features and modifying its utility based on the demands of the project.

The file format is TEI P5 XML, with the image data and annotation areas defined in a block of SVG code inside the document (making a multiple-namespace document).
TAPoR Outcome:
The files produced so far have been assembled into an eXist database, which allows them to be displayed in a browser. The user can click on annotations in the text and see the relevant part of the image highlighted. We intend to to do further work on various publication and display formats based on the eXist database.
The tool, developed exclusively using open source code libraries, is an open source application. The project has been the vehicle driving the tool's development, and will continue to be the primary application for the project, though it has already been applied to other UVic TAPoR projects (such as VI history) and likely will be used in others as well.

Other significant technical developments associated with the project: 1) saving the state of the Image Markup tool on closing so that on re-opening the application the program returns to an identical state, 2) detection of installed browsers, 3) use of reflection on the application structure to support translation of the interface.
Internet Shakespeare Editions
Principal Researcher:
Michael Best, University of Victoria
Co-investigators & Collaborators:
  • Dr. Ray Siemens, Department of English, University of Victoria
  • Dr. Daniel German, Department of Computer Science, University of Victoria
Home:
http://ise.uvic.ca
Description:
A major scholarly site that publishes multimedia editions of Shakespeare's plays, contextual information about his life and times, and a database of Shakespeare in performance. The plays are tagged in advanced XML on which it is possible to perform sophisticated searches and textual analysis.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.
Funding:
SCHRC
Timeline:
Ongoing. TAPoR commitment to eXist search mechanism: Nov 2005

More arrow
Technologies:
XML technologies, Cocoon, PosgreSQL
Technical Development:
The ISE website provides an opportunity to showcase the integration of many advanced technologies, since Shakespeare's works are unrivaled in their combination of complex textuality and the multimedia experience of the performance of his plays.

It is a principle of the ISE to employ only open-source software.

At the core of the project is Cocoon, a "pipeline" that transforms XML and XHTML files into dynamic web pages. The site is at present a combination of some static pages, and a large number of pages dynamically generated from a range of web applications. In future the static pages will themselves be delivered dynamically. The design of the site will thus be fully generalized, so that it can be modified or added to without further development of software. The major web applications themselves employ advanced technology:

  1. Shakespeare in Performance is a PosgreSQL database delivering text graphics, sound, and video records of Shakespeare productions around the globe. Fully relational in structure, the database resembles the International Movie Database, with the addition of actual artifacts of performance as well as data on those taking part. The front end of the database is PHP, which delivers XML to Cocoon. Under construction is a back end that will permit error-free data entry from a web browser.
  2. The Text Viewer is an application that combines multiple XML files of the texts, their annotations, and associated collations into a single viewer interface. Since Shakespeare's texts appeared in multiple formats (Quarto, Folio) and are being re-edited (Modern), the XML files necessarily involve the use of overlapping hierarchies of information. This technical hurdle has been overcome by an ingenious adaptation of the concept of XML namespaces, such that the separate trees can be extracted elegantly from a single master file. In due course we will generate conventional TEI-conformant XML from these same master files.
  3. The Facsimile Viewer is an extension of the Text Viewer that permits multiple views, browsing, and searches of facsimiles of the plays. It is also based on a PosgreSQL database. This feature provides more power and flexibility than any other related site on the web.
  4. The Links Database provides links to over 700 external sites of interest to readers of Shakespeare. This is an XML database, and is the central repository of all links referenced in on the site.
TAPoR Outcome:
None cited to date
An Electronic Edition of the Devonshire MS
Principal Researcher:
Ray Siemens, Associate Professor, Dept. of English, University of Victoria. Canada Research Council Chair, Humanities Computing.
Home:
http://etcl.uvic.ca/dms_edition/
Description:
Research for my project, Epistolary Politics and the Poetic Miscellany: An Exploration of the Devonshire Manuscript, will lead to the first major study and first textual edition, in print and electronic form, of the Devonshire MS (British Library Additional MS 17,492). The Devonshire MS is a poetic miscellany of the 1530s and 1540s containing courtly poetry by the canonical early Renaissance figures Thomas Wyatt and Henry Howard, the Earl of Surrey; the work of, or transcriptions of the work of others, by prominent court figures Mary Shelton, Margaret Douglas, Mary Howard, Thomas Howard and Anne Boleyn; and a mix of transcribed extracts of medieval verses by Geoffrey Chaucer, Thomas Hoccleve, and Richard Roos.
Funding:
SCHRC
Timeline:
On-going

More arrow
Technologies:
XML Technologies
MaClure House: Heritage Architectural Archives Project
Principal Researcher:
Dr. Martin Segger
Co-investigators & Collaborators:
Chris Petter, Uvic Library
Home:
None
Description:
This project would begin as a prototype focusing on the architectural design process from start to finish of a Maclure house. The prototype would involve 50 - 100 digitized drawings, architectural plans and photos chronicling the various stages of design and construction of the house. Accompanying text and notes would be annotated or included as captions to the still images, but would constitute original manuscripts of the architect, and any supporting commentary by others involved in the design and construction process. Bit mapping processes would prove helpful, and there is some possibility that GIS technologies could assist with identifying locations and mapping.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.
Funding:
None
Timeline:
Prototype Spring 2006

More arrow
Technologies:
XML technologies, TAPoR image mark-up tool
Technical Description:
The prototype phase of the project involves selecting less than a dozen plan documents and digitizing those to image files. It also involves compiling a list of elements and attributes to use in the markup based on the image files and then estimates of the amount of work required to provide markup appropriate to various audiences (scholars, construction professionals, general public).
The research phase of the project will involve reviewing, selecting and cataloging the relevant archives for the project. The digitizing and markup of the drawings will be based on the procedures and schema developed in the prototype, likely via the tools or coding routines developed to facilitate this process. Simultaneously, we will be working on the front-end interfaces appropriate to the intended audiences to allow them to interact with the material. Finally, we will populate a server-based database and test the front-end interfaces.
Some funding (by way of digitizing commitment by a commercial operator) is available for the prototype phase of the project. Further funding would need to be identified for the research phase.
TAPoR outcomes:
The prototype phase of this project focussed on transferable coding schemas and tools to facilitate the encoding of the images and produced useful information to guide the subsequent phase of the project. Some of that information is specific to building plan documents, and other information is of wider applicability. This project clearly addresses the UVic TAPoR node's mandate to explore multi-media projects.
Great Unsolved Mysteries in Canadian History
Principal Researcher:
Dr. John Lutz, University of Victoria, co-director
Co-investigators & Collaborators:
Home:
http://www.canadianmysteries.ca
Description:
The "Great Unsolved Mysteries in Canadian History" project is focused on providing high-quality materials to high schools and universities for the teaching of historical methods and Canadian History. The project, based at the University of Victoria, the Université de Sherbrooke and the Ontario Institute for Studies in Education at the University of Toronto, has started a series of instructional websites based on the premise that students can be drawn into Canadian history and archival research through the enticement of solving historical cold crimes. All the material is provided free as a public service.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.
Funding:
Heritage Canada
Timeline:
Phase 3 scheduled for completion Spring 2006

More arrow
Technologies:
MySQL, PHP, XHTML
Technical Development:
The database system developed in 2004 for phase 2 of the project was designed to emulate the real-life work flow for each document that is transcribed and ultimately published on the website. The system, composed of php scripts and a MySQL database, incorporated user authentication, and a document submission function made up of steps that intuitively grouped required elements for each document (such as citation elements) and the ability to upload a soft copy of the transcribed document that ultimately passes on to subsequent steps. Document processing is made up of listings indicating documents available for certain tasks in the appropriate processing order (such as translation, or markup), the ability to work on documents with a check-out and check-in process in place, and the ability to restrict skill-specific tasks to users with the right credentials. An automated script within the system is also setup to generate the project's websites on the fly by laying out each site's directory structure and creating HTML files containing text from within the database, thus eliminating the manual steps that are required to perform this task every time there is a revision made to the sites' contents.

To make the system complete, a flexible search function allows users to sift through thousands of documents by titles, authors and keywords; a custom administration panel that allows for user administration, progress reports for each website complete with document and word count as well as work flow status statistics; and the ability to override certain aspects of the work flow in order to facilitate work among project members are integral parts put in place.

Currently, in phase 3, the database continues to expand to accommodate the new projects and the needs of the Great Unsolved Mysteries in Canadian History team, including revisions, enhanced automation of tasks and finer tuned search and retrieval capabilities.

Files must be marked up to XHTML manually and uploaded to the database, and this work continues daily at HCMC. Additionally, the project has design and image editing needs, which are also met mainly by staff working in HCMC and elsewhere.

TAPoR Outcome:
Currently, the project is using TAPoR resources, which are listed as in-kind donations in partnership with the Canadian Heritage grant funding. HCMC and TAPoR staff are providing some technical supervision and guidance. HCMC staff were key partners in the initial design, development and implementation of the database.

The database has some cross application for similar projects of this scope and purpose. The database will continue to be upgraded and expanded throughout the lifecycle of the "Mysteries" projects, now in the 3 phase of development, with 3-5 phases more to follow.
Renaissance English Knowledge base (REKn)
Principal Researcher:
Ray Siemens, Associate Professor, Dept. of English, University of Victoria. Canada Research Council Chair, Humanities Computing
Home:
None
Description:
This knowledge-base will reflect [a] primary materials (texts, images, audio) related to the Devonshire MS specifically and the Renaissance period more generally, and [b] secondary materials related to the same. In itself, this represents a unique and important contribution to the field; it will be constructed via a methodology in keeping with that noted by Lancashire ("Renaissance Knowledge Base"). The knowledge-base will be constructed initially via adapted electronic documents gathered through research collaborations and partnerships with established groups and digitizing by my research group. These materials -- text, visual, and aural -- will be given a common metadata structure and textual materials will receive a light common encoding that will include the product of semi-automated lemmatisation processes.
Funding:
Canadian Foundation for Innovation
Timeline:
Ongoing

More arrow
Technologies:
XML technologies, to be determined
Katakana Database Project
Principal Researchers:
Dr. Joe Kess, Yuko Igarashi
Home:
http://tapor.uvic.ca/cocoon/katakana/
Description:
This research involves creating a computerized database of katakana words in Japanese, in order to assess the changing role of the katakana script in modern Japanese written materials. The ultimate aim is to chart a basic shift in direction in Japanese orthographic practices in respect to how words are written. There has been little attempt to sort out these three usage practices to date, and we intend to do just that by creating a computerized database of katakana words in current media practice, using magazines, newspapers, and the written segments of television advertisements, and making them searchable and displayable within open source technologies.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.
Funding:
University of Victoria
Timeline:
Spring 2006

More arrow
Technologies:
XML technologies, Cocoon, Javascript, XHTML
Technical Development:
The schema has been developed and Oxygen is being used for the XML mark up, and an eXist database will generate statistics about the data, and present it through a website.

The mark up focuses on distinguishing Hirigana, Katakana, and Kanji characters via seg tags and place attributes. Currently work is being done on the feature structure array, and so far 6 types of documents have been developed with 10 projected. All of the original sources (newspapers, magazines) have been transcribed into Unicode, and the project focus is on the mark up of the data and the creation of the database and search engine.
TAPoR Outcomes:
The coding libraries can be applied to other projects potentially using non-western scripts in a Unicode format. The transferability lies mainly in the knowledge gained and its application to future projects involving the creation of word counts and frequency lists in Asian scripts. It serves as a prototype for potentially very large databases and the mark up of Asian script materials.

The second outcome involves the project's role as a test bed for identifying and documenting problems with the underlying eXist platform. The Katakana project has assisted us in pinpointing eXist platform development issues at a macro scale, with the end goal to create a stable eXist platform on which to build projects such as the above. Currently, TAPoR UVic is contracting an eXist developer to implement necessary changes (see eXist development for more details) for this project's (and others') future development.
Vancouver Island History
Principal Researcher:
Dr. Patrick A. Dunae, History Department, Malaspina University-College
Co-investigators & Collaborators:
  • Dr. John Lutz, History Department, University of Victoria
  • Dr. Peter Baskerville, History Department, University of Victoria
  • Dr. Eric Sager, History Department, University of Victoria
Home:
http://vihistory.uvic.ca/
Description:
This web site is principally about the history of Vancouver Island, but is also a teaching and research tool and a vehicle for exploring the larger field of Canadian history during the late 19th and early part of the 20th century. The viHistory site hosts several large, searchable databases including a census database for Victoria and Vancouver Island, c. 1871-1901. A 5% sample of the population of the province of British Columbia, derived from the CFI-funded Canadian Families Project, is also accessible here. We are now (Fall 2005) in the process of moving the viHistory database from proprietary software and a small server to a broader open-source platform and a more robust server. The expertise of faculty and staff associated with TAPoR are facilitating this process.

To view a more comprehensive description of the technical development, as well as projected TAPoR outcomes for this project, click More.
Timeline:
January, 2006

More arrow
Technologies:
PosgreSQL, PHP, MapServer, ka-Map
Technical Development:
Existing data has been ported from an ASP/MS Access database to a php/postgresql database, with added functionality. Database development focussed on creating an abstracted set of php functions that can be re-implemented in many different contexts that use php/postgresql. The most recent development demonstrates the use of a map interface to query the Postgres database. We are working on various features in that user interface to allow grouping and filtering of results. The objective is to allow the user to specify what kind of information she or he wants and to click on or select an area in the map and get back census and similar information related to the mapped location.
TAPoR Outcome:
Transferability to any project that intends to gather vector data as ancillary information and use it to filter results. Its potential use at the University of Victoria has been identified for the following projects currently under development or considered for future development: "London Map Project", "Towards Virtual Victoria & 3D History: A prototype for multidimensional historical representation", and "Creating an historical Geographic Information System [GIS] of Victoria, 1901 - 1902", and it may be suitable for the Maclure project as well.

Other outcomes include documented research into:
  • porting of proprietary databases into open source databases, including integrating multiple databases into one, and bridging steps for a more streamlined process.
  • creation of an annotation tool that allows outsiders to comment on the data and make their comments readable to the world in a wiki-like fashion.
  • phonetic spelling engine for the queries within open source programs such as PosgreSQL and PHP.
  • creation of tools for functionality for handling binary objects (still images, sound files and so forth)
  • user interfaces for the editors and public to allow them to work effectively within an open source environment