Using Ancillary Text to Index Web-Based Multimedia Objects

Lyne Da Sylva; James Turner

Authorship

1. Lyne Da Sylva

ESBI - Université de Montréal
2. James Turner

ESBI - Université de Montréal

Work text

This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.

PériCulture is the name of a research project at the Université
de Montréal which is part of a larger project based at the
Université de Sherbrooke. The parent project aimed to form a
research network for managing Canadian digital cultural
content. The project was financed by Canadian Heritage and
was conducted during the fiscal year 2003-2004. PériCulture
takes its name from péritexte and culture, péritexte being one
of a number of terms used (in French, our working language)
to mean ancillary text associated with images and sound. It is
a sister project to DigiCulture, another part of the same larger
research project which studied user behaviours in interactions
with Canadian digital cultural content. The general research
objective of PériCulture was to study indexing methods for
Web-based nontextual cultural content, specifically still images,
video, and sound. Specific objectives included:
1. identifying properties of ancillary text useful for indexing;
2. comparing various combinations of these properties in terms
of performance in retrieval;
3. contributing to the development of bilingual and
multilingual searching environments;
4. developing retrieval strategies using ancillary text and
synonyms of useful terms found therein.
In computer science, research into indexing images and sound
focuses on the low-level approach, performing statistical
manipulations on primitives in order to identify semantic
content. This approach is also referred to as the 'content-based
approach' (e.g. Gupta and Jain, Lew). In information science,
research into indexing images and sound focuses on associating
textual information with the nontextual elements, and this often
involves manipulating ancillary text. This approach is referred
to as the 'high-level' or 'concept-based approach' (e.g.
Rasmussen, O'Connor, O'Connor, and Abbas). A number of
factors militate in favour of automating the high-level approach
as much as possible. These include the very large volume of
Web-based materials available, the disparity among cataloguing
and indexing methods from one collection to another, and the
high cost and relative inconsistency of human indexing. Our work in this project focuses on text associated with
Web-based still images, and builds on previous work in this
area of information science (e.g. Goodrum and Spink,
Jörgensen, Jörgensen et al., Turner and Hudon). We identified
a number of Web sites that met our criteria, i.e., that contained
multimedia objects, that had text associated with these objects
that was broader than file names and captions, that were
bilingual (English and French), and that housed Canadian digital
cultural content. We identified keywords that were useful in
indexing and studied their proximity to the object described.
We looked at indexing information contained in the Meta and
Alt tags, and whether other tags contained useful indexing
terms. We studied whether standards such as the Dublin Core
were used. We identified Web-based resources for gathering
synonyms for the keywords.
Our study found that a large number of useful indexing terms
are available in the ancillary text of many Web sites with
cultural content. We evaluated various types of ancillary text
as to their usefulness in retrieval. Our results suggest that these
terms can be manipulated in a number of ways in automated
retrieval systems to improve search results. Cross-language
comparison of the results reinforces our previous research
results, which suggest that indexing in other languages can be
generated automatically from a single language using
Web-based tools.
Rich information that can be used for retrieval is available in
many places on Web sites with cultural content, from the file
name to explicit information in captions to descriptive
information in surrounding text to the contents of various
HTML tags. Algorithms need to be developed to exploit this
information in order to improve retrieval.
Finally, we feel that our work is useful because of the synergy
created by the approaches we use. We are both interested in
image indexing, but come from different fields. Lyne Da Sylva's
expertise is in linguistics and James Turner's in information
science. By working together, we are able to pool our
knowledge and develop richer methods than would otherwise
be available to either of us for approaching the question of
automating indexing for images and other multimedia objects.
Bibliography
Goodrum, A., and A. Spink. "Image searching on the Excite
web search engine." Information Processing and Management
27.2 (2001): 295-312.
Gupta, A., and Ramesh C. Jain. " Visual information retrieval."
Communications of the ACM 40.5 (71-79): 71-79.
Jörgensen, Corinne. Image attributes: an investigation. PhD
thesis, Syracuse University, 1995.
Jörgensen, Corinne. "Image attributes in describing tasks: an
investigation." Information Processing and Management
34.2/3 (1998): 161-174.
Jörgensen, Corinne, Alejandro Jaimes, Ana B. Benitez, and
Shih-Fu Chang. "A conceptual framework and empirical
research for classifying visual descriptors." Journal of the
American Society for Information Science and Technology
(JASIST) 52.11 (2001): 938-947.
Lew, Michael S. Principles of visual information retrieval.
New York: Springer, 2001.
O'Connor, Brian C., Mary K. O'Connor, and June M. Abbas.
"User reactions as access mechanism: an exploration based
upon captions for images." Journal of the American Society
for Information Science 50.8 (1999): 681-697.
Rasmussen, Edie M. "Indexing images." Annual Review of
Information Science and Technology 32 (2004): 169-196.
Turner, James M., and Michèle Hudon. "Multilingual metadata
for moving image databases: preliminary results."
L'avancement du savoir : élargir les horizons des sciences de
l'information, Travaux du 30e congrès annuel de l'Association
canadienne des scicnces de l'information. Ed. Lynne C.
Howarth, Christopher Cronin and Anna T. Slawek. Toronto,
2002. 34-45.

Full text license: This text is republished here with permission from the original rights holder.

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

In review

ACH/ALLC / ACH/ICCH / ALLC/EADH - 2005

Hosted at University of Victoria

Victoria, British Columbia, Canada

June 15, 2005 - June 18, 2005

139 works by 236 authors indexed

Affiliations need to be double checked.

Conference website: http://web.archive.org/web/20071215042001/http://web.uvic.ca/hrd/achallc2005/

Series: ACH/ICCH (25), ALLC/EADH (32), ACH/ALLC (17)

Organizers: ACH, ALLC

Using Ancillary Text to Index Web-Based Multimedia Objects

1. Lyne Da Sylva

2. James Turner

ACH/ALLC / ACH/ICCH / ALLC/EADH - 2005