Reading "The Unknown" with a Network Mapping Device: Graph Data Visualization for Hyperfiction Works

paper, specified "short paper"
Authorship
  1. 1. Gabriel Viehhauser

    Universität Stuttgart

  2. 2. Claus-Michael Schlesinger

    Universität Stuttgart

  3. 3. Pascal Hein

    Universität Stuttgart

  4. 4. Andre Blessing

    Universität Stuttgart

  5. 5. Mona Ulrich

    Deutsches Literaturarchiv Marbach

Work text
This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.


Because of their hypertextual structure, the use of topological metaphors for the analysis of hyperfiction works appears to be obvious: Reading hyperfiction seems like walking through a maze (Schmundt, 1994). However, discussion about mapping a structure that is otherwise perceived in a processual manner, namely by (non-)linear reading, has been abundant (Ciccoricco, 2004).
For our approach to
The Unknown, a hyperfiction novel by William Gillespie, Frank Marquardt, Scott Rettberg and Dirk Stratton (Gillespie et al:, 1998-2002), we use computational methods to extract network data from the site to visualize its hypertext structure. As web pages tend to change over time, our software Warc2graph focuses on WARC files, which are a standardized format for web archiving (IIPC, n.d.), containing data pertaining to the complete communication between a client and the server. Given a WARC file, Warc2graph extracts references from the archived object, encompassing any HTML tag that contains reference information, such as hyperlinks in an tag, embedded media, automatic forwarding defined in a tag etc. We use three different methods for reference extraction: a) evaluate the WARC header, b) parse the HTML workload, c) parse the Document Object Tree of the workload utilizing Selenium, a software controlled browser engine (see Ulrich et al., 2021, for a detailed description of the methodology applied by Warc2graph).

The extracted reference information can be understood as the graph structure of an archived web object. We visualize this structure as a network showing each resource as a node and the references between resources as edges. Graph data analytics provide different measures that can be used to gain insights into the object’s structure, e.g. in order to describe the functional role of certain nodes in the network. The network thus provides a structural overview of the hypertext work, which opens up new possibilities for reading hyperfiction, which, by the nature of its medium often defies the rules of linearity: Besides the sequential connection of network nodes, the graph data analysis e.g. reveals possible end points of the story line(s), which can be manifold in hypertext. It is also possible to detect central passages that can be crucial for the decisions the reader has to make, as she finds her own way through the text, since it is likely that these passages are represented by nodes that show characteristic patterns.

Figure 1:
Internal reference structure of
The Unknown, node size pertaining to degree

Figure 1 shows such a network of
The Unknown.

We provide the extracted network data together with a copy of the Warc2graph software package via Zenodo, DOI: 10.5281/zenodo.5773099
The nodes are sized according to their degree, revealing pages that appear of central importance for the site as well as clusters of interlinked documents. The network thus establishes a spatial text-view, which seems particularly apt for
The Unknown, a text that very explicitly emphasizes its status as a non-linear hypertext (Rettberg, 2019: 80-83). By sketching out the fictional narrative of a book tour of the authors through the US the novel itself becomes a road trip through the unknown territory of hypertext, an “encyclopedic novel in a network environment” that evokes “worlds that never quite attain or even aspire to a coherent ‘entirety’” (Ciccoricco, 2007: 129). Whereas these ‘worlds’ can be related to the clusters of the network, the graph metrics also seem to be instructive for the characterization of single nodes: E.g., one of the nodes with the highest degree refers to the page ‘default.html’, which contains an ironic metafictional discussion on the unsuitability of the concept of a home page for hypertext literature, which, strictly speaking, as a single starting point contradicts the principle of nonlinearity. Revealingly, the node of default.html has 129 outgoing edges, which reflects the fact that the page is ironically interspersed with links that in fact could make the site apt to serve as an explanatory starting page for the novel. However, the authors have deliberately undermined this function by ‘hiding’ the page in the jumble of HTML-documents, since only five ingoing links refer to the node. 

The analysis also reveals that an approximation of textual structures with Warc2graph has to reflect technical constraints that could affect its results: In
The Unknown, the main navigation was copied into each of the over 700 HTML-documents, automatically providing the navigation elements with a network degree of 759 and clustering these nodes in our graph. Another cluster with highly interlinked elements consists of 123 HTML-pages with watercolor drawings by Katie Gilligan, which accompany the text as sort of a diary. All pages are interlinked by a calendar on each page, leading to a similar degree for each document. Thus, technical features like site navigation could reveal structural similarities of nodes, but also have to be taken into account as potential biases for further interpretation.

If reading a hypertext is like walking through a maze, the graph might be near to a map of the maze. However, in order to construct a network, numerous parameters need to be set, which implies decisions that are not independent from reading and interpreting the text. Thus, rather than provide a definite map of the maze, our networks add additional views on structural qualities of The Unknown.

Bibliography

Ciccoricco, D. (2004). Network Vistas: Folding the Cognitive Map.
Image & Narrative,
8.
http://www.imageandnarrative.be/inarchive/issue08/daveciccoricco.htm#013
(accessed April 27, 2022).

Ciccoricco, D. (2008). Reading Network Fiction. Tuscaloosa: Univ. of Alabama Press.

Gillespie, W., Marquardt, F.,
Rettberg, S. and Stratton, D. (1998).
The Unknown
.
https://unknownhypertext.com/
(accessed April 27, 2022).

IIPC. n.d. The WARC Format.
https://iipc.github.io/warc-specifications/specifications/warc-format/warc-1.1/
(accessed April 27, 2022).

Rettberg, S. (2019). Electronic Literature. Cambridge, UK ; Medford, MA: polity.

Schmundt, H. (1994). Autor Ex Machina: Electronic Hyperfictions: Utopian Poststructuralism and the Romanticism of the Computer Age.
AAA: Arbeiten Aus Anglistik Und Amerikanistik
19 (2): 223–46.
http://www.jstor.org/stable/43023678
(accessed April 27, 2022).

Ulrich, M., Schlesinger, C.-M., Blessing, A., and Hein, P. (2021). Networks of Net Literature - Modelling, Extracting and Visualizing Link-Based Networks in the DLA Corpus of Net Literature.
https://elmcip.net/node/16380

(accessed April 27, 2022).

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

In review

ADHO - 2022
"Responding to Asian Diversity"

Tokyo, Japan

July 25, 2022 - July 29, 2022

361 works by 945 authors indexed

Held in Tokyo and remote (hybrid) on account of COVID-19

Conference website: https://dh2022.adho.org/

Contributors: Scott B. Weingart, James Cummings

Series: ADHO (16)

Organizers: ADHO