A deep learning approach to Cadastral Computing

Long paper
Authorship
  1. Sofia Ares Oliveira, École Polytechnique Fédérale de Lausanne (EPFL)
  2. Isabella di Lenardo, École Polytechnique Fédérale de Lausanne (EPFL)
  3. Bastien Tourenc, École Polytechnique Fédérale de Lausanne (EPFL)
  4. Frederic Kaplan, École Polytechnique Fédérale de Lausanne (EPFL)



Introduction
Among all the diverse typologies of administrative systems, the fiscal-cadastral sources retracing the ownership of land are undoubtedly the richest and, in a sense, the most coherent records. The need for efficient tax collection pushed different administrations and political regimes in the 18th century to develop coherent, scale-invariant property tracing systems. These systems were progressively generalised at the turn of the 19th century with the abolition of privileges for certain classes of citizens and the establishment of republican administrations for managing cities and countries (Kain and Baigent 1993). The cadastre, by introducing a constant collection of taxes calculated on a fixed percentage, defines a new conception of the city and of the urban space, and marks the transition from the Ancien Régime to the Modern State. The introduction of a constant, proportional and impartial tax determined a new conception of the city, adapted to statistical computations. In some sense, for the first time, the cadastre transforms the city into a computational object. We use the neologism “cadastral computing” to refer to the operations performed on such primary sources.

This article presents a generic approach for automatically processing the information contained in the Napoleonic cadastres. The cadastres established during the first years of the 19th century cover a large part of Europe. For many cities they provide one of the first geometrical surveys, linking precise parcels with identification numbers. These identification numbers point to register lines with the names of the parcels’ owners (Fig. 1). As the Napoleonic cadastres include millions of parcels, they offer a detailed snapshot of a large part of Europe’s population at the beginning of the 19th century (Clergeot 2007).

Figure 1. Identification numbers in parcels and in the register

To develop a generic approach adapted to the processing of these administrative documents, two difficult challenges must be solved: (1) developing algorithms capable of robustly segmenting maps into parcels and administrative tables into cells, and (2) developing solutions for transcribing handwritten text containing mentions of people or places as well as identification numbers. Until recently, these two problems were considered well beyond the state of the art. The results of this article build on the important progress recently made on both issues using deep learning architectures. In 2017, an initial study on cadastre extraction showed promising results in parcel extraction and identifier recognition; however, it was entirely designed as an ad hoc pipeline fine-tuned for a particular cadastre (Ares Oliveira, Lenardo, and Kaplan 2017). A year later, a generic deep-learning segmentation engine relying on a convolutional neural network (Ares Oliveira, Seguin, and Kaplan 2018) demonstrated that it was possible to design a generic architecture to segment many typologies of documents. In this article, we use the genericity of this approach to develop the first fully automatic pipeline to transform the Napoleonic cadastres into an information system.

Method
The automatic processing of the cadastre maps aims to extract the parcels as geometrical shapes and to transcribe the parcels’ identification numbers. It comprises three main steps:
1. Training of the deep neural network on manually annotated data
2. Segmentation of the maps into meaningful regions and objects
3. Transcription of the identification numbers
Our segmentation network is a fully convolutional neural network inspired by the U-Net architecture (Ronneberger, Fischer, and Brox 2015). It uses a pretrained ResNet-50 network (He et al. 2016) as encoder, which speeds up training, reduces the amount of training data needed and helps generalization. The full details of the architecture and the open-source implementation can be found in (Ares Oliveira, Seguin, and Kaplan 2018). The network is trained to extract the parcel contours and the text using annotated data from the Venetian cadastre (Fig. 2). The training data corresponds to roughly one third of one map sheet among the 26 maps of the city of Venice (roughly 800 parcels).
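To make this architecture concrete, the following is a minimal sketch of such an encoder-decoder segmentation network in PyTorch. It is an illustration only, not the authors' open-source dhSegment implementation: the decoder layout, channel sizes and the assumed output classes (background, parcel contour, text) are simplifications chosen for the example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import resnet50

class CadastreSegmenter(nn.Module):
    """U-Net-style segmentation network with a pretrained ResNet-50 encoder.

    Outputs one probability map per class (e.g. background, parcel contour, text).
    """
    def __init__(self, n_classes=3):  # three classes assumed for illustration
        super().__init__()
        backbone = resnet50(weights="IMAGENET1K_V1")
        # Encoder stages are kept separate so their feature maps can be reused as skips.
        self.stem = nn.Sequential(backbone.conv1, backbone.bn1, backbone.relu)
        self.pool = backbone.maxpool
        self.layer1, self.layer2 = backbone.layer1, backbone.layer2
        self.layer3, self.layer4 = backbone.layer3, backbone.layer4
        # Lightweight decoder: upsample, concatenate skip connection, reduce channels.
        self.up3 = nn.Conv2d(2048 + 1024, 512, 3, padding=1)
        self.up2 = nn.Conv2d(512 + 512, 256, 3, padding=1)
        self.up1 = nn.Conv2d(256 + 256, 128, 3, padding=1)
        self.head = nn.Conv2d(128, n_classes, 1)

    def forward(self, x):
        s0 = self.stem(x)                 # 1/2 resolution
        s1 = self.layer1(self.pool(s0))   # 1/4
        s2 = self.layer2(s1)              # 1/8
        s3 = self.layer3(s2)              # 1/16
        s4 = self.layer4(s3)              # 1/32
        d3 = F.relu(self.up3(torch.cat([F.interpolate(s4, size=s3.shape[-2:]), s3], 1)))
        d2 = F.relu(self.up2(torch.cat([F.interpolate(d3, size=s2.shape[-2:]), s2], 1)))
        d1 = F.relu(self.up1(torch.cat([F.interpolate(d2, size=s1.shape[-2:]), s1], 1)))
        logits = self.head(F.interpolate(d1, size=x.shape[-2:]))  # back to input size
        return torch.sigmoid(logits)      # per-pixel class probabilities
```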

Figure 2. Sample of training data for cadastre map segmentation. Parcel contours are in red, text is in green.

The transcription network is a convolutional recurrent neural network (CRNN) (Shi, Bai, and Yao 2017) that produces a sequence of characters when given an image segment containing text. The network is trained on samples of numbers from the Venetian archives and on numbers synthetically generated with MNIST digits (LeCun, Cortes, and Burges 1998) (Fig. 3).
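As an illustration, a CRNN of this kind can be sketched as follows (a simplified example rather than the exact configuration used here): convolutional layers extract features from a grayscale text strip, a bidirectional LSTM reads them as a sequence, and per-timestep character scores are produced for training with a CTC loss. The digit-only alphabet and layer sizes are assumptions.

```python
import torch
import torch.nn as nn

class DigitCRNN(nn.Module):
    """Convolutional recurrent network mapping a grayscale text strip to
    per-timestep character scores, trainable with nn.CTCLoss."""
    def __init__(self, n_chars=11):  # 10 digits + CTC blank (assumed alphabet)
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(128, 256, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((1, None)),  # collapse height, keep width as time axis
        )
        self.rnn = nn.LSTM(256, 128, bidirectional=True, batch_first=True)
        self.fc = nn.Linear(2 * 128, n_chars)

    def forward(self, x):                    # x: (batch, 1, height, width)
        f = self.conv(x)                     # (batch, 256, 1, width / 4)
        f = f.squeeze(2).permute(0, 2, 1)    # (batch, time, features)
        out, _ = self.rnn(f)
        return self.fc(out)                  # (batch, time, n_chars)
```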

Figure 3. Example of training data for the transcription system. Left: synthetically generated numbers; right: numbers from the Venetian archives.

The segmentation model obtained after training is able to predict the parcel contours and text regions at pixel level (Fig. 4). The watershed-by-flooding algorithm (Beucher 1979) is applied to the parcel contour predictions, which allows the extraction of parcel objects as polygonal shapes. Text regions are cropped, horizontally aligned and converted into grayscale image segments for further processing by the transcription system.
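A sketch of this post-processing step, using standard SciPy and scikit-image routines rather than the authors' exact code: seeds are placed in regions far from the predicted contours, the contour probability map is flooded by the watershed algorithm, and each resulting basin is vectorised into a polygon. The threshold value is illustrative.

```python
import numpy as np
from scipy import ndimage
from skimage.segmentation import watershed
from skimage.measure import find_contours

def extract_parcel_polygons(contour_prob, threshold=0.5):
    """Turn a per-pixel parcel-contour probability map into parcel polygons.

    contour_prob: 2-D float array in [0, 1] predicted by the segmentation network.
    Returns a list of (N, 2) arrays of (row, col) vertices, one per basin.
    """
    # Pixels far from any predicted contour become seeds for the flooding.
    inside = contour_prob < threshold
    markers, _ = ndimage.label(inside)
    # Flood from the seeds, using the contour probability as the relief.
    labels = watershed(contour_prob, markers)
    polygons = []
    for lab in np.unique(labels):
        mask = labels == lab
        # Approximate each basin boundary with a closed contour polygon;
        # non-parcel basins (streets, canals, ...) are not filtered here.
        contours = find_contours(mask.astype(float), 0.5)
        if contours:
            polygons.append(max(contours, key=len))
    return polygons
```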

Figure 4. Output of the segmentation network (overlay in purple). Left: text extraction; right: contour extraction.

The image segments containing text are fed to the transcription network, which outputs a predicted number, i.e. a sequence of digits. Each transcription is then linked to its corresponding parcel. The contours of the parcels (whether they contain an identification number or not) are saved as polygonal shapes and exported in JSON format. In our case, since the images have previously been georeferenced, the coordinates are exported as geographical coordinates and can therefore be imported directly into any geographic information system.
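Since the coordinates are geographical, the export can follow the GeoJSON convention, which most geographic information systems read natively. The sketch below is illustrative; the property name id_number is an assumption, not the schema actually used.

```python
import json

def export_parcels_geojson(parcels, path):
    """parcels: list of (coords, transcription) pairs, where coords is a list of
    (lon, lat) tuples in geographical coordinates and transcription may be None."""
    features = []
    for coords, transcription in parcels:
        ring = list(coords)
        if ring[0] != ring[-1]:
            ring.append(ring[0])  # GeoJSON polygon rings must be closed
        features.append({
            "type": "Feature",
            "geometry": {"type": "Polygon", "coordinates": [ring]},
            "properties": {"id_number": transcription},  # assumed property name
        })
    with open(path, "w") as f:
        json.dump({"type": "FeatureCollection", "features": features}, f)
```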

Results
Two evaluations are performed in order to assess the performance of the system: the geometrical accuracy of the extracted parcels and the accuracy of the transcribed identification numbers.

Figure 5. Visualization of the results of the automatic extraction of parcels. The red rectangle indicates the parcels used as training data for the segmentation network.

Geometrical evaluation
The numbers of geometrical shapes extracted and manually annotated are listed in Tab. 1. After a filtering step, which keeps only shapes whose area ranges from 2 m² to 15 000 m², the total number of extracted parcels is 28 711, among which 18 138 contain a transcription.

Table 1. Total number of geometries in the automatic extraction and the manual annotation. The first three rows relate to automatically extracted parcels, the last two rows show the statistics for manually annotated parcels.

  Geometries                               Number
  Geometries extracted automatically       31 342
  Geometries remaining after filtering     28 711
  ---- with ID number                      18 138
  Manually annotated geometries            16 946
  ---- with ID number                      15 634

The quality of the parcel extraction is evaluated by measuring the intersection over union (IoU) between the geometries produced automatically and almost 17 000 manually annotated shapes. Precision and recall with three different IoU thresholds, 0.5 (acceptable), 0.7 (good) and 0.9 (excellent), are reported in Tab. 2. The recall values show that a large majority of parcels are extracted (85% in the strictest case). The low precision values are mainly due to the incorrect extraction of streets, squares, canals, etc., which are currently not filtered out (example in Fig. 6).
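For reference, the IoU between an extracted polygon and an annotated one can be computed directly on the geometries, for instance with shapely. The matching rule in the sketch below (a parcel counts as correct if it overlaps a counterpart with IoU above the threshold) is an assumption made to keep the example self-contained, not a description of the exact evaluation protocol.

```python
from shapely.geometry import Polygon

def iou(poly_a: Polygon, poly_b: Polygon) -> float:
    """Intersection over union of two polygonal parcels."""
    inter = poly_a.intersection(poly_b).area
    union = poly_a.union(poly_b).area
    return inter / union if union > 0 else 0.0

def precision_recall(extracted, annotated, threshold=0.5):
    """Precision over extracted polygons, recall over annotated ones (assumed rule)."""
    tp_pred = sum(1 for pred in extracted
                  if any(iou(pred, gt) >= threshold for gt in annotated))
    tp_gt = sum(1 for gt in annotated
                if any(iou(gt, pred) >= threshold for pred in extracted))
    precision = tp_pred / len(extracted) if extracted else 0.0
    recall = tp_gt / len(annotated) if annotated else 0.0
    return precision, recall
```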

Figure 6. Example of false extraction of streets and canals (in blue)

Table 2. Evaluation of the geometrical shape extraction with different intersection over union (IoU) thresholds.

  IoU     Correct parcels   Precision   Recall
  t=0.5   15 999            0.557       0.944
  t=0.7   15 292            0.533       0.902
  t=0.8   14 440            0.503       0.852

Transcription evaluation
We assess the performance of the transcription of the parcels’ identification numbers by computing the number of correct predictions, and report the precision and recall values in Tab. 3. The current method assumes that the identifiers are located within the parcel; identifiers that lie partially or completely outside the geometrical shape are therefore not correctly transcribed (Fig. 7), resulting in a lower recall.

Figure 7. Examples of identification numbers outside or partially outside the parcel.

In order to increase the precision, and since we can assume that spatially close parcels have numerically close identifiers, we tried to discard false predictions by determining whether a transcription was ‘plausible’ or not, using information from its spatial neighbourhood. A transcription is considered an outlier if the (numerical) difference between the predicted number and the median of its 5 neighbouring transcriptions is greater than 10. This results in a significant increase in precision (up to 93%), but at the expense of a decrease in recall.
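This plausibility check can be sketched as follows. The median-of-5-neighbours rule and the threshold of 10 follow the description above; the brute-force nearest-neighbour search on parcel centroids is an illustrative choice.

```python
import numpy as np

def filter_outliers(centroids, numbers, k=5, max_diff=10):
    """Keep only transcriptions whose value is numerically close to the
    median of their k nearest transcribed neighbours.

    centroids: (n, 2) array of parcel centroid coordinates
    numbers:   (n,) array of predicted identifier numbers
    Returns a boolean mask marking the transcriptions considered plausible.
    """
    centroids = np.asarray(centroids, dtype=float)
    numbers = np.asarray(numbers, dtype=float)
    keep = np.zeros(len(numbers), dtype=bool)
    for i in range(len(numbers)):
        # Brute-force k nearest neighbours, excluding the parcel itself.
        dists = np.linalg.norm(centroids - centroids[i], axis=1)
        neighbours = np.argsort(dists)[1:k + 1]
        median = np.median(numbers[neighbours])
        keep[i] = abs(numbers[i] - median) <= max_diff
    return keep
```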

Table 3. Evaluation of the transcriptions of the parcels’ identification numbers.

                                            Correct transcriptions   Precision   Recall
  Transcriptions                            11 101                   0.612       0.710
  Transcriptions after outlier detection     8 070                   0.927       0.516

Perspectives
This initial study demonstrates, on a particular case, that 90% of the urban geometrical structure and more than 50% of a city’s population can be automatically remapped with high precision using only generic pipelines. Even if these numbers need to be confirmed on the basis of other case studies, the genericity of the methods used makes us optimistic about the possibility of conducting a large-scale study in the coming years. Such datasets call for a confrontation with the large number of historical hypotheses about urban society at the beginning of the 19th century that have so far been formulated on the basis of much smaller sets of evidence. Thanks to the standardisation processes of the Napoleonic administration, we hope in the coming months to extend this systematic processing beyond Venice to a large part of Europe.

Bibliography

Ares Oliveira, Sofia, Isabella di Lenardo, and Frederic Kaplan. (2017). Machine Vision Algorithms on Cadaster Plans. In Premiere Annual Conference of the International Alliance of Digital Humanities Organizations (DH 2017).

Ares Oliveira, Sofia, Benoit Seguin, and Frederic Kaplan. (2018). dhSegment: A Generic Deep-Learning Approach for Document Segmentation. In Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition (ICFHR 2018).

Beucher, S. (1979). Use of Watersheds in Contour Detection. In Proceedings of the International Workshop on Image Processing. CCETT. https://ci.nii.ac.jp/naid/10008961959/en/.

Clergeot, Pierre (coord.). (2007). Cent Millions de Parcelles en France : 1807, Un Cadastre pour l’Empire. Éditions Publi-Topex.

He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. (2016). Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–78.

Kain, R. J. P., and Elizabeth Baigent. (1993). The Cadastral Map in the Service of the State: A History of Property Mapping. University of Chicago Press.

LeCun, Yann, Corinna Cortes, and Christopher J. C. Burges. (1998). The MNIST Database of Handwritten Digits.

Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 234–41. Springer.

Shi, Baoguang, Xiang Bai, and Cong Yao. (2017). An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 39 (11): 2298–2304.


Conference Info

ADHO - 2019
"Complexities"

Hosted at Utrecht University

Utrecht, Netherlands

July 9, 2019 - July 12, 2019

436 works by 1162 authors indexed

Series: ADHO (14)

Organizers: ADHO