Article of the Month - December 2024

Improving Cadastral Accuracy for Disaster Management: The Role of Segment Anything Model (SAM) in Digitizing Historical Cadastral Maps

Sanjeevan Shrestha, Tina Baidar and Shangharsha Thapa, Nepal


Sanjeevan SHRESTHA	Tina BAIDAR	Shangharsha THAPA

This article in .pdf-format (12 pages)

This paper was presented at the FIG Regional Conference 2024 in Kathmandu, Nepal, 14-16 November 2024

SUMMARY

Up-to-date cadastral maps with detailed land ownership, boundaries, and values, are crucial in disaster-prone regions like Nepal, where accurate land data significantly impact disaster risk management for efficient resource allocation, response planning, and so on. Given the challenges associated with updating cadastral mapping, there is a pressing need to digitize existing maps to establish an up-to-date cadastral database. The digitization of old cadastral maps faces challenges like inconsistent skill levels, human errors, and data quality issues, making the process time-consuming and prone to inaccuracies. Hence, automating the process is essential to create an accurate and up-to-date cadastral database.

This study explores the application of the Segment Anything Model (SAM) for automating the digitization of historical cadastral maps, specifically focussing on land parcel boundary extraction, specifically in the context of Nepal. Using a diverse dataset of scanned cadastral maps, the study evaluates SAM’s zero-shot segmentation performance under different prompting conditions, including bounding box, multi-point prompts, and their combinations. Key factors such as parcel size, shape, eccentricity, clarity of boundaries, and noise levels of the cadastral map were analyzed. SAM demonstrated promising results, particularly when employing combined prompts, but challenges arose in handling noisy data near parcel boundaries and complex configurations within the parcel. Moreover, false positives between segmented parcels continue to be significant challenges, and increasing the scanning resolution also did not noticeably improve segmentation accuracy.

The study concludes that SAM provides promising solutions for enhancing cadastral digitization in Nepal. The challenges faced highlight the need for integrating Geographic Information Systems (GIS) with SAM, along with human oversight, to ensure the creation of accurate and complete cadastral databases. Future research should focus on fine-tuning SAM for one-shot learning or using SAM-2 model and integrating it with diverse remote sensing data to further improve segmentation accuracy and resilience in land administration systems.

1. INTRODUCTION

Nepal is highly vulnerable to various natural disasters, including earthquakes, floods, landslides, and more recently, glacial lake outburst floods (GLOFs). These disasters not only destroy lives and properties but also adversely affect land administration by erasing physical land boundaries and destroying land records (Lukman Syahid, 2011). In the event of a natural disaster, land tenure will only remain secure if adequate land administration records exist or if landowners possess legal documentation proving their rights to the land (Mitchell, 2009). The adverse effects of disasters can be minimized by linking efficient land administration with disaster risk management. Cadastral maps are foundational to land administration systems as they provide detailed records of land parcels, ownership, boundaries, and legal rights. These maps are essential for managing land-related activities, including land registration, property taxation, and land use planning. Up-to-date cadastral information is essential for disaster risk management as it facilitates efficient resource allocation, improves response planning, ensures accurate damage assessment, and provides legal and administrative clarity (Lukman Syahid, 2011). It also enables informed decision-making and improves environmental and risk management strategies.

In the context of cadastral records of Nepal, the initial cadastral survey, completed in 1995 A.D., provided analog cadastral maps for all of Nepal but excluded densely populated areas such as village blocks and public lands (Sapkota, 2012). As demands grew for accurate and easily accessible land records, the Department of Land Information and Archive (DoLIA) was established in 2000 A.D. to implement a Land Information System (LIS) aimed at efficient land management. DoLIA began archiving cadastral records and developing software systems for acquiring spatial data from hard copies of cadastral sheets through digitization and their attribute data as well (Sapkota, 2012). Despite these advancements, significant challenges persist in the scanning and digitization of old maps, including susceptibility to human errors, variability in interpretation and digitization skills among personnel, and inconsistencies in data quality. The digitization process remains time-consuming and error-prone due to differing skill levels and interpretations among individuals working on different map sections, resulting in edge problems and data inconsistencies. Additionally, not all personnel are proficient in digital technology, further complicating the digitization efforts. Given the challenges associated with digitizing cadastral records, there is a pressing need to automate the process to establish an up-to-date cadastral database.

Several studies have been conducted since a long time on developing automatic map interpretation systems and methods for the automatic extraction of cadastral records, aiming to streamline and improve the efficiency of cadastral map digitization and analysis. One of the earliest studies used a baseline automatic cadastral map interpretation method that employed processes including noise removal and skeletonization of scanned maps, vectorization, parcel detection, and interpretation (Janssen, Duin, & Vossepoel, 2002). Among the recent studies, a study used a segmentation method that combined four steps of image processing algorithms to extract land regions automatically from historical cadastral maps and demonstrated that the method extracted land boundaries with an average error of 0.62% with a standard deviation of ± 0.61% (Kim, Lee, Lee, & Seo, 2014). The results imply that while the average error is low, there are some fluctuations in accuracy across different maps. The study also acknowledges limitations in the approach, particularly when dealing with maps that lack clear delineations or contain ambiguities. Another study overviewed the use of deep learning techniques including convolutional neural networks (CNN) and semantic segmentation methods, to automate the digitization of historical cadastral maps (Ignjatić, Nikolić, Rikalović, & Ćulibrk, 2018). The study addressed the limitations of the deep learning algorithms that they require large, high-quality training datasets and the models’ struggle with generalizing across different map types. Moreover, accuracy concerns persist, particularly with faded or complex map features, necessitating ongoing human oversight to correct errors. Furthermore, another study assessed the application of Object-Based Image Analysis (OBIA) procedures for the semi-automatic digitalization of heritage maps including historical cadaster maps which demonstrated OBIA techniuqe is viable approach to digitalization over classical pixel based classification methods (Gobbi et al., 2019). The limitations observed in previous studies are either lengthy procedures involved in the case of pixel-based and object-based classifications and complex (combination of) image processing algorithms or the unavailability of large training datasets for deep neural networks such as CNN to perform efficiently.

Segment Anything Model (SAM), recently released by Meta AI Research, is a foundational model in the field of artificial intelligence. SAM has been trained on a massive dataset, consisting of 11 million images and 1.1 billion masks, and it demonstrates impressive zero-shot performance across a wide range of segmentation tasks (Kirillov et al., 2023). Foundation models like SAM, which have made significant strides in both natural language processing (NLP) and more recently in computer vision, are capable of zero-shot learning. This means they can adapt to new datasets and perform unfamiliar tasks using 'prompting' techniques, even with little or no prior training. This capability has the potential to reduce human efforts during the digitization and annotation process and presents an opportunity to alleviate time-intensive tasks. A recent study demonstrated promising adaptability to segmentation of various remote sensing data (satellite, airborne, and UAV) and its analysis and recommended further research models to improve the model’s performance by integrating it with additional fine-tuning techniques and other network architectures (Osco et al., 2023).

This study explores the potential of the Segment Anything Model (SAM) for the automatic digitalization of historical cadastral maps, with a specific focus on land parcel boundary extraction. The primary objective is to assess the feasibility and effectiveness of SAM in automating the segmentation of land parcels from scanned cadastral maps into GIS databases. The model's robustness and adaptability were evaluated under varying scenarios and complexities of cadastral parcels in the context of Nepal. A Zero-shot segmentation technique, based on SAM, was employed throughout the study to examine its performance across diverse conditions.

2. MATERIALS AND METHODS

The study has investigated SAM’s segmentation capacity with different scanned cadastral maps under different prompting conditions. Figure 1 shows the schematic representation of the overall workflow of the study.

Figure 1: Schematic representation of a step-wise process for evaluating the efficacy of SAM

2.1 Cadastral Data Synthesis

The dataset for this study comprises a diverse array of scanned cadastral images, providing a broad foundation for evaluating the Segment Anything Model (SAM) in terms of robustness and adaptability across a wide range of conditions (Table 1). The georeferenced analog cadastral maps were systematically categorized into five key attributes: size, shape, visual clarity, noise condition, and scanning resolution, allowing for a detailed exploration of SAM’s capabilities. Regarding size, the dataset included images of uniform, large, and small dimensions, facilitating the assessment of SAM’s performance across varying parcel sizes. For shape, the dataset covered both regular parcels and those with significant eccentricity, enabling the model’s adaptability to irregular geometries to be tested. Visual clarity was addressed by comparing parcels with clear and blurred boundaries, which provided insight into SAM’s ability to handle imperfect or degraded imagery. Noise conditions were evaluated by including both noisy and noise-free images, simulating issues like scanning defects. All images were initially scanned at a resolution of 300 DPI to standardize the evaluation. To further explore the effect of resolution, the same cadastral maps were scanned at 300, 400, 500, 600, and 800 DPI, allowing an additional layer of analysis on SAM's performance in response to varying image quality and detail. This multi-dimensional dataset serves as a rigorous test bed for assessing SAM’s versatility and effectiveness in automating land parcel boundary segmentation under diverse conditions.

Table 1: Diverse cadastral datasets and prompting conditions

S.N	Condition	Scenario	Scanning Resolution	Target	Box	Point	Combination
1	Size	Equal size	300	Parcel	Yes	Yes	Yes
2		Big size	300	Parcel	Yes	Yes	Yes
3		Small size	300	Parcel	Yes	Yes	Yes
4	Shape	Regular	300	Parcel	Yes	Yes	Yes
5	Shape	Large eccentricity	300	Parcel	Yes	Yes	Yes
6	Visual clarity	Clear	300	Parcel	Yes	Yes	Yes
7	Visual clarity	Blur	300	Parcel	Yes	Yes	Yes
8	Noise Condition	Noise-free	300	Parcel	Yes	Yes	Yes
9	Noise Condition	Noisy	300	Parcel	Yes	Yes	Yes
10	Scanning resolution		300	Parcel	No	No	Yes
11			400	Parcel	No	No	Yes
12			500	Parcel	No	No	Yes
13			600	Parcel	No	No	Yes
14			800	Parcel	No	No	Yes

2.2 Prompt Configuration

The study particularly investigated SAM’s segmentation capacity in the context of automatic extraction of parcels from scanned analog cadastral maps under different prompting conditions, focusing on zero-shot segmentation. Multi-point and bounding box prompts were provided as a baseline. Bounding boxes (rectangular areas) highlight specific areas within the image restricting SAM’s segmentation per object (in our case each parcel) for the sake of segmentation. Moreover, multi-point prompts are a series of specific foreground and background points within the image to guide SAM’s processing. We also experimented with combining point-based and bounding box prompts in the segmentation process. This combined approach was intended to harness the strengths of both methods and enhance SAM's adaptability for automated cadastral segmentation.

2.3 Zero-Shot

This section outlines the process of adapting the SAM for automatic cadastral segmentation. The QGIS plugin "GeoSAM" was used to perform zero-shot segmentation(Zhao, Fan, & Liu, 2023). Initially, image features were extracted and saved using SAM’s image encoder through the plugin's encoding module. SAM offers various models, including ViT-H, ViT-L, and ViT-B, each with different computational requirements and architectural complexities (Kirillov et al., 2023). For this study, we employed the ViT-L model, which offers a balance between high accuracy and manageable computational demand. Using the saved image features and prompt encoder i.e. bounding boxes, multi-point, and combined approach, valid masks representing individual land parcels were generated and subsequently converted into polygon shapefiles.

2.4 Model Evaluation

The performance of adopted zero-shot models was evaluated by simply inspecting the visual quality of the segmentation. The segmentation result from each prompt action on each scanned image scenario was inspected visually and inference was made. This is because individual scenarios for cadastral parcels that are considered for the analysis are not present ideally within a scanned parcel image. There is the presence of a combination of multiple scenarios within an image scene, making it difficult to evaluate using quantitative metrics.

3. RESULTS AND ANALYSIS

This section explains the results obtained from various prompt configurations used for diverse cadastral datasets and analyzes the results through visual inspection and comparison of the outputs. For this, representative areas were selected for analysis, focusing on the unique characteristics of parcels in the context of Nepal.

The variation in parcel size and shape in Nepal is primarily due to the differing map scales and the geographic diversity of land parcels. Figure 2(i) illustrates the results of parcel extraction using SAM’s selected base prompts (multi-point and bounding box) and their combinations for areas with varying parcel densities. The figure demonstrates that for equally sized parcels, all base prompts performed comparably well, producing high accuracy, except at parcel boundaries where false negatives were observed. However, for areas with dense parcel configurations, a noticeable decline in accuracy was evident compared to the performance on equally sized parcels. In such dense areas, false negatives were observed not only at the parcel boundaries but also within the parcel interior when using base prompts. This underestimation was mitigated to some extent by employing a combination of the base prompts. The underestimation of dense parcels can be attributed to the complexity of closely packed parcels, which increases the challenge of accurately delineating boundaries. In these scenarios, the proximity of adjacent parcels may cause the model to struggle with distinguishing between them, leading to boundary confusion and misclassification. Additionally, the limited resolution of the base prompts in high-density areas may contribute to the difficulty in accurately capturing finer details within tightly clustered parcels.

Figure 2: Visualization of prediction of three variations of prompts of zero-shot segmentation of SAM on cadastral parcel extraction task from historical scanned cadastral images (i) based on parcel density (a) equally sized; (b) dense and variety of pixel; and (ii) based on combination of parcel size and its eccentricity. Green pixels are True Positive; red pixels are False Negative, and yellow pixels are False Positive.

Additionally, the capability of zero-shot segmentation was evaluated for all sizes of parcels, as presented in Figure 2(ii). Across all parcel sizes, the mixed prompt approach outperformed individual base prompts. In Figure 2-ii(a), both large and small parcels were accurately extracted using all prompt types, particularly when the parcel shape closely matched well-defined geometric forms, with an eccentricity value near one. However, there was an observed underestimation in delineating larger parcels with high eccentricity (i.e., where the length is significantly greater than the width). Figure 2-ii(b) further demonstrates that as parcel eccentricity increases, the performance of zero-shot segmentation declines, irrespective of parcel size. This finding suggests that the segmentation accuracy for parcels of different sizes is strongly correlated with their eccentricity. The increase in eccentricity introduces greater heterogeneity within the parcel shape, which poses a challenge to SAM’s segmentation capability.

Figure 3: Visualization of prediction of three variations of prompts of zero-shot segmentation of SAM on cadastral parcel extraction task from historical scanned cadastral images based on different visibility of parcel boundary. Green pixels are True Positive; red pixels are False Negative, and yellow pixels are False Positive.

Figure 4: Visualization of prediction of three variations of prompts of zero-shot segmentation of SAM on cadastral parcel extraction task from historical scanned cadastral images based on different noise levels. Green pixels are True Positive; red pixels are False Negative, and yellow pixels are False Positive.

In the case of scanned images with a lack of clarity in the parcel boundaries primarily due to suboptimal scanning processes and the use of pencil marks during parcel subdivision, the results obtained are shown in Figure 3. The clarity of parcel boundaries is a critical factor in accurate delineation. To assess the impact of boundary clarity, the zero-shot segmentation capability was tested across different levels of line clarity in the scanned images. Notably, all prompt types produced promising results in delineating parcels, even under varying degrees of boundary clarity or ambiguity.

Some representative scanned cadastral map images exhibited significant noise, both within and adjacent to parcel boundaries. Noise within the boundary, excluding parcel numbers, did not negatively impact the performance of SAM's zero-shot segmentation, as illustrated in Figure 4. In these cases, the model was able to delineate parcels accurately despite the internal noise. However, noise located adjacent to parcel boundaries significantly reduced the accuracy of segmentation, as shown by the red box in Figure 4. This adjacent noise interfered with the model's ability to precisely delineate parcel boundaries. Additionally, when faint boundary lines were accompanied by adjacent noise, the segmentation was further compromised. In such cases, illustrated by the yellow box in Figure 4, the model either failed to properly delineate the parcels or mistakenly merged two adjacent parcels into one. This highlights the negative impact of adjacent noise on segmentation accuracy and the importance of clear boundary delineation in scanned images.

Figure 5: Visualization of prediction of zero-shot segmentation of SAM on cadastral parcel extraction task from historical scanned cadastral images scanned at different scanning resolution levels. Green pixels are True Positive; red pixels are False Negative, and yellow pixels are False Positive.

A common issue across all the experiments was the occurrence of false negatives between parcels. This can be attributed to the fact that the cadastral maps were scanned at a relatively low resolution (300 DPI), where parcel boundary lines occupied only a few pixels, leading to segmentation errors. To address this, the same cadastral map was scanned at higher resolutions to assess the impact on mitigating boundary underestimation. Surprisingly, increasing the scanning resolution did not significantly reduce the occurrence of false negatives between parcels, as shown in Figure 5. In fact, the delineation capability of SAM further decreased with higher scanning resolutions. This reduction in performance can be attributed to the increased heterogeneity introduced by higher resolutions, which likely introduced more noise and finer details, complicating SAM's ability to accurately segment parcel boundaries.

In summary, our findings indicate that the combination of base prompts consistently outperforms individual base prompts in the zero-shot learning approach across all datasets. However, SAM's zero-shot approach faces challenges when handling noisy data near boundaries and areas with complex parcel configurations. Additionally, the occurrence of false positives between segmented parcels remains a persistent issue. These challenges highlight the need for integrating Geographic Information Systems (GIS) with SAM, along with human oversight, to ensure the creation of accurate and complete cadastral databases.

4. Conclusion

In this study, we conducted a comprehensive analysis of the zero-shot segmentation capabilities of the Segment Anything Model (SAM) for cadastral data extraction from scanned historical cadastral maps under various scenarios and complexities. Our analysis revealed that SAM's different prompting methods (points, bounding boxes, and combinations) performed notably well in most cases, except when dealing with noisy data near boundaries and areas with complex parcel configurations. The model demonstrated the potential to significantly reduce human workload and error with minimal or no supervision. However, this initial experiment was limited to exploring SAM's zero-shot capabilities. Future research should focus on evaluating SAM's one-shot segmentation capabilities as well as SAM-2 model, which may further enhance its performance. Additionally, SAM has the potential to integrate with diverse remote sensing data, such as UAV imagery, to quickly generate segmentation outputs without the need for extensive training. This makes SAM particularly well-suited for Nepal's varied geographic conditions, especially in post-disaster scenarios like earthquakes or floods. By incorporating SAM into existing GIS platforms and remote sensing workflows, Nepal's cadastral system can be made more resilient to natural disasters and ongoing land use challenges.

REFERENCES:

Gobbi, S., Ciolli, M., La Porta, N., Rocchini, D., Tattoni, C., & Zatelli, P. (2019). New tools for the classification and filtering of historical maps. ISPRS International Journal of Geo-Information, 8(10), 1–24. Link
Ignjatić, J., Nikolić, B., Rikalović, A., & Ćulibrk, D. (2018). Ignjatić.pdf. 4617(Cd), 42–47.
Janssen, R. D. T., Duin, R. P. W., & Vossepoel, A. M. (2002). Evaluation method for an automatic map interpretation system for cadastral maps. 125–128. Link
Kim, N. W., Lee, J., Lee, H., & Seo, J. (2014). Accurate segmentation of land regions in historical cadastral maps. Journal of Visual Communication and Image Representation, 25(5), 1262–1274. Link
Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., … Girshick, R. (2023). Segment Anything. In Proceedings of the IEEE/CVF International Conference on Computer VisionProceedings of the IEEE/CVF International Conference on Computer Vision (pp. 4015–4026).
Lukman Syahid, H. (2011). Land administration and disaster risk management : case of earthquake in Indonesia. 91.
Mitchell, D. (2009). Reducing Vulnerability to Natural Disasters in the Asia Pacific through Improved Land Administration and Management. International Federation of Surveyors Working Week 2009, Eilat, Israel, 3-8 May, (October 2010), 1–12.
Osco, L. P., Wu, Q., de Lemos, E. L., Gonçalves, W. N., Ramos, A. P. M., Li, J., & Marcato, J. (2023). The Segment Anything Model (SAM) for remote sensing applications: From zero to one shot. International Journal of Applied Earth Observation and Geoinformation, 124. Link
Sapkota, R. K. (2012). LIS Activities in Nepal: An Overview in prospect of DoLIAe. Nepalese Journal on Geoinformatics, 11, 23–28.
Zhao, Z., Fan, C., & Liu, L. (2023). Geo SAM: A QGIS plugin using Segment Anything Model (SAM) to accelerate geospatial image segmentation. Research Software. Link

BIOGRAPHICAL NOTES

Sanjeevan Shrestha is a Chief Survey Officer at the Survey Department with 13 years of experience under the Ministry of Land Management, Cooperative and Poverty Alleviation. He currently also serves as the Vice President of Nepalese Remote Sensing and Photogrammetry Society. He holds a Master of Science degree in Geospatial Technologies from Universidad Nova de Lisboa, Portugal and University of Munster, Germany through Erasmus Mundus program and a Bachelor in Geomatics Engineering from Kathmandu University, Nepal. His expertise includes remote sensing, geospatial analysis, geo-statistics, and the applications of deep learning and machine learning techniques.

CONTACTS

Sanjeevan Shrestha
Survey Department, Minbhawan. Kathmandu,
NEPAL
Tel. +9779865464752