Benchmark on High Density Aerial Image Matching
ISPRS Logo  

Benchmark on
High Density Aerial Image Matching

EuroSDR Logo

Background and Scope of the project

Innovations in matching algorithms as well as the increasing quality of digital airborne cameras considerably improved the quality of elevation data generated automatically from aerial images. This development motivated the launch of the joint ISPRS/EuroSDR project “Benchmark on High Density Aerial Image Matching” aiming at evaluating the potential of photogrammetric 3D data capture in view of the ongoing developments of software for automatic image matching. Basic scope is the evaluation of 3D point clouds and DSM produced from aerial images with different software systems. Such a comparative evaluation provides a platform for software developers to demonstrate the state-of-the-art of their ongoing developments. Furthermore, it can help potential users like National Mapping and Cadastral agencies (NMCAs), which consider a state-wide-generation of high quality DSMs to understand the applicability of such tools while triggering further developments based on their needs.

As a joint test data set subsets of three aerial image blocks are provided. Two data sets cover nadir imagery, which are captured at different landuse and block geometry, while the third data set includes oblique aerial images.

Access to the data is provided by a ftp-server hosted by the Institute for Photogrammetry. After the participant has filled out the registration form including a signed user agreement, the password will be provided by mail from the point of contact.

ISPRS/EuroSDR Project Group Dense Image Matching
c/o Prof. Dr.-Ing. Norbert Haala
Institute for Photogrammetry
Geschwister-Scholl-Str. 24D
70174 Stuttgart
Phone : +49-711-685-83383
Fax : +49-711-685-83297
Norbert.Haala@ifp.uni-stuttgart.de

 

Processing parameters

General focus of the benchmark is the comparison of dense multi-stereo matching results as produced by different participants and/or software systems. For evaluation, DSM raster have to be generated for the nadir image blocks while the investigations for the obliqe image block are based on generated 3D point clouds. To allow for comparable results of dense image matching, it is absolutely essential that all the processing is based on the same image orientation. Participants therefore should use the provided orientation, with no modification. For each project corresponding image and ground coordinates are provided on demand so the proper use of the transformations can be controlled by the participants on their own.

Oblique Airborne Test

The collection and use of oblique imagery for photogrammetric purposes has become increasingly important. As an example, oblique images are integrated in global map services, such as Google Maps, where they are used for visualization but also for 2.5D or 3D information extraction. Airborne oblique views depict facades and footprints of buildings. Thus dense image matching offers a great potential for the automatic extraction of urban 3D models. However, applying dense matching algorithms to oblique imagery introduces some additional challenges. Examples are greater changes in viewpoint to object distances also resulting in increasing differences in object scale, or larger variations in viewing directions, which induce larger changes in radiometry. Furthermore multiple occlusions occur especially for dense urban environments.

The oblique image block to be used during the test was acquired over the city of Zürich with the medium format camera Leica RCD30 Oblique Penta which features a pixel size of 6 µm and a radiometric resolution of 14 bit. Imagery is captured in a maltese cross configuration, i.e. with one nadir and four oblique cameras. Nadir imagery is captured with a GSD of 6 cm and an approximate overlap of 70% in flight and 50% across flight direction. The imagery of the four oblique views mounted at tilt angle of 35° has a GSD of 6-13 cm.

Figure 1: Block configuration with camera stations, nadir image footprints, and nadir overlap (maximum eight-fold) overlaid to the DSM

The block configuration for the test area is given in figure 1, which shows the camera stations as blue dots overlaid to the DSM. For reason of simplicity, the image footprints depicted as blue polygons relate to the nadir camera only. The same holds true for the image overlap as represented by the color coded map. Nevertheless, for each camera station all 5 views are available. Thus, the sub-block to be used in the test consists of 3 strips, each with 9 camera stations, which again provides 5 views each. This results in a total of 3x9x5=135 images

 

Orientation parameters and deliverables

The images with 9000 x 6732 pixels each are free of lens distortion. They are made available as RGB 16 bit tiff data set together with the corresponding interior and exterior orientation parameters. To allow for comparable results, participants should use the orientation parameters as provided with no modification.
The test area is represented in figure 1 by the red polygon. This square area with a length and width of approximately 540 m is oriented parallel to the image block. The respective UTM coordinates are:

north-west (upper left) 463950 5249400
north-east (upper right) 464450 5249200
south-east (lower right) 464250 5248700
south-west (lower left) 463750 5248900

 

For simplification the area of interest for which the matching results should be provided has been reduced to the orange polygon with the following UTM coordinates:

north-west (upper left) 464000 5249090
north-east (upper right) 464200 5249010
south-east (lower right) 464100 5248760
south-west (lower left) 463900 5248840

 

As it is visible in figure 1, the block is not exactly oriented in east-west direction. The deviation is approximately 20°. Nevertheless, still the terms North, South, East and West are used while referring to the look direction of the respective oblique imagery with respect to the block orientation. Only those images are provided, which actually depict the test area. Depending on the viewing direction, this can result in different flight lines and thus different image numbers, which were selected according to the respective time stamp.

Within the respective directories, images and orientation files are arranged according to their respective look direction. As an example, for the viewing direction South two stripes where flown in West-East and one in East-West direction. Thus except for the nadir images the images relate to two different cameras heads. This is reflected by different camera directions and numbers. The following tables summarize the available images.

 

Viewing direction South

Camera
direction

Camera
number

Flight
line

Time stamp = image number

Right

61010

25

121239

121241

121243

121245

121247

121248

121250

121252

121254

Left

61007

26

121538

121535

121533

121530

121528

121525

121523

121521

121518

Right

61010

27

121945

121947

121949

121952

121954

121956

121959

122001

122003

 

Viewing direction North

Camera
direction

Camera
number

Flight
line

Time stamp = image number

Left

61007

27

121945

121947

121949

121952

121954

121956

121959

122001

122003

Right

61010

28

122316

122313

122311

122309

122306

122304

122301

122259

122257

Left

61007

29

122657

122659

122701

122704

122706

122709

122711

122713

122716

 

 

Viewing direction East

Camera
direction

Camera
number

Flight
line

Time stamp = image number

Backward

61005

26

121545

121542

121540

121538

121535

121533

121530

121528

121525

Forward

61001

27

121939

121941

121943

121945

121947

121949

121952

121954

121956

Backward

61005

28

122322

122320

122318

122316

122313

122311

122309

122306

122304

 

 

Viewing direction West

Camera
direction

Camera
number

Flight
line

Time stamp = image number

Forward

61001

26

121528

121525

121523

121521

121518

121516

121513

121511

121509

Backward

61005

27

121954

121956

121959

122001

122003

122006

122008

122010

122012

Forward

61001

28

122306

122304

122301

122259

122257

122254

122252

122250

122247

 

 

Viewing direction Nadir

Camera
direction

Camera
number

Flight
line

Time stamp = image number

Nadir

62056

26

121538

121535

121533

121530

121528

121525

121523

121521

121518

Nadir

62056

27

121945

121947

121949

121952

121954

121956

121959

122001

122003

Nadir

62056

28

122316

122313

122311

122309

122306

122304

122301

122259

122257

 

 

Figure 2: Ortho image of test area Figure 3: Exemplary point cloud overlaid to 3D city model.

Figure 2 depicts the test area by a red rectangle as well as the area to be processed by an orange rectangle overlaid to an ortho image. A 3D city model for a section of this area is shown in figure 3 with an exemplary 3D point cloud additionally overlaid. During the benchmark such 3D point clouds will be used for evaluation of the matching results since they provide the most general output of dense multi-view image matching. Thus 3D point clouds have to be uploaded by the participants as LAS-Files for further analysis. During this evaluation a comparison to reference measurements from terrestrial LiDAR will be performed for a number of representative facades. Since terrestrial laser scanning is impossible for a large scene, the decision was taken to choose several distinct reference patches distributed over the whole test area.

Further information on the evaluation procedure can be found in the paper.

Cavegn, S., Haala, N., Nebiker, S., Rothermel, M. & Tutzauer, P., 2014. Benchmarking High Density Image Matching for Oblique Airborne Imagery. In: Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., Zürich, Switzerland, Vol. XL-3, pp. 45-52.

 

Nadir Airborne Test Data Sets

The nadir airborne benchmark is a continuation of a test already implemented in the framework of EuroSDR in the year 2013. Parts of available results were presented and discussed during the 2nd EuroSDR workshop on 'High Density Image Matching for DSM Computation' June 13th to 14th 2013 at the Federal Office of Metrology and Surveying (BEV) in Vienna. Based on the outcome of the workshop, a report on the “Landscape of Dense Image Matching Algorithms” was compiled (see paper and presentation slides).

The nadir airborne benchmark features two data sets with different landuse and block geometry. Vaihingen/Enz is from an aerial image flight collected in a semi-rural area. Both ground sampling distance and image overlap are rather moderate, thus this data set is representative for statewide data collection. In contrast, the test data set München has a higher overlap and resolution and is more typical for applications in densely built-up urban area.

Figure 4: Ortho image and reference DSM of test area Vaihingen/Enz with image footprints and camera stations of UltraCam-X 20cm GSD flight.

The data set Vaihingen/Enz was captured by a UltraCam-X at height above ground of 2900m and a ground sampling distance (GSD) of 20 cm. Figure 4 shows the ortho image and the DSM of the test area. Image footprints and camera stations are additionally overlaid. The overall block consists of 3 strips with 12 images. Participants are required to generate a DSM for the central part with a size of 7.5kmx3.0km at a grid with of 0.2m. Details on the exact definition of the test area and the requested processing are given below.

Figure 5: Color-coded image overlap (maximum nine-folded) for test flight Vaihingen/Enz.

Figure 5 represents the block configuration by a color coded map, with image footprints and camera stations again overlaid. As it is visible, the overlap of 63% in flight and 62% cross flight results in image overlap conditions from one-folded areas (red) to nine-folded overlap (dark green). For the the test area Vaihingen/Enz PAN images are made available as Tiled Tiff uncompressed 8 bit/pix with 9420x14430 pixel at a data volume of 180 Mbyte/image. On demand RGB images can additionally be made available as Tiled Tiff uncompressed 24 bit with 9420x14430 pixel at 537 Mbyte/image.

 

The data set München is provided by the Landesamt für Vermessung und Geoinformation Bayern (LVG) with support from the City of Munich and the Company Astec. While Vaihingen/Enz was selected as an example for nation-wide DSM generation at areas with varying landuse, the data set München is representative for densely built-up urban areas with complex 3D structures. For such applications, images are usually collected at higher resolution and overlap. As it is visible figure 6, the image sub-block to be processed consists of 3 image strips with 5 images each. This results in 80% in flight and 80% cross flight overlap, which provides up to fifteen-folded object points. The München data set covers a central part of the city and was captured by DMC II 230 at a GSD of 10cm. Each image has a size of 15552x14144 pixel at 16 bit/pix. In addition to the PAN images for image matching, again RGB images can be made available on demand i.e. for visualization purposes. The area to be processed has a size of 1.5x1.7km.

Figure 6: München: Color-coded image overlap (maximum fifteen-folded) with image footprints and camera stations (left) and DSM (right).

 

Figure 7: 3D Visualisation of test area München.

A DSM of the test area is shown in figure 6 (right), figure 7 additionally shows a 3D visualization the data set. As it is visible, the area is densely covered by buildings with heights of up to 50m. These buildings result in occlusions especially for surface parts close to the facades. Thus, visibility can be limited for such regions, which will potentially aggravate the matching processes during DSM generation.

 

Processing parameters and deliverables

General focus of the benchmark is the comparison of DSM quality as produced by different participants and/or software systems. To allow for comparable results of dense image matching, it is absolutely essential that all the processing is based on the same image orientation. Participants therefore should use the provided orientation, which is given as Inpho project file and individual orientation files with no modification. For each project corresponding image and ground coordinates are provided a file so the proper use of the transformations can be controlled by the participants on their own.

The camera orientation to be used as a common input is available here

In order to simplify the comparison of the results, the DSM generated from the test participants should be provided in the same geometry i.e. they should feature the same resolution and area as well as the same convention for pixel origin.

 

Area to be processed, pixel center

Size of area

Resolution

Vaihingen

Left: 491800

Bottom: 5418800

Right: 499300

Top: 5421800

37500x15000 pixel

7.5x3.0km

0.2 m

München

Left: 4468200

Bottom: 5332500

Right: 4469700

Top: 5334200

15000x17000 pixel

1.5x1.7km

0.1 m

The results shall be provided by a raw Tiff image encoding with float values corresponding to the elevation in meters. For the pixel origin convention, we propose to adopt the so-called "pixel-center" convention, i.e. the pixel of coordinate (100,90) corresponds to the square [99.5,100.5] x [89.5,90.5] of continuous space. The predefined areas of interest for the respective data sets are given in the above table.

In addition to the DSM data the participants also should provide additional information on used hardware environment and time effort for processing of the benchmark data based on the delivered questionnaire.

Evaluation of test results

For the nadir benchmark the performance of the implemented algorithms is evaluated by analysis of a generated DSM raster. This DSM has to be produced by the respective participants in a predefined format. After deliverance to the project team, the difference to a reference DSM will be computed and made available again to the participant. In contrast, evaluation of the oblique benchmark is performed on 3D point clouds.

The nadir airborne benchmark is a continuation of a test already implemented in the framework of EuroSDR in the year 2013, thus results are already available from a number of participants. Software systems with respective DSM results, which were presented at the workshop were

  • SocetSet 5.6 (NGATE) from BAE Systems
  • Microsoft’s UltraMap V3.1.
  • Match-T DSM 5.5 from Trimble/inpho
  • ImageStation ISAE-Ext from GeoSystems
  • Pixel Factory from Astrium
  • DSM Tool from the Royal Military Academy (RMA) of Brussels
  • Remote Sensing software package from Joanneum Research
  • MicMac developed at IGN
  • SURE from the Institute for Photogrammetry (IfP), University of Stuttgart
  • FPGA implementation of SGM from the German Aerospace Center (DLR).

As a common reference surface, the results provided by the participants were used to generate a median DSM. Of course this median does not provide an independent ground truth. However, it can be used very well to illustrate differences between the respective solutions and thus give hints to situations which are potentially difficult for image matching.

As reference surfaces, two median DSMs were generated based on the results of the 10 participants with a cell width corresponding to the ground sampling distances. Although the reference DSMs are dependent on the DIM results, they are appropriate to highlight DIM differences.

Figure 8 Vaihingen/Enz: Ortho image and median DSM with RMS differences between the available solutions.

Figure 8 shows a an ortho image and a shaded representation of this median for a part of the Vaihingen/Enz test area. A color map representing the root mean square differences of all DSMs with respect to this median is additionally overlaid. As it is visible, larger differences occur in the shaded area of the quarry, in the river area, in the vicinity of patches of trees and in the area of the vineyards. However, while comparing the individual results it becomes obvious that a number of software systems still provide consistent accuracies even for such situations.

Figure 9 München: Ortho image and median DSM with exemplary differences for one solution.

Similarly, results for the test area München are presented in figure 9. For this example a color map representing the differences to the median DSM was generated from a single solution. Larger differences are mainly visible at small details and steep edges which occur in the vicinity of building borders. Furthermore, cast shadows seem to result in larger differences and increasing noise for the reconstructed surfaces.

While a considerable amount of results were already made available to the project team and have been used to compile these preliminary results, the benchmark data is still made available and can be applied to generate new outcome or improve already existing results.

 

Haala, N., 2014. Dense Image Matching Final Report. EuroSDR Publication Series, Official Publication No. 64, pp. 115-145. http://www.eurosdr.net/sites/default/files/uploaded_files/eurosdr_no64_c.pdf

2nd EuroSDR workshop on 'High Density Image Matching for DSM Computation http://geo.tuwien.ac.at/news/2nd-workshop-on-high-density-image-matching-for-dsm-computation-successfully-completed-2013-06-19/

Haala, N., 2013. The Landscape of Dense Image Matching Algorithms. In: D. Fritsch (Ed.), Photogrammetric Week ’13. Wichmann, Berlin/Offenbach, pp. 271-284. paper and presentation slides

 

Principals:

Prof. Dr. Norbert Haala
Stefan Cavegn

Institute for Photogrammetry, Universität Stuttgart

Andy McGill

Ordnance Survey Ireland

Prof. Dr. Stephan Nebiker

Institute of Geomatics Engineering, FHNW Muttenz

Prof. Dr. Norbert Pfeifer

Department of Geodesy and Geoinformation, TU Wien

Michael Franzen

Federal Office for Metrology and Surveying, Wien

 

 

 

 

 

Logo

 

Logo

Our Contact

University of Stuttgart
Institute for Photogrammetry and Geoinformatics
Geschwister-Scholl-Str. 24D
70174 Stuttgart
GERMANY
Email: info@ifp.uni-stuttgart.de

 

2023 © Institute for Photogrammetry and Geoinformatics. All Rights Reserved.