In this chapter we discuss the issues involved in implementing a distributed system that provides a computational infrastructure for the development of decision support and research applications requiring access to and manipulation of large geospatial data sets, such as satellite imagery, from remote servers. We give an overview of some prototype systems we have developed, and some more advanced systems that are still under development, which employ standard Internet and World Wide Web client/server technology. This chapter provides a summary of [43] and [112].
The amount of digital geospatial data available is rapidly growing. In particular, there is a vast amount of data from earth observation satellites, and next-generation satellites are expected to produce terabytes of data per day. This presents a challenge for the development of computer systems that enable the storage, management and dissemination of these huge data sets in online data archives or digital libraries. Ideally, such a system would provide efficient, on-demand remote access to these data sets over the Internet (or an intranet), so that authorised users could easily access and utilise the data for a variety of Geographic Information Systems (GIS) applications, including decision support, research and other analysis.
For a number of GIS applications, such as those requiring real-time or interactive analysis of large data products such as satellite imagery, the processing requirements are large enough that high-performance compute servers are required. This leads to the concept of an ``active'' digital library, where the server provides not only services for querying and downloading of data from the library, but also services for processing the data before downloading [113, 108, 106, 127, 129, 235].
This approach is particularly useful if the amount of data to be processed is very large, for example multiple channels of a satellite image, but the final result is relatively small, for example a processed satellite image for a localised area, or perhaps just a few numbers such as average sea temperature or percentage cloud cover or some correlation coefficients. If the data is obtained from the server using a wide-area, relatively low-bandwidth network, it will be more efficient if the user only has to download the final results rather than download the large input data set and process it locally. Many decision support applications that manipulate spatial data involve operations on very large data sets, but carry out data reduction operations to provide summarised information to the end user. Some of the data sets may be remotely accessed from different servers, possibly over wide-area networks, and the processing may be done on yet another machine, possibly a high-performance computer or supercomputer. We have investigated the consequences of connecting together resources for fast mass storage and high-performance computing with broadband networks, whereas the user's client computer may only have modest network capabilities, such as a modem link via the World Wide Web.
We have experimented with large image repositories such as for geostationary meteorological satellites, as well as other sources of geospatial data such as digital terrain maps. Operations on bulk data range from simple data overlays, to computationally intensive operations such as data interpolation and registration and rectification. Some of these operations are infeasible to do in real-time except on supercomputers. The client programs that can set up processing on demand of various data sets need to be robust, and yet easy to use with suitable graphical interfaces.
In section
we discuss mechanisms for enabling an active
digital library,
using as an example our online repository of geostationary satellite
data and some prototype systems for providing access to this data and
services for processing the data.
We discuss the issues we have encountered in implementing prototype systems
using Web, Java and CORBA technologies.
In section
we present some general scenarios
illustrating how decision support systems can benefit from the use of
fast on-demand access to archives of geospatial data and distributed
GIS services, with more specific applications being described in
section
.
We briefly summarise our findings and conclusions in
section
.
With the advent of cheaper and more powerful computers, networks and electronic storage media, and particularly the huge increase in the use of the Internet and the World Wide Web, large online data archives and digital libraries that can be accessed over the Web are becoming widespread in both the commercial and scientific arenas. The technologies for implementing and accessing such archives, are for enabling online data processing, are still evolving. Our On-Line Data Archives (OLDA) Program [178] is investigating these issues, particularly as they relate to large satellite data archives.
There are a number of existing projects aiming to provide access to digital libraries of earth observation data. These include the Synthetic Aperture Radar Atlas (SARA) at Caltech [235, 234]; NASA's Earth Observing System Data and Information System (NASA-EOSDIS) [164]; the European Space Agency's online product catalog [60]; and the Australian Centre for Remote Sensing (ACRES) Digital Catalogue [17].
Like these and many other projects, we are investigating the computer systems and software issues related to the efficient storage and dissemination of data in large distributed online archives, or digital libraries, of satellite data. However our work, and similar efforts such as the SARA project, is also focusing on developing active digital libraries, which enable remote data processing as well as remote data access. New technologies such as the World Wide Web, Java [88] and the Common Object Request Broker Architecture (CORBA) [177, 163] are making it easier to develop portable distributed client/server systems of this kind.
We are implementing our software support infrastructure using a set of
server-side programs accessed using a mixture of Java programs
communicating between client and server as well as between servers;
CORBA for invoking remote procedures through an object-oriented
interface; and the Common Gateway Interface
(CGI) [100], the standard mechanism for invoking
processes on a Web server. The client is a customised Web-based
interface using Java applets [88] that can be
downloaded and run from a standard Web browser. This architectural
mix provides a portable and powerful framework for rapidly prototyping
systems that can integrate existing server-side utilities, and is
extensible to create complex systems with the desired functionality.
Most importantly, it is built using Web standards, with standard Web
browsers for the clients. Our prototype system, Eric, implements an
active digital library browser and manipulation tool for spatial
data. Eric is discussed further in section
.
In order to make Web-based distributed geographic information systems more general, so that the clients and servers can handle any kind of geospatial data set, and to develop distributed systems with multiple clients and servers that are interoperable, there are two important standardisation issues that must be addressed.
The first is the standardisation of metadata, so that searching digital libraries by querying metadata can be done using the same metadata fields for different data sets. There are several organisations working on metadata standards, both for specialised fields such as geospatial data [125, 223, 222], and in general [51].
The second issue is the standardisation of the client/server interface, so that different clients and servers can be interoperable, and clients can request data from multiple repositories using a standard interface, or an application programming interface (API). The API to an active digital library should specify mechanisms for querying, processing, and retrieving of data. Using general metadata standards, it would be possible to construct a basic general API for querying of metadata and downloading of data that would be universal across any kind of digital library and any kind of data (general Web search engines do this for restricted data sets, such as Web pages and newsgroup postings). However to allow more powerful and domain-specific queries, and to provide an interface to data processing services which would in most cases be specific to the application and type of data, specialised APIs are needed that are specific to a particular domain or topic -- for example, the services and interfaces for an active digital library of geospatial data would be different to those for a library of human genome data.
To develop a more advanced active digital library for geospatial data, integrated with an improved version of ERIC, a well-defined API is required. As with metadata, proposed standards are gradually being developed for digital libraries and data archives. There are various groups, including the Open GIS consortium [179], the International Standards Organisation (ISO) geographic information technical committee [125], and various government agencies in the U.S. and elsewhere [223, 229], that are all involved in developing standards for the storage and exchange of geospatial data.
Currently the API that appears to be the most advanced in design, and the most suited to our existing archive of GMS-5 satellite data, is the Geospatial and Imagery Access Services(GIAS) [228] specification, which is a fundamental part of the U.S. Imagery and Geospatial Information System (USIGS) [226, 227] developed by the U.S. National Imagery and Mapping Agency (NIMA) [229]. This system is targeted at use by the U.S. military, for analysis of satellite and photo-reconnaissance imagery, however it is general enough to support the storage, management and dissemination of geospatial imagery data for a more general distributed GIS. We are therefore developing an active digital library for GMS-5 satellite data that is based on the GIAS. This prototype system could be generalised to support any kind of geospatial data.
The GIAS specification describes an object-oriented archive management system for an active digital library. The specification is structured so that the main functions of the library, such as adding data, querying the metadata, and downloading the data of interest, are controlled by different managers, each of which is handled by a different class. Remote access to the system is implemented using the CORBA standard, and described using Interface Description Language (IDL) [163]. The GIAS also defines a standard Boolean Query Syntax (BQS) which specifies a format for metadata queries on geospatial data. More detailed information on the GIAS can be found in the specification [228].
We have developed a prototype implementation of a GIAS-compliant
geospatial digital library [45].
This work was done in collaboration with the Imagery Management and
Dissemination (IMAD) Project group of the Australian
Defence Science and Technology Organisation (DSTO), who are implementing
a prototype distributed system for handling photo-reconnaissance imagery
for military command, control, communications and intelligence (C
I)
applications.
The same system can also be used for a variety of non-defence GIS
applications, such as those described in section
.
We have initially implemented a subset of the GIAS that provides the
basic functionality for managing a geospatial image library.
The implementation includes the main managers for the server,
an interface to a database on the server for storing the queryable metadata,
a translator that converts BQS queries to standard SQL database queries,
the remote invocation of the managers using CORBA, and a client for
testing the complete functionality of the subset of the GIAS that we
have implemented.
Figure
shows a screen image of the test client.
Only the queryable metadata is stored in the database, while the imagery
data is accessed via the standard file system, and may be stored either on
disk or in a tape silo.

Figure 49: A screen dump of some manager windows from the test client
for the GIAS implementation.
In this case, a query has been made using the Catalog Access Manager,
a section of one of the products matching the query has been selected
and downloaded using the Array Access Manager,
and the image has been displayed in a separate window.
The purpose of this client is just to test the functionality of the
GIAS implementation - an application client would have a more
user-friendly interface.
The GIAS provides a sound basis for implementing an active digital library of geospatial data. It specifies interfaces to the basic functionality of searching for data products by querying metadata, and downloading the required data in a specified format and resolution. It also provides interfaces for some basic data processing services of the kind provided in the original ERIC system, such as cropping out a specified geospatial region, and creating animations of time series of images. The design, which is based on CORBA, is capable of supporting additional data processing services, however an API would need to be defined for each one, and included into an extended interface specification.
We are currently working on interfacing our existing GMS-5 repository to the GIAS server. This requires loading metadata for all the existing GMS-5 data into the database used by the GIAS implementation, and converting our automated data ingest programs so that new data from the satellite is automatically incorporated into the GIAS system.
The GIAS implementation provides most of the services required to support the functionality of ERIC. We are developing a new, GIAS-compliant version of ERIC, using a Java applet rather than HTML forms as the user interface, connecting to the server using CORBA rather than CGI, and implementing the server programs using Java processes that can call other programs as native methods, rather than calling them from shell scripts and Perl programs. Using CORBA requires an Object Request Broker (ORB) running on the client as well as the server, however CORBA and the Internet Inter-ORB Protocol (IIOP) [177, 163] that it uses are now being integrated into Web servers and browsers, for example the latest versions of Netscape Communicator feature a built-in Java ORB to handle CORBA requests [168].
By implementing the ERIC user interface as a Java applet, it can easily be customised for different applications, which can download the basic applet, plus classes specific to the particular application. Using Java programs on the server allows for a more complex and modular system than could be constructed from shell scripts and Perl programs, and allows much better error handling, which was a major problem with the original version of ERIC. The GIAS defines standard exceptions (errors) as part of the specification.
In this section we illustrate some typical scenarios for decision support systems that access distributed data archives across wide-area networks, and discuss the middleware and distributed computing issues which we believe are critical for their widespread uptake.
One example from the area of land and environmental resource management is particularly relevant to Australia, which derives a substantial part of national productivity from primary industries. A great deal of spatial and land resource information is now available from various satellites, aerial reconnaissance, measurement study programs, and other sources such as census and land registry. There are many interoperability and ownership and legal access issues involved in multi-source data. Although these are a great impediment to wide-area distributed decision support systems, we do not address these issues here. Another significant impediment is the difficulties in setting up the software interoperability and machine access for decision makers to access and manipulate such data. We will concentrate on these issues, and in particular discuss the consequences of wide-area distributed computing and metacomputing in addressing these difficulties.
Imagine a station manager or farmer planning the year's activities and wishing to make sensible decisions regarding irrigation and rainfall runoff; crop rotation and planting; expected yield and optimal harvest times and other land care operations such as salinity management and prevention. It is unlikely that the farm manager is working without some existing base of knowledge. There is very likely a base of experience that may have been passed down in the family or is available from neighbouring stations or other individuals in a similar situation. Nevertheless, there may be newly acquired land or new situations, new crop species or irrigation techniques that expand the list of option available to the farmer. How can they exploit the data that may be available to aid the decision making process? Some data will be available in the form of highly sophisticated processed data products such as short and long term weather forecasts. These may be available at the resolution and localisation required for precision agriculture decisions, or may only be available in undigested form. Perhaps no-one has asked for detailed forecasts or data for the farmer's particular region before at the requested resolution. It may however be entirely possible for an automated processing system to be able to create the desired products. Aerial reconnaissance data can be readily bought for a particular region by hiring a suitable plane and aviator. Fly-over data at very high resolution from various commercial satellites is quite likely available in raw form for the region the farmer requires. How can a processing and automatic product creation framework be put in place to allow those organisations who do have the necessary skills to create the desired products to do so economically for what might be a set of isolated one-off sales?
A wide-area metacomputing environment built using a set of clustered computing resources can be set up to provide a common shared resource for the customers (farmers and land managers) to interact with the value-adders or organisations who can create processed products from raw data and the raw data suppliers or government custodians. What is needed is a suitable set of middleware or software that can provide the necessary interoperability and scheduling of the necessary computing services.
There is a feeding chain relationship that exists between the end user
of a processed product, the custodians of the raw geospatial data,
value-add processing agencies which may
involve several intermediate stages, and the final data suppliers.
In many cases around the world raw data belongs to the government and
may be made available for the public good to anyone who wishes to use it.
Government agencies typically have a dedicated and customised
infrastructure for processing raw data to construct decision support
products. Government operated weather bureaus are an example of
this. The feeding chain model for this is shown in
figure
, where the entire system can have dedicated
hardware, software and a high bandwidth data access system.

Figure 50: Dedicated decision support system for a large organisation
such as a government agency (a government land management office or
weather bureau for example) with its own customised, dedicated
processing system for generating decision support products.
Many organisations now exist to sell products derived from this raw
data, possibly involving considerable expertise and innovation as well
as sophisticated processing equipment. The feeding chain model that
allows vendor companies to value-add to existing data is shown in
figure
, illustrating how the end-user receives the
final product through the vendor alone and need not be aware of the
raw data such as satellite imagery that went into its construction.

Figure 51: The feeding chain for vendors to value-add to publicly
archived or government data to generate lower volume, higher margin
data products for decision support.
Some value added products such as weather forecasts are by no means
trivial to derive from raw observational data, and weather
agencies around the world expend a lot of resources in assimilating
data and producing their products. The economics of forecasting is
selling to a large volume market, since so many people and
organisations want accurate weather forecasts that they are generally
sold at a very low margin, if any at all. Land management decision
support products tend to be more esoteric and require large
margins to be worthwhile for a value-adding organisation. In
consequence the decision support capability of such data tends to be
underutilised. Many government agencies set up their own special
systems (as shown in figure
) to provide spatial or
geographic decision support for their areas of jurisdiction. Weather
agencies are moving towards more automation which will lead to a
driving down of the cost of production of value-added forecasts, but
automation is even more necessary for low-volume market areas.
The low-volume market can be effectively served by a hierarchy of
different value-adders all interoperating, on-selling products derived
from each other's work, and ultimately derived from the bulk raw data
that may itself originate from satellite or other sources run by a
commercially-operated company. This hierarchical relationship is
shown in figures
and
.

Figure 52: Hierarchical relationships between value-adders allow complex
end-user products to be derived from a feeding chain of on-sold data
products. Each value-adder on sells derived data products.
A number of general purpose computing technologies can be used to improve
the automation of these value-added services. Improved data archive
management facilities can provide faster bulk access to data sources
more cheaply. A number of systems for archiving data make use of what
is essentially a smart middleware to control a mix of storage media -
many cheap bulk devices such as tapes are housed in multi-tape silos
under robotic control, working in tandem with fast but more expensive
devices such as bulk disk or RAID arrays of disks which can act as a buffer
for frequently accessed data. The primary data suppliers may choose
to use technology such as this in managing the bulk archives of raw
unprocessed data. Value-adder organisations may require access to
more than one primary source of bulk data simultaneously, as shown in
figure
, and it may prove economic for suppliers to
use on-line archive technology such as combined RAID and tape silo
systems to minimise the amount of data that needs to be replicated at
various sites.

Figure 53: Value-adder combining primary data sources by data fusion
into a complex output data product.
Processing resources can be pooled together by organisations to provide better response time or resource utilizations than if individual departments under-utilised their own resources most of the time just to be able to handle their occasional peak capacity requirements. In particular many organisations have clusters of workstations or personal computers that are only sporadically used by individuals during working hours and which stand idle overnight and on weekends and holidays. It is often too difficult to make use of these lost cycles due to lack of smart interoperable scheduling software environments. Improved scheduling on clusters can turn an unused resource into a virtual supercomputer for these organisations.
Storage technology can be integrated with World Wide Web data delivery
techniques to provide casual customers access to a plethora of
pre-prepared data products with very low shipping and delivery costs
in spite of up to the minute relevance. Nevertheless public demand
for network bandwidth is growing and better data connectivity between
data suppliers and value adders is likely to be necessary to avoid
competition with public Internet traffic. Smart middleware can be
used to manage the existing networks between cooperating suppliers
in a feeding chain, to make sensible use of data caches and
intermediate storage so as minimise peak loads and hence conflicts on
the networks. This idea is illustrated in figure
where a value-adder has some sort of smart store that can cache
products according to some access policy and thus archive
processed data. A smart middleware management system is capable of
managing access pattern predictions and pre-fetching data for certain
applications and users. This would make use of a network of caches
and archives to make optimal use of the customer data delivery network.

Figure 54: Homogeneous and heterogeneous mixes of end-user requests
benefit from smart caching of final products as well as intermediate
data used to generate final products.
End-users of a particular value-adder can generate a homogeneous or
heterogeneous mix of product requests. A particular vendor may
benefit from the caching model shown in figure
even
if the requests are heterogeneous in nature, providing the
intermediate products, from which different final products are derived,
are cached to allow fast reuse.
In addition to the normal feeding chain of products supplied by a
value-adder to their customers, some degree of product development is
likely to be ongoing as an in-house effort or perhaps in collaboration
with special customers. These new or one-off data products may be
researched and developed by special users who have access to the
individual processing services inside the value-adder's middleware
system. The decision support products thus developed may eventually
make the transition to being fully automatable products for normal
on-demand delivery to all customers. This model is shown in
figure
. The value-adding organisation's staff then
spend their time developing new products and services, and encoding
these into the smart scheduling environment as well as handling those
occasional special jobs which require some manual intervention. The
schedule management environment might still provide substantial help
to the value-adders staff undertaking special jobs, as it can provide
the primitive operations and support environment. We now have a mixed
scenario with different levels of users all drawing different services
from the environment.

Figure 55: New and one-off special products can be developed by in-house
special users, for eventual automation as on-demand end-user products.
In summary, better middleware to manage storage of raw data, exploit under-utilised processing capabilities such as existing clusters of workstations, and improve data network delivery using mechanisms such as smart caches and pre-fetching, can significantly enhance the performance and cost performance perceived by decision support users. This is particularly effective over large wide-area clusters of resources, where one might expect the statistical load fluctuations from individual users is balanced out.
A number of middleware products have already been attempted and several techniques are being presently researched. An area we are addressing ourselves is that opened up by restricting the general scheduling problem to that of handling only well-defined high-granularity services.
Consider the types of services that might be required for the land care and management decision support scenario we have outlined. These are not arbitrary programs running with arbitrary data. Instead they are generally drawn from a subset of well-defined application components with known performance requirements and will typically be run on a very restricted set of data sets and data set sizes. Consequently it is possible to set up a scheduling management system that has access to the characteristics of each application component and can therefore predict and hence optimise more accurately the cost and time needed to carry it out on a set of computing resources. The smart scheduler is therefore able to make very good use of the whole set of resources under its control and can organise user requests in priority, running the most important or urgent on fast resources, and the lower priority actions using slower, cheaper queues. Given that a restricted finite or manageable set of applications and data sets are being used under the system as a whole it is also possible to effectively cache or store frequently accessed or requested data products.
Consider an example. Suppose a farmer has requested information regarding the prediction of crop yield and optimal harvest time. Crop acreage may already be known to the farmer, or can be accessed from either a land registry database or perhaps calculated from a satellite image which is geo-rectified and registered to allow a geographical area calculation. Localised weather data for the current growing season, weather predictions for the region, and weather data and crop yields from previous years may be combined to calculate an estimate of the likely crop yield. A practical system might provide a series of possible harvest time predictions and options with the computed consequences for the farmer. The decision support product delivered to the farmer may only be the summary output of the calculations - a few kilobytes perhaps, whereas the raw data upon which the calculation was based may have represented many gigabytes of spatial imagery, digital terrain data, runoff patterns and so forth. The final product can therefore be easily delivered over a low-bandwidth network to a client application running in a Web browser.
Similarly, the computational power of the client computer the farmer might use to place the query and request the decision support product might be low, whereas many processing cycles may have been used by the value-adding organisation in creating the data product. Suppose the farmer makes a similar request the next day, but perhaps with some additional new information. By controlling and caching the intermediate data products the value-adding organisation used to create the product such as the raw satellite imagery for the farmer's particular region, it may well be possible to save considerable re-processing time. The profit margin on the second product will be much greater - or of course the value adder may choose to pass on the margin saved to its customers in the form of a cheaper price. This model of being able to cache intermediate data products may provide significant savings and better utilizations of resources. It can only be set up under a smart caching system however. It would be too hard to track the necessary data items manually.

Figure 56: Decision support possibilities if a complex query can be
broken down into a set of computable parts.
The farmer's query is decomposed into a sequence of well-defined and
computable steps in the example above. This is shown in
figure
. Each of these steps might be an
application component with a well-characterised performance that can
be controlled by the smart scheduler. The value-adding organisation
that provides some particular services and data products might have
access to archived raw data either as a locally archived copy or from
some other data provider such as a government agency or another
vendor. Our value-adder might be able to respond semi-automatically
to the incoming query and deliver the product and a bill to the farmer
using a suitable scheduling environment which manages its computing
resources.
It is not always entirely possible to automate product generation and delivery. Some products will always require some human expertise or intervention to create them. A common example is the detailed local weather forecasts issued by government weather agencies. A great deal of automatically collected raw data such as satellite imagery, and computer model generated data is used as input for regional forecasts, but the complexity of the problem is such that these data products are generally only used as input to an expert forecaster who digests the data and uses them as decision support information in producing his own actual forecast. The forecaster has final say on the final product, but has made use of a highly sophisticated decision support environment in producing it. This human intervention stage can occur anywhere along the feeding chain, and the middleware support environment can be thought of as a framework to support those humans involved.
Other decision support applications share many of the properties of the scenario we have outlined above. Here we present a brief description of some of the specific applications that we are targeting with our prototype distributed GIS software infrastructure and active digital libraries.
An example scenario of providing decision support information for land management using a Web-based distributed GIS was given in the previous section. There are many similar applications. For example, as well as providing localised crop yield forecasting for an individual farm, there is a demand for crop yield forecasting on a regional and national level. This requires the use of large amounts of disparate data that may be distributed over many sources. If this data is made available online, it can be integrated by a distributed GIS, which can also provide support for including compute servers and interfacing to the software to perform the crop yield prediction.
A distributed GIS has obvious applications to large-scale land care studies and environmental monitoring, which also require large amounts of disparate geospatial data, including satellite data. For example, a researcher may want to combine vegetation index data obtained from NOAA satellite images with rainfall data obtained from weather bureau data and GMS-5 meteorological satellite images, in order to correlate rainfall with vegetation growth, to provide better understanding of the environment and better models for predicting crop yields, or providing localised predictions of rainfall and frost.
Many intelligence products originate from satellite or aerial
reconnaissance flights and are archived using various technologies
including digital data systems. Various organisations within a
government's defence forces may collect and archive their own data and
may make it available to each other. Sources may vary in quality and
type considerably. Some organisations may work as in
figure
in an entirely in-house customised system, but
more frequently economic and interoperating reasons require various
value-adding relationships to be set up with the whole defence force
to share data and to derive intelligence products from multiple
distributed sources and `on-sell' derived products for decision support.
Human analysts may be working as end-users or as value-adders in the
system itself, combining data products into decision support material for
those processes that are not yet automatable. The defence community
has some additional constraints but overall the model is very similar
to that for commercial value-adding of land resource data.
Some additional properties however include: real-time or near-real
time data delivery; tighter security and secrecy between components of
the system; encryption and restrictions to certain parts of the
`market'; and closed rather than open product catalogues. In spite of
these differences the quality and quantity of the data components and
the automated and human enhanced processing activities still conform
to the decision support information delivery model outlined in
section
.
We are currently working with the Australian Defence Science and
Technology Organisation (DSTO) to develop a prototype Web-based imagery
access system as a technology demonstrator [45].
The system is aimed at a
variety of C
I clients, from image analysts using fast networks and
high-end graphics workstations, to commanders in the field using laptops
and low-bandwidth connectivity.
There are two basic scenarios for the use of GIS by emergency services. The first is for emergency planning purposes, to investigate measures for alleviating or responding to emergencies such as fire or flood. These systems allow the user to simulate possible situations and their outcomes. For example, firefighters or forestry managers might run simulations of what would happen if a fire was started in specified areas under certain conditions, in order to identify danger areas where fuel loads should be reduced with cool burns, and under what conditions these managed burns would be safe. Emergency services might run simulations of evacuation procedures in the event of a flood or fire.
The second scenario is decision support during an actual emergency, which provides a much greater challenge to a distributed GIS. As with emergency planning, the GIS will draw on a wide variety of existing geospatial data, such as the positions of populated areas and houses, road networks, fire stations and hydrants, etc. However in this case, it will also require access to real-time data that may be rapidly changing. For example, a GIS for decision support during a bushfire would ideally have real-time data feeds coming from many different sources, including the positions of firefighters, police and emergency services personnel in the field, information about the position of the firefronts, which can be transmitted from helicopters with GPS units, and current and predicted weather information such as temperature and wind speed.
A Web-based distributed GIS could be accessed in the emergency response headquarters, as well as by crews in the field using a laptop connected by a cellular modem. The GIS could provide up-to-the-minute information on the situation status, and provide decision support information such as the optimal routing of fire trucks or evacuations, and predicting the path of the fire using simulation.
The National Key Centre for Social Applications of GIS at the University of Adelaide is developing such systems for emergency services decision support [166]. We are working with them on addressing the complex distributed computing aspects of this application, including the access to real-time data feeds from multiple sources, and the use of high-performance compute servers to provide rapid simulation.
We are investigating the computer systems and software issues related to the efficient storage and dissemination of data in large distributed online archives of geospatial data such as satellite images. We are developing ``active'' digital libraries, which enable remote data processing as well as remote data access. Technologies such as the World Wide Web, Java, and CORBA are making it much easier to develop portable distributed client/server systems of this kind.
We are currently developing the infrastructure to support a Web-based
distributed GIS. Some prototype systems have been implemented, such as
the ERIC active digital library for GMS-5 satellite images, and the
Imagery Management and Dissemination (IMAD) system for handling defence
photo-reconnaissance images.
These distributed geographic information systems, and the middleware
infrastructure that supports them, are currently being provided with
additional functionality and robustness.
As the systems are improved, they will be used for advanced decision
support applications such as land management and crop yield prediction,
and C
I applications for emergency services and defence.
Client/server computing, particularly over wide-area networks, is not yet widely used for GIS applications and research, but we believe the use of distributed geographic information systems that can remotely access and process a variety of data from multiple online data archives has great promise in many research and decision support applications.