DANIEL H. WAGNER ASSOCIATES, INC

A Leader in Applying Mathematics and Computer Science to Industry

 

Operations Research - Mathematics - Software Development

Home

About Us

Technology

Projects

Products

Careers

Contact Us

Search

You are at: Wagner Home > Technologies > Environmental Data Fusion

Environmental Data Fusion

Our Environmental Data Fusion algorithms/tools address many of the primary challenges associated with environmental and other geostatistical data fusion.

·        They automatically access data from multiple, remote sources by providing a system architecture that allows any unit at any location to look for environmental or other important data across a general network, and to pull that data into a common database. 

·        They automatically resolve conflicts in returned data by merging or fusing this data into a single database and comparing values over the spatial domain, identifying regions of agreement/disagreement and selecting the “Best” data for current needs.

·        They automatically receive and assimilate new measurements as they relate to the existing knowledge base.  This is particularly important since a single measurement has a known validity that is generally better than that of historical (averaged over a long period of time) data, but the single measurement only provides a micro-view of the space, in fact only the view at a single point.

·        They automatically recommend optimal points at which to obtain additional measurements.

·       They automatically tie environmental and other geospatial data to operational needs.  Even perfect knowledge of geospatial factors is only useful if it can be applied to an operationally useful problem.

Our Environmental Data Fusion Tool Set contains the following four primary tools:

1.      Information Collection Tool

2.      Polygon Combination Tool

3.      Measurement Fusion and Optimization Tool

4.      Data Analysis Tool to Relate Geospatial Data to Operational Needs

Information Collection Tool

 

 


 

 

 

 

 

 

 

 

 

Figure 1.  Information Collection and Data Fusion/Analysis/Mining Architecture

The most general way to think of this tool is to envision any application (Mine Warfare or not) requesting data from a generic data server or broker for multiple servers.  The application need not specify the specific location of the server, but should only ask for the kind of data it needs, the geographic location associated with that data, and other supporting information if applicable (for example, unique definitions of “Best”).  The broker then returns to the calling application the “Best” data available to support its request.

The second step is to provide a general broker/agent architecture that supports the access and retrieval of environmental data from a variety of sites.  In our architecture, the broker manager takes the request from the calling application and accesses the Data Agent, which finds all the data from all of its source repositories and provides this data to the broker.  To minimize the bandwidth associated with these requests, and to accommodate interruptions in service between the servers, the local system stores a cache copy of each database.  Then, when the broker makes a request, the data agent does a query for new data across the net.  If none exists, no data is pulled.  Locally collected data can also be stored, and this data is then in turn available to be pulled from other sites.  The architecture also has a Graphical User Interface (GUI) for the definition and management of the server connections and the sites for data sources.  It should also be noted that this architecture supports data compression, and can support information exchange on bandwidth-limited networks, over RF networks, and over intermittent networks.

Once all the available data of a given type and for a given location is in the local broker’s database, the next step is to combine all of this information into a “Best” picture.  We do this using a software tool called mcmm_attribute that performs a polygon combination of all the available data sources and then applies rules to those polygons to determine the “Best” available view.  In our design, the rule-set for defining “Best” can come from the application or through a GUI.  Examples of these polygon combinations are provided in the following discussion on the Polygon Combination Tool.

This approach allows us to get the best data available, but we still need tools and techniques for the assimilation of new measurements into the common environmental picture as described in the Measurement Fusion Tool section.  This capability is outside the broker, but connected to the requesting application through the common software interface.  This way, the assimilation tools are always operating on the “Best” data for their task.  Note that the data can then be inserted into both the local and “Best” databases.  Finally, this system description can be thought of as just a single node of a distributed system.  This provides a very powerful tool to support geospatial data fusion.

Polygon Combination Tool

Figure 2.  Polygon Combination Tool Example

 This example shows how the polygon combination tool can fuse data from multiple sources, as well as data with multiple attributes and multiple sources.

To illustrate the fusion of data from multiple sources, we have three sources of bathymetry data with certain regions of validity associated with those measurements, shown in the left side of Figure 2, with arrows pointing to “Best” fused result.  The least trusted, but most available data is the NIMA digital chart.  The next most trusted is the gridded data from the Precise Underwater Mapping Analysis (PUMA) system.  Finally, a single sounding from the U.S. Naval Oceanographic Office with a certain radius of validity is the most trusted entry.  As shown in Figure 2, the polygon combination tool fuses these data using the predefined “Best” order to produce a “Best” fused geospatial picture. 

This technique also supports the combination of multiple attributes from multiple sources.  For example, as shown on the right side of Figure 2, we can take Bathymetry and Bottom sediments (assuming that the “Best” values for each of these have already been established) and then combine them.  The system then creates the fused single picture shown in the example.  Note that after fusion there are seven different regions with six different data combinations.

Measurement Fusion and Optimization Tool

 


Figure 3.  Measurement Fusion and Optimization Tool Example

(Contour Plot of Absolute Value of Estimation Errors)

Our next tool is a fully automated measurement fusio n and optimization tool for assimilating new measurements into an existing database.  This tool is based on mature statistical techniques that work well for the bathymetry and bottom sediments that are critical to mine warfare operations, can produce improved results over standard spatial interpolation and prediction methods, and can be applied to any single valued (or multi-valued) geospatial data.  Also, these techniques require no distribution assumptions, accurately estimate the variance of the prediction error, and recommend optimal points at which to obtain additional measurements.

When performing measurement fusion, we first use a data set typical of the area of interest to determine how the measurements are correlated, and to determine trends in the data.  These correlation statistics can be anisotropic, which allows us to model the fact that water depth, for example, is usually relatively constant as one moves parallel to a coastline, but can change quickly as one moves perpendicular to the coastline.

After determining these correlation statistics, a relatively small sample of data from the area of interest can then be used directly in the measurement fusion process to estimate data values throughout the area of interest.

This sample display demonstrates how selected measurements can be combined with existing environmental data to predict a geographic extension of those measurements.  In the example shown in Figure 3, the 32 white circles show the small random sample of measured data that was used in the measurement fusion process. 

For this example, we started with 3233 bathymetric soundings and randomly chose 10% of these points.  We then developed the correlation functions using this random sample of 323 measured points.  Once the correlation functions were calculated, we then selected 32 random soundings to represent the new data measurements.  These 32 new points were combined by the measurement fusion tool with the correlation functions to estimate the bathymetry for the entire area.  The results of this combination are shown in Figure 3.  The 32 white circles represent the randomly selected “measurements,” and the colors in the geoplot show the difference between the bathymetry values calculated by our measurement fusion tool and the 3233 actual soundings.  As can be seen, the majority of the errors were less than one meter (dark blue), although some were as high as 9 meters.  It is also evident that the randomly selected measurement set was clustered to the north end of the sample, and that the majority of the larger errors were at the southern end.

This sampling process models a real-world operation in which we would have a few hundred measured points from an area similar to the area of interest that are used to estimate the correlation functions, and a few tens of points in the area of interest that are available to be used in the actual measurement fusion process.

Data Analysis Tool to Relate Geospatial Data to Operational Needs
 

Text Box: Predicted Sensor Effectiveness as a function of Bottom Type and Bathymetry using sample data from multiple sources
 

 

 

Text Box: Predicted Sensor Effectiveness as a function of Bottom Type and Bathymetry using sample data from multiple sources

 

 

 

 

 

 

 

Figure 4.  Sample Results for Mine-Countermeasures (MCM) Tactics

The fourth challenge is to tie the environment or other geospatial data to mission/operational success.  In our system, the sensor effectiveness for a given countermine system (in this case a generic side scan sonar) is related to the environmental factors that drive its effectiveness.  The resulting polygons are then displayed on the geographic chart and used for mine-countermeasures planning. 

 

In the example above this effectiveness is shown using a colored “stoplight” scale, with the light green showing areas in which mine-countermeasures operations can be conducted more effectively, and the dark red showing areas in which mine-countermeasures operations can be conducted less effectively.  This approach allows plans to be developed that place more countermine effort in the red areas than in the green areas.  This is significant since, without this type of capability, all areas are generally treated as having the same mine-countermeasures operational effectiveness, usually the worst, which requires a significant amount of additional time to complete the assigned task.


 

Home | Contact Us | Site Index | Career Opportunities

Technology | Projects | Products | Locations | Legal Notices | Search

© 2005 Daniel H. Wagner Associates, Inc.  - All rights reserved.