A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination
The presence of subsurface munitions and explosives of concern (MEC) is a significant issue worldwide. Discrimination of MEC from non-MEC items enables resources be focused on mitigating risk. Geophysical data is collected and physically-based models inverted with the intent that the inverted model...
Gespeichert in:
Datum: | 2011 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | English |
Veröffentlicht: |
Інститут проблем математичних машин і систем НАН України
2011
|
Schriftenreihe: | Математичні машини і системи |
Schlagworte: | |
Online Zugang: | http://dspace.nbuv.gov.ua/handle/123456789/83512 |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Назва журналу: | Digital Library of Periodicals of National Academy of Sciences of Ukraine |
Zitieren: | A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination / L.M. Deschaine, P. Nordin, Já.D. Pintér // Мат. машини і системи. — 2011. — № 2. — С. 50-61. — Бібліогр.: 10 назв. — англ. |
Institution
Digital Library of Periodicals of National Academy of Sciences of Ukraineid |
irk-123456789-83512 |
---|---|
record_format |
dspace |
spelling |
irk-123456789-835122015-06-21T03:02:30Z A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination Deschaine, L.M. Nordin, P. Pintér, Já.D. Нові інформаційні і телекомунікаційні технології The presence of subsurface munitions and explosives of concern (MEC) is a significant issue worldwide. Discrimination of MEC from non-MEC items enables resources be focused on mitigating risk. Geophysical data is collected and physically-based models inverted with the intent that the inverted model parameters form the basis for MEC discrimination. However, MEC discrimination via model inversion has significant difficulties in noisy environments and with uncertain sensor location. Our computational geometric approach is demonstrated to produce an information-rich set of attributes useful for MEC discrimination including the inverted model information content along with valuable additional information not obtainable using the inversion approach. Наявність залишкових підповерхневих боєприпасів і вибухових речовин (БВР) є серйозною проблемою в усьому світі. Дискримінація БВР від не БВР-елементів дозволяє спрямовувати ресурси на пом'якшення ризиків. Збір фізичних даних і інвертування, моделей, що фізично визначаються, проводяться з наміром використовувати інвертовані модельні параметри як базис для дискримінації БВР. Однак дискримінація БВР через модельну інверсію стикається зі значними труднощами в середовищах з шумами, а також при невизначеному місцезнаходженні сенсорів. Наш обчислювально-геометричний підхід демонструє можливість отримувати безліч інформаційних атрибутів, корисних для БВР-дискримінації, включаючи інформаційний зміст інвертованої моделі разом з цінною додатковою інформацією, недоступною при використанні інверсного підходу. Наличие остаточных подповерхностных боеприпасов и взрывчатых веществ (БВВ) является серьезной проблемой во всем мире. Дискриминация БВВ от не БВВ-элементов позволяет направлять ресурсы на смягчение рисков. Сбор физических данных и инвертирование физически определяемых моделей производятся с намерением использовать инвертированные модельные параметры в качестве базиса для дискриминации БВВ. Однако дискриминация БВВ через модельную инверсию сталкивается со значительными трудностями в средах с шумами, а также при неопределенном местоположении сенсоров. Наш вычислительно-геометрический подход демонстрирует возможность получать множество информационных атрибутов, полезных для БВВ-дискриминации, включая информационное содержание инвертированной модели вместе с ценной дополнительной информацией, недоступной при использовании инверсного подхода. 2011 Article A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination / L.M. Deschaine, P. Nordin, Já.D. Pintér // Мат. машини і системи. — 2011. — № 2. — С. 50-61. — Бібліогр.: 10 назв. — англ. 1028-9763 http://dspace.nbuv.gov.ua/handle/123456789/83512 519.6; 662.2 en Математичні машини і системи Інститут проблем математичних машин і систем НАН України |
institution |
Digital Library of Periodicals of National Academy of Sciences of Ukraine |
collection |
DSpace DC |
language |
English |
topic |
Нові інформаційні і телекомунікаційні технології Нові інформаційні і телекомунікаційні технології |
spellingShingle |
Нові інформаційні і телекомунікаційні технології Нові інформаційні і телекомунікаційні технології Deschaine, L.M. Nordin, P. Pintér, Já.D. A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination Математичні машини і системи |
description |
The presence of subsurface munitions and explosives of concern (MEC) is a significant issue worldwide. Discrimination of MEC from non-MEC items enables resources be focused on mitigating risk. Geophysical data is collected and physically-based models inverted with the intent that the inverted model parameters form the basis for MEC discrimination. However, MEC discrimination via model inversion has significant difficulties in noisy environments and with uncertain sensor location. Our computational geometric approach is demonstrated to produce an information-rich set of attributes useful for MEC discrimination including the inverted model information content along with valuable additional information not obtainable using the inversion approach. |
format |
Article |
author |
Deschaine, L.M. Nordin, P. Pintér, Já.D. |
author_facet |
Deschaine, L.M. Nordin, P. Pintér, Já.D. |
author_sort |
Deschaine, L.M. |
title |
A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination |
title_short |
A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination |
title_full |
A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination |
title_fullStr |
A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination |
title_full_unstemmed |
A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination |
title_sort |
computational geometric / information theoretic method to invert physics-based mec models attributes for mec discrimination |
publisher |
Інститут проблем математичних машин і систем НАН України |
publishDate |
2011 |
topic_facet |
Нові інформаційні і телекомунікаційні технології |
url |
http://dspace.nbuv.gov.ua/handle/123456789/83512 |
citation_txt |
A computational geometric / information theoretic method to invert physics-based MEC models attributes for MEC discrimination / L.M. Deschaine, P. Nordin, Já.D. Pintér // Мат. машини і системи. — 2011. — № 2. — С. 50-61. — Бібліогр.: 10 назв. — англ. |
series |
Математичні машини і системи |
work_keys_str_mv |
AT deschainelm acomputationalgeometricinformationtheoreticmethodtoinvertphysicsbasedmecmodelsattributesformecdiscrimination AT nordinp acomputationalgeometricinformationtheoreticmethodtoinvertphysicsbasedmecmodelsattributesformecdiscrimination AT pinterjad acomputationalgeometricinformationtheoreticmethodtoinvertphysicsbasedmecmodelsattributesformecdiscrimination AT deschainelm computationalgeometricinformationtheoreticmethodtoinvertphysicsbasedmecmodelsattributesformecdiscrimination AT nordinp computationalgeometricinformationtheoreticmethodtoinvertphysicsbasedmecmodelsattributesformecdiscrimination AT pinterjad computationalgeometricinformationtheoreticmethodtoinvertphysicsbasedmecmodelsattributesformecdiscrimination |
first_indexed |
2025-07-06T10:16:38Z |
last_indexed |
2025-07-06T10:16:38Z |
_version_ |
1836892290135621632 |
fulltext |
50 © Deschaine L.M., Nordin P., Pintér Já.D., 2011
ISSN 1028-9763. Математичні машини і системи, 2011, № 2
UDC 519.6; 662.2
L.M. DESCHAINE, P. NORDIN, JA.D. PINTER
A COMPUTATIONAL GEOMETRIC / INFORMATION THEORETIC METHOD TO
INVERT PHYSICS-BASED MEC MODELS ATTRIBUTES FOR MEC DISCRIMINA-
TION
Анотація. Наявність залишкових підповерхневих боєприпасів і вибухових речовин (БВР) є серйоз-
ною проблемою в усьому світі. Дискримінація БВР від не БВР-елементів дозволяє спрямовувати
ресурси на пом'якшення ризиків. Збір фізичних даних і інвертування, моделей, що фізично визнача-
ються, проводяться з наміром використовувати інвертовані модельні параметри як базис для
дискримінації БВР. Однак дискримінація БВР через модельну інверсію стикається зі значними
труднощами в середовищах з шумами, а також при невизначеному місцезнаходженні сенсорів.
Наш обчислювально-геометричний підхід демонструє можливість отримувати безліч
інформаційних атрибутів, корисних для БВР-дискримінації, включаючи інформаційний зміст
інвертованої моделі разом з цінною додатковою інформацією, недоступною при використанні
інверсного підходу.
Ключові слова: боєприпаси і вибухові речовини, метод обчислювальної геометрії, техніка інверсії
фізичної моделі.
Аннотация. Наличие остаточных подповерхностных боеприпасов и взрывчатых веществ (БВВ)
является серьезной проблемой во всем мире. Дискриминация БВВ от не БВВ-элементов позволяет
направлять ресурсы на смягчение рисков. Сбор физических данных и инвертирование физически
определяемых моделей производятся с намерением использовать инвертированные модельные па-
раметры в качестве базиса для дискриминации БВВ. Однако дискриминация БВВ через модельную
инверсию сталкивается со значительными трудностями в средах с шумами, а также при неопре-
деленном местоположении сенсоров. Наш вычислительно-геометрический подход демонстрирует
возможность получать множество информационных атрибутов, полезных для БВВ-
дискриминации, включая информационное содержание инвертированной модели вместе с ценной
дополнительной информацией, недоступной при использовании инверсного подхода.
Ключевые слова: боеприпасы и взрывчатые вещества, метод вычислительной геометрии, техни-
ка инверсии физической модели.
Abstract. The presence of subsurface munitions and explosives of concern (MEC) is a significant issue
worldwide. Discrimination of MEC from non-MEC items enables resources be focused on mitigating risk.
Geophysical data is collected and physically-based models inverted with the intent that the inverted model
parameters form the basis for MEC discrimination. However, MEC discrimination via model inversion
has significant difficulties in noisy environments and with uncertain sensor location. Our computational
geometric approach is demonstrated to produce an information-rich set of attributes useful for MEC dis-
crimination including the inverted model information content along with valuable additional information
not obtainable using the inversion approach.
Keywords: munitions and explosives of concern, computational geometric method, physics model inver-
sion technique.
1. Introduction
Solving MEC discrimination decision problems requires an in-depth understanding of the under-
lying science of geophysics. Our overall goal is to demonstrate the enhanced accuracy and per-
formance possible from using machine learning modeling to fuse the information content ob-
tained from MEC feature attributes derived from both data-driven models (using computational
geometry) and physics-based models. We describe the techniques, and how the machine-learning
independent information-theoretic approach can be used to assess the contribution from each fea-
ture source (computational geometry or the fitted physics models) in MEC discrimination chal-
lenge. The physics-based governing equations provide the relevant scientific problem space of
ISSN 1028-9763. Математичні машини і системи, 2011, № 2 51
Fig. 1. Typical MEC and non-MEC items.
(Image: US Army Environmental Command:
Standardized Target Specifications:
Technology Demonstration Sites)
MEC item responses to geophysical interrogation. Computational geometry provides attributes
for MEC and non-MEC (i.e. clutter, shrapnel). Hence, a key objective of this work is to merge
and extend the techniques, effectively fusing both a priori physics-based and automatic modeling-
based components to extend the maximum total discrimination/classification accuracy beyond
that achievable by either method used independently. A related and equally important objective is
to quantify the relative value of each component of the information sources in relationship to ac-
curacy.
2. Overview of MEC Discrimination
MEC discrimination presents one of the toughest and most challenging problems in the genre of
subsurface identification tasks. A MEC item can, for instance, be unexploded ordnance of various
sizes and be buried below ground (fig. 1). MEC
can retain their ability to detonate; they pose a
continuing risk. The United States Department of
Defense (DOD) has invested heavily in basic re-
search and development to address this challenge,
but because typically MEC targets are small and
surrounded by clutter (e.g., shrapnel or non-MEC
items), accurate and reliable discrimination has
been a challenge. Hence, while progress is being
made, safe, efficient and cost-effective solutions
have so far proven elusive.
Initially, MEC discrimination research fo-
cused on two primary approaches to evaluate a
Target of Interest (TOI): the first, a physics-based
approach [1], relied on mathematical models whe-
reby model parameters were fitted to field data by solving the inverse modeling problem. A
second approach, which used machine-learning modeling and multidisciplinary computational
geometry insights to derive features from the field data, clearly outperformed the other methods
in use at that time to discriminate MEC from non-MEC [2]. Both approaches are described be-
low.
2.1. Inverse (Fitted) Physics-Based Models
This section explains the inverse physics-based modeling approach for discriminating MEC items
using electromagnetic (EMI)-based and magnetic (MAG) instruments.
One method to investigate the presence of MEC items is by conducting non-destructive
geophysical surveys. This approach has value only if the resulting information is useable for lo-
cating anomalies and discriminating between MEC and non-MEC items. Since the MEC objects
are not observable (being primarily below ground), the location, depth, and orientation of the
MEC item are unknown. These model parameters are solved for by inverse modeling and are
used to assess whether a TOI is a MEC item or not.
EMI uses induction theory and leverages the hypothesis that the distributions of the ei-
genvalues of magnetic polarizability provide an understandable basis for MEC versus non-MEC
discrimination. This hypothesis is based on the observation that a MEC item can be approximated
by an axisymetric cylindrical (as illustrated on Fig. 1) and, therefore, has only two unique eigen-
values, one that represents the length of the object and the other two that represent the axial
symmetry. Irregular objects (e.g., clutter), however, exhibit three distinct eigenvalues (that is, dif-
ferent responses in three orthogonal directions). The model of the signal ( )tS that is generated by
the EMI equipment is:
52 ISSN 1028-9763. Математичні машини і системи, 2011, № 2
( ) =tS ( ) =tB . (1)
Where t is time, Tr is the sum of the diagonal elements of a matrix (trace), TR is the
transmit/receive matrix, and ( )tB is a symmetric-effective polarizability matrix. ( )tS is computed
from the convolution of the magnetic polarizability with the transmit waveform. The best-fit ei-
genvalues (β1, β2, β3) correspond to the responses induced when the primary field is aligned with
the principal axes of the object. A magnetic (MAG) survey response is described by a simple di-
pole model. A tool that provides the best fit estimate for both EMI and MAG data (UX-Analyze)
has been developed by ESTCP to facilitate these calculations [4]. Fig. 2 illustrates the results of
an inverse model fit for an anomaly investigated using both the EMI and MAG geophysical tech-
niques.
This approach provides fitted model parameters that are listed under the “fit results” out-
put summary. There are seven EMI-fitted model parameters, which are then used as inputs for
machine learning modeling: these are the depth of the object (Depth), its size (Size), the eigenva-
lues (β1, β2, β3), the Coh and the best-fit value (chi2). Inverse physics modeling for the MAG
sensor provides as outputs depth, size, declination, inclination, solid angle, and the magnetic
moment. MEC discrimination insight is gained from data collected later in the decay curve which
captures the anomaly metal thickness. The core concept regarding the EMI inverse model tech-
nique is that the polarizability will have one large (β1) and two small (β2, β3) and equivalent val-
ues to describe the conical MEC-shaped item. MAG relies on the shape and amplitude aspects.
Hence, both shape (cylindrical versus fragments) and metal thickness (casings versus sheet metal)
are also useful MEC discrimination information.
While theoretically sound, significant practical challenges to this method include the need
to overcome data collection positioning error (requires resolution on the centimeter scale); and
signal-to-noise ratio (S/N) must be very high, on the order of 100, and non-uniqueness of the ei-
genvalue solutions. The inverse model parameters used in this work were developed by [5].
Fig. 2. Inverse modeling analysis of EMI (left) and MAG (right) for one anomaly using UX-
Analyze. (From fig. 2–7 in [4])
ISSN 1028-9763. Математичні машини і системи, 2011, № 2 53
Fig. 3. MEC discrimination solution compared to
results from the JPG Phase IV UXO (MEC)
Discrimination Project
2.2. Computational Geometric Model
We first developed and tested the multi-disciplinary, machine-learning approach using computa-
tional geometric modeling techniques in the fall of 2001 on publicly available information and
data sets for a MEC (then called “UXO” for unexploded ordnance) discrimination from a “prove-
out” site known as the Jefferson Proving Ground – Phase IV. The approach performed far better
than any technique used at that time [2]. The data used were collected by others using a Protem-
47, time domain geophysical unit that provided 20 time gates of signal S(t) information. The
compiling genetic programming system (CGPS), a machine-learning technique developed by
Nordin [6], was used as the classification
algorithm (we later coined the phrase
“linear genetic programming” [LGP] to
differentiate it from other genetic pro-
gramming algorithms). The results of
this study are summarized in Deschaine
[2] and are shown in fig. 3.
Fig. 3 shows the performance of
the published results from 10 analyses
conducted by vendors who provided
MEC discrimination services as part of
the JPG Phase IV project. The horizontal
axis shows the performance of each me-
thod in correctly identifying anomalies
that did not contain buried MEC; whe-
reas the vertical axis shows the performance of each method in correctly identifying anomalies
that did contain buried MEC. The angled line in the figure represents what could be expected
from random guessing.
The difficulties of modeling these data are evident: most methods performed little better
than random guessing would. Notwithstanding this limitation, the machine-learning based com-
putational geometric approach using the CGPS algorithm still provided the best-known approach
at the time for correctly identifying MEC and for correctly rejecting non-MEC using various data
set configurations on blind data [2]. The dashed line from the NAVEA solution in Fig. 3 indicates
that the data set for the machine-learning algorithm was used. Note that the data we used was
from a well conducted study, yet the analysis method used by others only produced results
slightly above average. (We selected this data because of its computational geometric value.)
Note that we intentionally did not use the data set labeled Geophex, even though it had the best
performance of the group as analyzed by others, because we concluded that the NAVEA data had
more information for high accuracy MEC discrimination – the team doing the original analysis
just were not able to exploit it. The gray dot in the upper right-hand corner of the figure shows the
CGPS solution on unseen data. Because the number of data points was small, we used a re-
sampling technique to estimate the 95% confidence interval on this solution; the black rectangle
in Fig. 3 shows that interval. CGPS – combined with computational geometric approach – pro-
duced by far the most accurate discrimination results.
Since the initial UXO/MEC discrimination success in 2001, we have been assessing the
challenge of quickly finding targets of interest and then extracting a small, focused set of
MEC/non-MEC relevant discrimination features for input to machine-learning algorithms and
production-size data sets. The initial approach we tested was to use genetic programming for au-
tomated feature extraction, but it was unsuccessful in practice. The approach we found that is ro-
bust, practical, flexible, and effective is a multi-disciplinary formulation of computational geome-
try. This approach was inspired by successes in the medical field, but because there are essential-
ly an infinite number of features that can be derived using computational geometry, this approach
54 ISSN 1028-9763. Математичні машини і системи, 2011, № 2
Fig. 4. Development of computational geometr-
ically derived attributes using a globally
optimized ellipsoid
presents a particular challenge for any machine-learning approach, namely that of input attribute
explosion. For example, the approach used to generate the results cited herein uses a field instru-
ment with four (4) time gates and generated 633 attributes based on raw data, statistical prop-
erties, and insight from the physics-based MEC
discrimination equations (though not the specif-
ic inputs from inverse physics model fitting).
The geometric attributes are based on finding
an optimized ellipsoid that is constructed either
automatically using the Lipschitz Global Opti-
mization (LGO) technique [8] or by an expert
geophysicist who draws a polygon around the
target of interest. To generate the features, the
ellipsoid is divided into slices and the features
are computed as a whole geometric shape, with-
in quadrants and within the segmentations. Fig.
4 illustrates the computational geometric
process of segregating an ellipsoid fit to field data for attribute derivation.
Given the prospect that the next generation of geophysical instruments will produce even
more data and resultant features, the industry would benefit from an efficient and reproducible
site-specific feature reduction methodology – which is precisely the role the information-theoretic
approach Minimum Redundancy Maximum Relevance (MRMR) would serve.
3. Attribute Analysis: Computational Geometry and Inverse Physics Models
Our hypothesis is that when the attributes from computational geometry and fitted inverse phys-
ics-based modeling approach are combined, the resulting model generated with machine learning
will perform better than – or at least as well as – either approach used alone. We will now test
this hypothesis first theoretically using information theory, and then empirically, using machine
learning.
3.1. Mutual Information Analysis
Understandability of the individual attributes and relationships used for MEC classification anal-
ysis is important for the users of the solution. While the computational geometric approach has
been shown to be a viable approach, the amount of attributes can make the solution daunting to
understand. Methods for feature compression such as principal components analysis, while quite
valuable for reducing the number of inputs in a data set used for machine learning, require com-
plex computations to be performed that combine many attributes into a single input vector. This,
however, is something that obfuscates solution understandability. In the section below, we de-
scribe and test an approach to reduce the attributes required for MEC discrimination modeling
using mutual information that offers the additional advantage of preserving the individual
attribute identity.
To test the approach on both attribute reduction and relevancy assessment, the data sets
from the ESTCP Camp Sibert project [3, 5] are combined so they contain attributes from both the
fitted physics-based model parameters and the computational geometric approach; the MEC iden-
tity is a binary label (1 for MEC, 0 for non-MEC). The data was collected by others as part of the
project and provided to us for this analysis. The EMI data set consists of 174 instances (rows), of
which 67 are MEC and 107 are non-MEC. There are seven attributes for the fitted physics-based
model and 551 for the computational geometric based model. The MAG data set consists of 182
instances (rows) of which 56 are MEC and 126 are non-MEC. There are six attributes for the fit-
ted physics-based model and 82 for the computational geometric-based model.
ISSN 1028-9763. Математичні машини і системи, 2011, № 2 55
Information content has long been used for assessing important of attributes for model
building [10]. The method used here is based on mutual information, using a maximum-
dependency, minimum-redundancy framework as developed by Peng [7]. This technique pro-
vides the necessary theoretical engine to select the best candidate features independent of a ma-
chine-learning classifier. The computations are based on the following model:
Given two random variables ( )yx, , their mutual information ( )yx, is defined in terms of
their marginal and joint probability density functions ( )xp , ( )yp and ( )yxp , :
I . (2)
In terms of mutual information, the goal of feature selection is to develop the set S of m
features { }mixi ...1, = which jointly have the largest dependency (or in this case relevance) on
the target class, that is the classification of MEC (aka UXO):
( ) =whereDuxoSD ,,max . (3)
It is likely that using just this formulation will generate a list of features that are redundant
with respect to one another (i.e., not all are needed for the same discrimination accuracy); hence,
a feature redundancy protective measure is used via a maximum relevance and minimum redun-
dancy formulation:
( ) == RSRR ,min . (4)
To optimize D (dependency) and R (redundancy) simultaneously, we can use the objec-
tive function:
( ) RDФRDФ −=,,max . (5)
The goal of this “maximum relevance, minimum redundancy” (MRMR) approach [7] is to
reduce the attribute space. Using a smaller input data set (with the same information content) will
result in faster running as well as higher accuracy of machine-learned models. We use it to assess
the relative importance/redundancy between the fitted attributes from the physics models and the
computational geometry attributes on each of the subsets of the EMI and MAG data.
3.2. Application of MRMR
The first step in applying MRMR to the feature value quantification for the MEC discrimination
challenge is preparing the data set. The target of Interest (TOI) is discrete; each case is labeled
either as a 1 for MEC or as a 0 for not-MEC. However, the computational geometric paradigm
generates features that are represented as continuous variables. Mutual information of discrete
variables was used and the variables discritized by using two thresholds: the mean (+/-) al-
pha*standard deviation as discussed in [7]. MRMR is available as open source, as a web-based
application, C and Matlab code.
Table 1. Parameter settings for the MRMR algorithm
MRMR Parameter Parameter Value Used
Alpha 1.0
Variable states 3
Number variables retained 50
Feature Selection Scheme Mutual Information Difference (MID)
56 ISSN 1028-9763. Математичні машини і системи, 2011, № 2
3.3. MRMR Results
The MRMR ranking produces a rank-ordered list of features, with the top 50 of the 633 candidate
EMI features being retained. To understand how to use the MRMR results, consider an example
of the top three variables, V1, V2 and V3. This ranking means that if a single variable is desired,
then variable V1 should be used. If two variables are desired, then the combination of V1 and V2
is better than the combination of V1 and V3, or V2 and V3. The results discussed below indicate
that information contained by developing a feature data set using the computational geometric
approach for the EMI data set contains all the information that is contained in the inverse physics
modeling. However, the results of MRMR analysis on the MAG data set clearly show the syner-
gy possible when both using both methods are used. This finding is reinforced by the empirical
testing via machine learning, as discussed below.
3.4. Investigation of MRMR Results
Since the MRMR analysis resulted in minimal to no selection of the inverse physics model
attributes, and these very attributes are what the industry relies on for MEC discrimination, analy-
sis was conducted to further understand this finding.
3.4.1. EMI MRMR Results Analysis
Analysis of the EMI data revealed that the only fitted physics-derived variable in the top 50 rank-
ordered set was “Chi2,” which was ranked 45th out of 50 for variable importance. Forty-nine (49)
of the features in the top 50 were computational geometric (CG) features. The eigenvalue
attributes as described by the inverse physics modeling (β1, β2, β3) – did not appear in the list of
top 50 features. Table 2 shows an abbreviated list of features output by the MRMR code.
Table 2. Abbreviated output of MRMR analysis of the EMI data
Feature (attribute) Ranking Feature (attribute)
#1 CG-500
#2 CG-485
#3 CG-335
… …
#45 Chi2
To investigate why the eigenvalue attributes were not ranked with higher priority, we
tested whether or not the computational geometric attributes and the eigenvalues were informa-
tion content redundant. The results of our analysis show that they are, in fact, redundant. To as-
sess to the extent of the redundancy, we developed a function using a common set of eight
attributes from the computational geometric data set that explains more than 99% of the variation
in each of the eigenvalues β1, β2, β3. Hence, the features developed as part of the computational
geometric attribute formulation contain all the information that the eigenvalues have to offer.
This is demonstrated via a regression analysis using Multivariate Adaptive Regression Splines
(MARS) with 10 times cross-validation [9]. Thus, the need to develop attributes by fitting physics
models to the field data is unnecessary, at least in this example. Since the computational geome-
tric approach performs at lower S/N than the inverse physics modeling (10 vs. 100, respectively),
more TOI can be discriminated using this method. Moreover, the features that form the inputs to
the regression models are those that one would expect such as peak values, ratios between the
channels and parameters of power law fits. The results of the computational geometric attribute
data set’s ability to reproduce all of the fitted physics-based derived attributes are shown in Table
3: R2 denotes the correlation coefficient.
ISSN 1028-9763. Математичні машини і системи, 2011, № 2 57
Table 3. Reproducibility of inversion physics-based EMI features using CG-EMI features
Physics-based parameters obtained by inver-
sion
R2 obtained using 10 times cross-validation
β1 0,99313
β2 0,99315
β3 0,99320
Chi2 0,89701
Size 0,98115
Depth 0,99367
Coh 0,76570
In hindsight, it is not surprising that the computational geometric approach includes all of
the information that could be available by fitting physics models to the data. After all, we devel-
oped the computational geometric model with the discrimination physics in mind. However, this
is the first formal analysis that indicates that this information inclusivity is indeed the case.
Moreover, these results show that the computational geometric approach can be used to develop a
physics-based representation from the EMI Data. Interestingly, the one (Chi2 a measure of fitness
of the inverse fitted-physics model) attribute that did appear in the top 50 features is a solid indi-
cator of how well the inverse physics model is expected to fit the data. It is also important to note
that the computation geometric approach was able capture 89,7% of the variation in the expected
fitness of the inverse modeling. This ability to predict a priori how an inverse modeling task
should perform is extremely valuable for quality assurance/quality control purposes.
3.4.2. MAG MRMR Results Analysis
Analysis of the MAG data revealed three of the physics-derived variables in the top 50 of the
rank-ordered set; these are Fit_size (rank #1), Fit_inc ( rank #6), and Fit_Depth (rank #48). The
remaining 47 of the features in the top 50 were computational geometric attributes. A test of the
ability to produce the fitted physics-based attributes from the computational geometric attributes
was conducted, this time with very different results as shown in Table 4.
Table 4. Reproducibility of fitted physics-based MAG features using computational geometric
MAG attributes
Physics-based parameters obtained by inver-
sion (# is the parameter ranking)
R2 obtained using 10 times cross-validation
Depth (#48) 0,67
Size (#1) 0,73
Dec 0,21
Inc (#6) 0,49
Solid Angle 0,30
Magnetic Moment 0,49
Clearly, the less well-developed computational geometric approach for MAG sensors is
currently not as effective as the EMI approach in capturing the information content from the fit-
ted physics-based inversion model; therefore, further work in this area is warranted.
3.5. MRMR Analysis Summary
This MRMR approach is particularly valuable because it provides gives the ability to screen im-
portant features and reject ones of lesser value or that are redundant to making classification pre-
dictions without the need to run classification algorithms. This means that important variables can
be identified in minutes as opposed to hours or days of simulation computation time. Thus the
benefits associated with the machine-learning, algorithm-independent analysis of feature contri-
58 ISSN 1028-9763. Математичні машини і системи, 2011, № 2
bution made possible with the MRMR approach are multifold. Not only is it fast and cost-
efficient, it guides when easily computed data-driven features should replace more complex ones
to obtain features such as those arrived at via fitted physics-based inversion. Additionally, it pro-
vides a very fast and efficient screening mechanism to rank the value of new or proposed fea-
tures, especially when compared to existing features sets. Additionally, these characteristics of
the information-theoretic MRMR approach, when corroborated with results from machine-
learning algorithms, effectively streamline the understanding of attribute importance and help to
focus new research into less well-understood areas. This benefit is discussed in more detail be-
low.
4. Machine Learning Analysis and Results
Machine-learning (ML) techniques are tools that interrogate the information content in the data
set and then replace that content with a representative relation(s). That representation can then be
used to make predictions relative to unseen instances: in this case sensor data returned from a
geophysical investigation.
Based on the information-theoretic MRMR analysis outlined and demonstrated above, we
can anticipate and expect certain outcomes when building models from the data sets using ma-
chine-learning algorithms and various combinations of attributes. For example, models produced
using the EMI data set should rank as:
• Best: Combined Geometric and Fitted Model attributes;
• Second: Geometric attributes, and;
• Third: Fitted physics-attributes.
This ranking reflects the fact that the computational geometric approach replicated the in-
formation content in the inverted fitted physics models. The machine-learned model based on the
combined geometric-fitted physics attribute data may be slightly better (or tie with) the geometric
attribute model, since only one physics attribute appeared in the top 50 features (the measure of
the inverse physics model fitness) and then at a very low rank (#45). The data based on the fitted
physics models will rank as third accurate, to the extent that it does not contain the information
content that the geometric data set provides.
Models produced using the MAG data set are a different story. Clearly, the geometric
attributes present valuable information, as do the fitted physics-inversion attributes. One can only
conclude, therefore, that the combined CG-physics data set will produce a more accurate model
than either data or physics alone.
4.1. Empirical Testing using Machine Learning
The models were constructed from the EMI and MAG data sets (fitted physics, geometric, geo-
metric-fitted physics). All models were developed using 10 times cross-validation, and all used
the designated technique subset (not just the subset of the top 50 features identified above). The
tool used was TreeNET [9] and used with default settings, except the number of trees was set to
2,000.
4.1.1. EMI Machine Learning Results
The model results using the EMI inverse physics model data set is provided in fig. 5 and show
respectable MEC discrimination (ROC>0,95). A receiver operating characteristic (ROC) Chart is
one that plots the accuracy of a classifier over the data set; true positive rate of detection on the y-
axis and the false positive rate on the x-axis. The area under ROC curve (AUC) is used as a
measure of quality of a probabilistic classifier, with an area of 1.0 being best achievable, and 0.50
(blue line) being no better than random guessing. The graph shows the order of MEC removal,
progressing from left to right, with the final excavation occurring at the right-most section of the
ISSN 1028-9763. Математичні машини і системи, 2011, № 2 59
graph. The last remaining MEC item is removed when the value of the y-axis is 1.0. For MEC
removal projects, additional MEC would be removed beyond the last known MEC item as a
means of validation and stakeholder acceptance.
Fig. 5. EMI: Inverse Physics
Model, ROC (AUC) =0,98786
Fig. 6. EMI: Computational
Geometry,
ROC (AUC) =0,99609
Fig. 7. MAG: Inverse Physics
Model, ROC (AUC) =0,96358
Fig. 8. MAG: Computational
Geometry,
ROC (AUC) =0,95366
Fig. 9. EMI: Computational
Geometry and Inverse Physics
Model,
ROC (Are AUC)=0,99623
Fig. 10. MAG: Computation-
al Geometry and Inverse
Physics Model,
ROC (AUC) =0,97251
rated additional information essential for higher accuracy MEC classification. The combined
geometric-fitted physics model slightly outperformed the geometric-only model; this model in-
cluded the fitness of the inverse model to the data.
in fig. 8 also show respectable
MEC discrimination (ROC >
0,95), but with a slightly worse
curve indicating slower identifica-
tion of the final MEC item.
4.1.3. Combined EMI and MAG
Analysis
EMI: The model results using the
EMI-geometric and inverse phys-
ics model data set provided in fig.
The model results using the
EMI geometric data set (provided
in fig. 6) also show respectable
MEC discrimination (ROC > 0,95)
with a much better time/speed
curve for identifying MEC.
4.1.2. MAG Machine-Learning
Results
The model results using the MAG
fitted physics data set, provided in
fig. 7, show respectable MEC dis-
crimination (ROC > 0,95).
The model results using the
MAG geometric data set provided
9 also show respectable MEC dis-
crimination (ROC>0,95), again
with a slightly better curve show-
ing faster classification of MEC
signals. The expectation of the
classifier performance is in con-
cert with the understanding
gained from the information-
theoretic MRMR analysis. The
computational geometric model
performed better than the inverse
physics model, because it repli-
cated basically all the important
the information content of the fit-
ted physics model and also gene-
60 ISSN 1028-9763. Математичні машини і системи, 2011, № 2
Fig. 11. Machine learning over-fitting test using a
Monte-Carlo target (aka label) column scrambling
MAG: The model results using the MAG geometric and inverse physics model data set
(shown in fig. 10) also demonstrate respectable MEC discrimination (ROC>0,95), with a higher
AUC ROC value with a slightly better curve, but again indicate slower identification of the final
MEC found. The expectations of the classifier performance are in concert with the information-
theoretic MRMR analysis in terms of overall performance (a higher AUC ROC value was ob-
tained using the combined data-physics data sets). However, the overall identification of that last
MEC was slower; hence this solution would require more holes to be dug (and non-MEC items
excavated) than the other ones. These mixed results are indicative of a less than fully developed
MAG data and inverse-physics model.
5. Over-Fitting Test
In machine learning, over-fitting (also known as “memorizing”) is an important issue with respect
to assuring predictable performance on unseen data. Being able to predict how an algorithm will
do on unseen data is more important than the algorithm doing well on training data. To guard
against over-fitting, large data sets are divided into training, testing, and/or validation subsets
(where the model’s performance is solely judged on the validation performance statistics) or as in
our case, into a larger number of smaller data sets where a 10 times cross-validation approach
could be used.
The method used in this study to test
for over-fitting is to scramble the target col-
umns of the EMI-geometric attribute data set
using a Monte Carlo method. The target vari-
able (MEC or non-MEC) is solved once; then
the target column is scrambled 99 times (for a
total of 100 runs) using the exact same para-
meter settings and a machine-learning model
is then again built. If there is a structural flaw
in the experiment, it will show up. Specifical-
ly, if the accuracy of the solutions developed
with the Monte Carlo-scrambled (randomized
labels) targets are similar to the true target
sequencing; this is not a good result. This technique provides qa/qc check and safe-guards against
deploying models that, while they may look good in testing phase, they are but mere chance find-
ings and fitting noise – likely to perform poorly upon deployment. As shown in fig. 11, the true
solution (AUC=0,997) exceeds both the average (AUC=0,556, STDEV=0,054) and maximum
(AUC=0,737) of the Monte Carlo over-fitting test. Our conclusion is that the modeling approach
is valid, the model is identifying a signal (not just fitting noise); therefore, the results are expected
to be reasonably reliable for their intended purpose.
6. Summary and Results
We demonstrate the value and understandability of the computational geometric MEC discrimi-
nation method, and developed a methodology for understanding the value of MEC features by
applying information theory. We used machine learning to fuse the information content of
attributes derived from both machine-learning computational geometric and from fitted physics-
based models.
The authors believe that the inverse physics modeling, while providing great insight, over
compresses the information available in the geophysical signals into too few variables and hence
impose an artificial limit on that methods accuracy. The multi-disciplinary computational geome-
tric (MDCG) is intended to extend – not replace-this deep physics-based understanding by sup-
ISSN 1028-9763. Математичні машини і системи, 2011, № 2 61
plementing the discrimination information with factors the inverse physics modeling approach
cannot capture. For the test case, the MDCG approach was found to contain all the information
(in eight common variables) contained in the EMI data set, but not the MAG data set.
The empirical tests conducted using machine-learning are consistent with their perfor-
mance predicted using information theory. We further identified the information overlap between
our computational geometric approach and the fitted physics model approach by others: complete
overlap for the EMI sensor – indicating a rational physical basis for the method – and a partial
overlap for the MAG sensors.
REFERENCES
1. Bell T.H. Subsurface Discrimination Using Electromagnetic Sensors / T.H. Bell, B.J. Barrow, J.T. Mil-
ler // IEEE Transactions on Geoscience and Remote Sensing. – 2001. – Vol. 39, N 6. – P. 1286 – 1293.
2. Using Machine Learning to Compliment and Extend the Accuracy of UXO Discrimination Beyond the
Best Reported Results of the Jefferson Proving Ground / L.M. Deschaine, R.A. Hoover, J. N. Skibinski [et
al.] // Technology Demonstration. Society for Modeling and Simulation International’s Advanced Tech-
nology Simulation Conference. – San Diego, 2002. – April. – P. 46 – 52.
3. Deschaine L. M. Advanced MEC Discrimination Comparative Study on Standardized Test-Site Data
Using Linear Genetic Programming (LGP) Discrimination (MM-0811) [Електронний ресурс] / L.M. De-
schaine. – Completed 2009. – Режим доступу: http://www.estcp.org/Technology/MM-0811-FS.cfm.
4. ESTCP. Technical Report Description and Features of UX-Analyze ESTCP Project MM-0210. – 2009.
– 42 p.
5. Keiswetter D. SAIC Analysis of Survey Data Acquired at Camp Sibert / D. Keiswetter // Interim Re-
port, ESTCP Project MM-0210. – 2008. – July. – P. 112.
6. Nordin J.P. A Compiling Genetic Programming System that Directly Manipulates the Machine Code /
J.P. Nordin // Advances in Genetic Programming / K. Kinnear (ed.). – MIT Press, Cambridge MA, 1994. –
P. 311 – 331.
7. Peng H. Feature selection based on mutual information: criteria of max-dependency, max-relevance,
and min-redundancy / H. Peng, F. Long, C. Ding // IEEE Transactions on Pattern Analysis and Machine
Intelligence. – 2005. – Vol. 27, N 8. – P. 1226 – 1238.
8. Pintér J. D. Global Optimization in Action / J. D. Pintér // Kluwer Academic Publishers, Dordrecht.
Now distributed by Springer Science and Business Media. – New York, 1996. – 512 р.
9. Salford Systems Inc. Salford Data Miner Users Manual: CART Version 6.4, TreeNET Version 2.0,
MARS Version 3.0, and Random Forrest Version 1.0. – San Diego, CA, 2009.
10. Varmuza K. Monatsh. Chem / K. Varmuza. – 1974. – Vol. 105, N 1.
Стаття надійшла до редакції 28.05.2010
|