Statistical Engineering Division
Seminar Series
Detecting Fraud in the Real World
Jose Pinheiro
Biostatistics Division, Novartis Pharmaceuticals
Finding telecommunications fraud in masses of call records is more
difficult than finding a needle in a haystack. In the haystack
problem, there is only one needle that does not look like hay, the
pieces of hay all look similar, and neither the needle nor the hay
changes much over time. Fraudulent calls may be rare like needles
in haystacks, but they are much more challenging to find. Callers
are dissimilar, so calls that look like fraud for one account look
like expected behavior for another, while all needles look the same.
Moreover, fraud has to be found repeatedly, as fast as fraud calls
are placed, the nature of fraud changes over time, the extent of
fraud is unknown in advance, and fraud may be spread over more than
one type of service. For example, calls placed on a stolen wireless
telephone may be charged to a stolen credit card. Finding fraud is
like finding a needle in a haystack only in the sense of sifting
through masses of data to find something rare. This talk describes
some issues involved in creating tools for building fraud systems
that are accurate, able to adapt to changing legitimate and
fraudulent behavior, and easy to use.
(Joint Work with Diane Lambert and Don X. Sun)
NIST Contact:
Walter Liggett, x-2851.