Well, I have a simplified implementation for
VisualSmalltalk of two ambiguous matching algorithm
based on Baeza-Yates, R. A. and Gonnet, G.H., "A new
approach to text searching", and Wu, S., and Manber,
U., "Fast text searching allowing errors" (search in
the ACM.org for details, I don't have the links at
hand right now).
I think maybe it could be useful for data cleansing,
and it should be easy to port to VisualWorks.
I can't post the link because I'm finishing my
website, however you can send me an e-mail if you're
interested.
Hernán
"Fast text searching allowing errors"
--- Günther Schmidt <
[hidden email]> wrote:
> Hi,
>
> I need to import data from an access file and
> eliminate duplicate records.
>
> Does anybody know of "Data cleansing" algos in
> Smalltalk?
>
> Günther
>
>
__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com