Abstract
Similarity is a vague concept which can be treated in a
quantitative manner only using appropriate mathematical
representation of the objects to compare and a metric on
the space representation. In biology the mathematical representation
of structure relies on strings taken from an alphabet
of m symbols. Very often binary strings, m = 2, are
used. The size of the binary string depends on the complexity
of the structure to represent, so the string can be quite
long. The Hamming distance is the most used metric with
binary strings. The computational effort required to compute
the Hamming distance linearly depends on the size of
the string. However even a linear effort case may be computational
heavy if many computations are required. One of
the fastest computational approach to evaluate Hamming
distances relies on look-up tables. The computational performance,
however, rapidly deteriorates with the size of binary
string length, due to cache misses. We present a computational
strategy and implementation which can handle
huge number of Hamming distance evaluation between binary
strings of arbitrary length keeping computational performance
competitive.
| Lingua originale | Inglese |
|---|---|
| Pagine | 569-572 |
| Numero di pagine | 4 |
| DOI | |
| Stato di pubblicazione | Pubblicato - 2009 |
| Evento | CSIE 2009 IEEE Computer Science and Information Engineering - Los Angeles, USA. Durata: 1 gen 2009 → … |
???event.eventtypes.event.conference???
| ???event.eventtypes.event.conference??? | CSIE 2009 IEEE Computer Science and Information Engineering |
|---|---|
| Città | Los Angeles, USA. |
| Periodo | 1/01/09 → … |
Keywords
- Mathematical representations
- Hamming distance
- Computational performance
Fingerprint
Entra nei temi di ricerca di 'HAMFAST: Fast hamming distance computation'. Insieme formano una fingerprint unica.Cita questo
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver