1 / 12

String Matching in Hardware using the FM-Index

String Matching in Hardware using the FM-Index. Author: Edward Fernandez, Walid Najjar and Stefano Lonardi Publisher: FCCM,2011 Presenter: Jia-Wei,You Date : 2012/4/11. Introduction. String matching is the problem of searching for patterns in a long text .

aaralyn
Download Presentation

String Matching in Hardware using the FM-Index

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. String Matching in Hardware using the FM-Index Author: Edward Fernandez, WalidNajjar and Stefano Lonardi Publisher: FCCM,2011 Presenter:Jia-Wei,You Date:2012/4/11

  2. Introduction • String matching is the problem of searching for patterns in a long text. • A recent breakthrough in this field is the FM-index, a data structure that synergistically combines the Burrows-Wheeler transform and the suffix array. • It is compared to the brute force approach and it is shown that the FM-index has a higher effective throughput than the brute force. This is due to the higher number of character comparisons per cycle performed by the FM-index.

  3. Burrows-Wheeler transform

  4. I-table & C-table Q = GCTAATTAGGTACC$ BWT(Q) = CTTTACAG$AGCGTA SBWT(Q) = $AAAACCCGGGTTTT

  5. Searching and locating Pattern searching using the FM-index starts with initializing the topandbottom pointers to the first and last indices of the C-table respectively. Process one character at a time, beginning with the last character of the pattern. The top and bottom pointers move to different suffix array indices according to the current character processed and the current index where the top and bottom pointers are indexing.

  6. Searching and locating(1/3)

  7. Searching and locating(2/3)

  8. Searching and locating(3/3)

  9. Architecture

  10. Performance(1/3) Xilinx Virtex 6(XC6VLX760) 262144 characters

  11. Performance(2/3)

  12. Performance(3/3)

More Related