
A Fault Tolerant Gaussian Elimination Solver for the Cell Broadband Engine

James Geraci, Lead Researcher, Square Enix Co., Ltd., Research and Development Division.


Presentation Transcript


  1. A Fault Tolerant Gaussian Elimination Solver for the Cell Broadband Engine. James Geraci, Lead Researcher, Square Enix Co., Ltd., Research and Development Division.

  2. Introduction to Square Enix Group
     • Square Enix Group is a Japanese entertainment content/service developer and publisher.
     • Best known for the following video game franchises:
         • FINAL FANTASY, DRAGON QUEST (SQUARE ENIX)
         • Tomb Raider (EIDOS)
         • Space Invaders (TAITO)
     • Approximately 3000 employees*1 and ¥135.6 billion*2 ($1.5 billion) in sales.
       *1 As of March 31, 2009. *2 FY2008.
     • Develops for Nintendo DS, PSP, Xbox 360, PlayStation 3, Wii, PC, iPhone, cell phones, etc.

  3. Fault tolerant Gaussian elimination
     • The fault tolerance idea is to back up on-chip data to main memory at checkpoints.
     • The algorithm’s natural serialization points serve as the checkpoints.
     • When a fault occurs, the backed-up data is used to redistribute the workload among the remaining cores.
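
The checkpoint-and-restore idea on slide 3 can be illustrated with a small single-process sketch. It is not the presenter's Cell implementation: the slides do not say where the serialization points fall, so the sketch assumes one checkpoint after each pivot step, and the names `eliminate_with_checkpoints`, `local`, `backup`, and `fail_at` are hypothetical. `local` stands in for on-chip data, `backup` for the copy in main memory, and `fail_at` injects one simulated fault.

```python
import copy


def eliminate_with_checkpoints(a, b, fail_at=None):
    """Solve a x = b by Gaussian elimination with partial pivoting.

    Assumption: the end of each pivot step is treated as a serialization
    point, and the working data is checkpointed there. `fail_at` names one
    pivot step whose results are "lost" once, to simulate a fault.
    Returns the solution and the number of restores performed.
    """
    n = len(a)
    # Augmented matrix [a | b]; stands in for data held on-chip.
    local = [row[:] + [rhs] for row, rhs in zip(a, b)]
    # Initial checkpoint; stands in for the copy in main memory.
    backup = copy.deepcopy(local)
    restores = 0
    k = 0
    while k < n:
        # Partial pivoting: pick the row with the largest entry in column k.
        p = max(range(k, n), key=lambda i: abs(local[i][k]))
        if local[p][k] == 0:
            raise ValueError("matrix is singular")
        local[k], local[p] = local[p], local[k]
        # Eliminate column k from every row below the pivot row.
        for i in range(k + 1, n):
            f = local[i][k] / local[k][k]
            for j in range(k, n + 1):
                local[i][j] -= f * local[k][j]
        if fail_at == k:
            # Simulated fault: discard this step's results, restore the
            # last checkpoint, and redo step k.
            fail_at = None
            local = copy.deepcopy(backup)
            restores += 1
            continue
        # Serialization point reached: checkpoint the working data.
        backup = copy.deepcopy(local)
        k += 1
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(local[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (local[i][n] - s) / local[i][i]
    return x, restores


if __name__ == "__main__":
    a = [[2.0, 1.0, -1.0], [-3.0, -1.0, 2.0], [-2.0, 1.0, 2.0]]
    b = [8.0, -11.0, -3.0]
    print(eliminate_with_checkpoints(a, b, fail_at=1))
```

Because each step restarts from the state saved at the previous serialization point, the run with an injected fault yields the same solution as a fault-free run; only the count of restores differs.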

  4. Fault Tolerance Capabilities
     • Fault Tolerance: when cores fail, the workload is redistributed among the remaining cores.
     • Addition of Cores: when cores are added, rows are dynamically redistributed across the enlarged set.
     • Fault Tolerance with Replacement: N failed cores are replaced with M new cores.
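
The three capabilities on slide 4 all reduce to recomputing which core owns which rows once the set of live cores changes. The sketch below shows that bookkeeping only. The slides do not state the distribution policy, so round-robin assignment is an assumption, and `update_cores` and `assign_rows` are hypothetical helper names, not part of any Cell SDK.

```python
def update_cores(cores, failed=(), added=()):
    """Return the list of live core ids after failures and additions."""
    failed = set(failed)
    live = [c for c in cores if c not in failed]
    for c in added:
        if c not in live:
            live.append(c)
    return live


def assign_rows(n_rows, cores):
    """Map each live core id to the matrix rows it owns.

    Assumption: rows are dealt out round-robin; the slides do not say
    which policy the original solver uses.
    """
    if not cores:
        raise ValueError("no live cores left to take the workload")
    owners = {c: [] for c in cores}
    for r in range(n_rows):
        owners[cores[r % len(cores)]].append(r)
    return owners


if __name__ == "__main__":
    cores = [0, 1, 2, 3]
    print(assign_rows(8, cores))
    # Fault tolerance: core 2 fails, its rows go to the survivors.
    print(assign_rows(8, update_cores(cores, failed=[2])))
    # Addition of cores: cores 4 and 5 join.
    print(assign_rows(8, update_cores(cores, added=[4, 5])))
    # Replacement: N = 2 failed cores replaced with M = 3 new cores.
    print(assign_rows(8, update_cores(cores, failed=[1, 3], added=[4, 5, 6])))
```

In a real solver the reassigned rows would be reloaded from the main-memory checkpoint rather than from the failed core, which is why the backup has to exist before the fault occurs.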
