Desktop as a Service using Moka5+VMWare+BioSlax: its potential for Grid and Cloud Computing. Tan Tin Wee, Dept of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore. Merging two trajectories: LiveCD/DVD/USB/VM; eScience and Grid Computing.
Desktop as a Service using Moka5+VMWare+BioSlax
Its potential for Grid and Cloud Computing
Tan Tin Wee
Dept of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore
Merging two trajectories

LiveCD/DVD/USB/VM
• APBioGrid: Globus Grid 1.1.6/7 - Ong Guan Sin (2002)
• APBioBox 1 and 2 with Sun’s BioClusterGrid/BioBox and KOOPLite workflow integration
• APBioKnoppix 1 and 2 LiveCD - Derek Kiong, Justin Choo (2003-2006)
• BioSlax LiveCD - Mark de Silva and Lim Kuan Siong (2005-2009)
• BioSlax LiveDVD, LiveUSB, VMplayer (2007-2009)
• BioSlax Server (2008-2009)
• BioSlax7.5-VMplayer-MokaFive (2009)
• UDBlast-BioSlax-VMplayer-MokaFive (2009)

eScience and Grid Computing
• SINGAREN (1998)
• Telemanufacturing, eScience, remote control of Synchrotron beam line (1999-2001)
• NUS BioGrid and APBioGrid (2001-2002)
• National Grid and LSVGC database (2004-2005)
• BioManufacturing Grid (SERC Proof of Concept for Integrated Manufacturing and Services Systems (IMSS) grant) (2004-2006)
  - KOOPrime’s KOOPlatform
  - SIMTech and NTU
  - APSTC Sun Microsystems
• TeraCampus Grid TCG@NUS (2005-2009)
• 1st World Wide Workflow Grid - Chris Baker (2008)
NUS BioGrid and APBioGrid, 2001-2002
http://www.bic.nus.edu.sg/biogrid/
http://www.startap.net/starlight/igrid2002/apbiogrid02.html
APBioBox, 2002-2004
http://compaq.apbionet.org/grid/apbiobox/
http://compaq.apbionet.org/grid/apbiobox/lsvgcApr04/index.shtml
National Grid Office and LS VGC
http://lsdb.bii.a-star.edu.sg/
APBioKnoppix (2004-2006)
Bioinformatics in a LiveCD
Used by 500 NUS students a year in Life Science Module LSM2104 from 2004 to 2007
http://compaq.apbionet.org/grid/apbioknoppix/
Integrated Biolaboratory Manufacturing and Services System, 2004-2005
A TSRP pilot project, Group 9, Project 042-112-0020
Integrated Manufacturing and Services Systems (IMSS): for acceleration of specific areas of manufacturing, in particular biomanufacturing.
www2.jp.apan.net/meetings/bangkok2005/presentation/tantinwee-APAN19-eScienceBKKJan05.ppt
1st WWWFG 2007
http://www.apbionet.org/wwwfg/programme.shtml
Grid Apps on TCG@NUS Grid
http://www.nus.edu.sg/comcen/svu/services/gridapps.htm
UDBlast at TCG@NUS
http://srs1.bic.nus.edu.sg/udblast.html
http://srs1.bic.nus.edu.sg/udblast_checkjob.html
http://srs1.bic.nus.edu.sg/udblast_retrieve.html
Moka5+VMWare+BioSlax
Merged two trajectories, LiveCD/DVD/USB/VM and eScience and Grid Computing, on TCG@NUS
BioSlax with VMplayer
Bioinformatics software in a virtual machine
• Packages
• Applications
• Programming languages: C, Perl, Python, etc.
• GUI interfaces
• Servers: Apache, MySQL
No need to reboot: just install VMplayer, run the BioSlax vmx file and create a new instance of a BioSlax virtual machine.
Modules
• e.g. newmodule.lzm
• Stored in /mnt/sda1/modules
• Activated during boot up
Addable Modules
• Compile -> dir2lzm -> copy into virtual HDD /mnt/sda1/modules
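The compile, dir2lzm, copy sequence can be sketched in Python as below. This is a minimal illustration, not the BioSlax tooling itself: the helper names `dir2lzm_command` and `install_module` are hypothetical, and the sketch assumes that `dir2lzm` takes a source directory followed by an output file name, as on Slax 6 based systems.

```python
import shutil
from pathlib import Path


def dir2lzm_command(source_dir, module_name):
    """Return the argv list that would pack source_dir into an .lzm module.

    Assumption: dir2lzm is invoked as "dir2lzm <source_dir> <output.lzm>".
    """
    if not module_name.endswith(".lzm"):
        module_name += ".lzm"
    return ["dir2lzm", str(source_dir), module_name]


def install_module(module_file, modules_dir="/mnt/sda1/modules"):
    """Copy a built .lzm module into the directory that is read at boot."""
    target_dir = Path(modules_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    # shutil.copy2 returns the path of the copied file inside target_dir.
    return Path(shutil.copy2(module_file, target_dir))
```

On a running BioSlax system the command list could be passed to `subprocess.run(..., check=True)` before calling `install_module`; the module then becomes active at the next boot.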
Persistent Changes
• changes=/mnt/sda1/changes
• To change the boot-up sequence so that a program is started, add /etc/rc.d/rc.newprog
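The two ideas on this slide can be sketched in Python: reading the `changes=` boot parameter, and creating an executable `rc.newprog` script. The function names and the sample command are hypothetical, and the sketch assumes that the boot sequence (for example rc.local) actually invokes the new script; the slide only says that the file should be added.

```python
from pathlib import Path


def changes_location(cmdline):
    """Return the value of the changes= boot parameter, or None if absent."""
    for token in cmdline.split():
        if token.startswith("changes="):
            return token[len("changes="):]
    return None


def write_rc_script(rc_dir, name, command):
    """Create an executable rc.<name> shell script that runs one command.

    Assumption: something in the boot sequence calls this script.
    """
    script = Path(rc_dir) / f"rc.{name}"
    script.write_text(f"#!/bin/sh\n# Hypothetical start-up script\n{command}\n")
    script.chmod(0o755)
    return script
```

For example, `changes_location("quiet changes=/mnt/sda1/changes")` yields the directory where changes persist, and `write_rc_script("/etc/rc.d", "newprog", "/usr/local/bin/newprog &")` would create the script named on the slide (the command path is a placeholder).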
BioSlax 7.5
• Bootable with changes
• LiveCD, LiveDVD, LiveUSB
• BioSlax-VMplayer version
• MokaFive version portable on Intel machines
• What has it been used for, and what can it be used for?
• Desktop-as-a-Service
• UD Grid mpagent: ftp://sf01.bic.nus.edu.sg/incoming/bioslax/fixes/zz01_grid_mpagent.lzm
BioSlax with VMplayer+MokaFive
• Downloadable from MokaFive server
• Multiple instantiations
• Shoppable
• Cacheable LivePC
• No need for installation
• Portable on a USB
• Rapid deployment, user initiated
Grid-enabling BioSlax: UD MP agent in BioSlax
• History: TCG@NUS
• UDBlast and UDHMMer
• Rapid recruitment of machines
• No need for user installation
• Push or pull instances
• End-user joining of the Grid
Joining a virtual machine to TCG@NUS http://srs1.bic.nus.edu.sg/uddevlist.html
UD-BioSlax-VMware-Moka5
• How Desktops-as-a-Service can be used to harness free CPU cycles quickly
• End users and members of the public can contribute
• “Contribute-to-Benefit” model: end users can log in to monitor their “credits”, which depend on how much CPU time they have contributed and how much they pay.
• Creation of public-participation grid computing and public recruitment of grid-enabled processors without interfering with the end user’s machine.
• Inter-Grid functionality: deploy instances of UD-BioSlax on multiple clouds
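The “Contribute-to-Benefit” idea can be illustrated with a small credit ledger in Python. This is only a sketch under stated assumptions: the slides do not define how credits are calculated, so the linear formula, the two rates and the `Account` class are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical rates; the source does not specify any credit formula.
CREDITS_PER_CPU_HOUR = 1.0
CREDITS_PER_UNIT_PAID = 10.0


@dataclass
class Account:
    """Tracks what one end user has contributed and paid."""
    user: str
    cpu_hours: float = 0.0
    paid: float = 0.0

    def contribute(self, hours):
        """Record CPU time donated by the user's virtual machine."""
        if hours < 0:
            raise ValueError("hours must be non-negative")
        self.cpu_hours += hours

    def pay(self, amount):
        """Record a payment made by the user."""
        if amount < 0:
            raise ValueError("amount must be non-negative")
        self.paid += amount

    @property
    def credits(self):
        """Credits earned from contributed CPU time plus payments."""
        return (self.cpu_hours * CREDITS_PER_CPU_HOUR
                + self.paid * CREDITS_PER_UNIT_PAID)
```

A user who logs in would see `account.credits` grow with each call to `contribute` or `pay`; a real deployment would take the CPU time from the grid scheduler's own accounting rather than from manual calls.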
• Moka5 can push the update
• Alternatively, BioSlax can do a PXE boot
• End users can run the agent whenever they want to
• MPagent wrapped in BioSlax, played in VMplayer and pushed by Moka5 is portable
• UD MPagent updates require reinstallation
• Variation of the scheduler requires reinstallation
• End users have to install the MPagent
• MPagent is not portable
Future Work
• BioSlax LivePC pushed to clouds
• End users can have BioSlax as a development platform that will work exactly as the BioSlax pushed to the cloud.
• Nutch-Hadoop configured on BioSlax and pushed onto Amazon EC2, for example
• Bioinformatics datasets will be permanently pushed to EC2 by data providers as a commonly accessible database for end users to compute on