HENP, Grids and the Networks They Depend Upon

HENP, Grids and the Networks They Depend Upon. Shawn McKee (smckee@umich.edu), March 2004, National Internet2 Day. Outline: HENP: Why do physicists care about the network? GRIDs and networks in HENP. Doing physics at the LHC. Future and Conclusions.

Presentation Transcript


  1. HENP, Grids and the Networks They Depend Upon Shawn McKee (smckee@umich.edu) March 2004 National Internet2 Day

  2. Outline • HENP: Why do physicists care about the network? • GRIDs and networks in HENP • Doing physics at the LHC • Future and Conclusions Internet2 Day - Shawn McKee - University of Michigan Physics

  3. Physics and Networks So, why do physicists care about networks? • I will try to explain how physics will be done at the LHC and the corresponding implications for network needs • Networks like Internet2 are critical for globally distributed, data-intensive e-Science collaborations such as physics at the LHC • Details to follow…

  4. Four LHC Experiments: The Petabyte to Exabyte Challenge • ATLAS, CMS, ALICE, LHCb • Higgs + new particles; quark-gluon plasma; CP violation • Data stores: ~40 Petabytes/year and up • CPU: 0.3 Petaflops and up • 0.1 Exabytes (2007) to 1.0 Exabytes (~2012?) for the LHC experiments (1 EB = 10^18 bytes)

  5. How Much Data is Involved? [Figure: Level-1 trigger rate (Hz, 10^2 to 10^6) versus event size (bytes, 10^4 to 10^7) for LHCb, ATLAS, CMS, HERA-B, KLOE, TeV II, CDF/D0, H1, ZEUS, ALICE, NA49, UA1 and LEP. Labeled regions: High Level-1 Trigger (1 MHz); High No. Channels, High Bandwidth (500 Gbit/s); High Data Archive (PetaByte). Source: Hans Hoffman, DOE/NSF Review, Nov 00.]

  6. The Problem

  7. The Solution

  8. What is “The Grid”? • There are many answers and interpretations • The term was coined in the mid-1990s (in analogy with the power grid) and can be described as follows: “The grid provides flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions and resources (virtual organizations, or VOs)”

  9. Grid Perspectives • User's Viewpoint: a virtual computer which minimizes time to completion for my application while transparently managing access to inputs and resources • Programmer's Viewpoint: a toolkit of applications and APIs which provide transparent access to distributed resources • Administrator's Viewpoint: an environment to monitor, manage and secure access to geographically distributed computers, storage and networks

  10. Network Exponentials • Network vs. computer performance: computer speed doubles every 18 months; network speed doubles every 9 months; difference = order of magnitude per 5 years • 1986 to 2000: computers x 500; networks x 340,000 • 2001 to 2010: computers x 60; networks x 4000
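The doubling times on this slide can be turned into growth factors with a few lines of arithmetic. The sketch below is an illustration only, assuming idealized exponential growth with exactly the 18-month and 9-month doubling times quoted; the function and constant names are hypothetical and not from the talk.

```python
COMPUTER_DOUBLING_MONTHS = 18.0  # doubling time quoted on the slide
NETWORK_DOUBLING_MONTHS = 9.0    # doubling time quoted on the slide


def growth_factor(years: float, doubling_months: float) -> float:
    """Growth multiple after `years`, assuming a fixed doubling time."""
    return 2.0 ** (years * 12.0 / doubling_months)


def compare(years: float):
    """Return (computer growth, network growth, network/computer ratio)."""
    computers = growth_factor(years, COMPUTER_DOUBLING_MONTHS)
    networks = growth_factor(years, NETWORK_DOUBLING_MONTHS)
    return computers, networks, networks / computers


if __name__ == "__main__":
    periods = [("5 years", 5), ("1986 to 2000", 14), ("2001 to 2010", 9)]
    for label, years in periods:
        c, n, ratio = compare(years)
        print(f"{label}: computers x{c:,.0f}, networks x{n:,.0f}, ratio x{ratio:,.0f}")
```

Under these assumptions the five-year ratio comes out at roughly 10, matching the "order of magnitude per 5 years" bullet, and the 2001 to 2010 factors (64 and 4096) are close to the slide's x 60 and x 4000. The idealized 1986 to 2000 factors come out somewhat higher than the slide's x 500 and x 340,000, so those figures presumably reflect measured or differently rounded data.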

  11. The Network • As can be seen in the previous transparency, it can be argued that the evolution of the network has been the primary motivator for the Grid. • Ubiquitous, dependable worldwide networks have opened up the possibility of tying together geographically distributed resources • The success of the WWW for sharing information has spawned a push for a system to share resources • The network has become the “virtual bus” of a virtual computer.

  12. Doing Physics at the LHC: ATLAS as an example

  13. ATLAS • A Toroidal LHC Apparatus • Collaboration: 150 institutes, 1850 physicists • Detector: inner tracker, calorimeter, magnet, muon • United States ATLAS: 29 universities, 3 national labs, 20% of ATLAS

  15. ATLAS

  16. Discovery Potential for SM Higgs Boson • Good sensitivity over the full mass range from ~100 GeV to ~1 TeV • For most of the mass range at least two channels are available • Detector performance is crucial: b-tag, leptons, γ, E resolution, γ/jet separation, ...

  17. HEP Data Analysis • Raw data: hits, pulse heights • Reconstructed data (ESD): tracks, clusters… • Analysis Objects (AOD): physics objects; summarized; organized by physics topic • Ntuples, histograms, statistical data

  18. Data Flow from ATLAS • Detector output: 40 MHz (~PB/sec) • Level 1 (special hardware) output: 75 kHz (75 GB/sec) • Level 2 (embedded processors) output: 5 kHz (5 GB/sec) • Level 3 (PCs) output: 100 Hz (200-400 MB/sec) to data recording & offline analysis • ATLAS: 10 PB/y ~ one million PC hard drives!
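The rates on this slide imply how hard each trigger level filters and how large an event is at each stage. The following sketch works through that arithmetic. It assumes decimal prefixes (1 GB = 10^9 bytes) and reads the slide as a chain from detector through level 3; the dictionary and function names are hypothetical, chosen for illustration.

```python
# Event rates (Hz) quoted on the slide, in the order data flows through the trigger.
RATES_HZ = {
    "detector": 40e6,  # 40 MHz
    "level 1": 75e3,   # 75 kHz
    "level 2": 5e3,    # 5 kHz
    "level 3": 100.0,  # 100 Hz
}

# Data rates (bytes/s) quoted on the slide as (low, high); decimal prefixes assumed.
BANDWIDTH_BYTES_PER_S = {
    "level 1": (75e9, 75e9),
    "level 2": (5e9, 5e9),
    "level 3": (200e6, 400e6),
}


def rejection_factors(rates):
    """Factor by which each trigger level reduces its incoming event rate."""
    names = list(rates)
    return {
        names[i + 1]: rates[names[i]] / rates[names[i + 1]]
        for i in range(len(names) - 1)
    }


def implied_event_size(level):
    """(low, high) bytes per event implied by the quoted bandwidth and rate."""
    low, high = BANDWIDTH_BYTES_PER_S[level]
    rate = RATES_HZ[level]
    return low / rate, high / rate


if __name__ == "__main__":
    for level, factor in rejection_factors(RATES_HZ).items():
        low, high = implied_event_size(level)
        print(f"{level}: rate reduced x{factor:,.0f}, "
              f"{low / 1e6:.0f}-{high / 1e6:.0f} MB per event")
    overall = RATES_HZ["detector"] / RATES_HZ["level 3"]
    print(f"overall reduction: x{overall:,.0f}")
```

Read this way, the three levels reduce the event rate by factors of roughly 533, 15 and 50, about 400,000 overall. The quoted bandwidths imply about 1 MB per event after levels 1 and 2, and 2-4 MB per event at recording; the slide does not explain that difference.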

  19. HENP Grid/Network Projects • Grid Physics Network (GriPhyN): enabling R&D for advanced data grid systems, focusing in particular on the Virtual Data concept • International Virtual Data Grid Laboratory (iVDGL): a global grid laboratory to conduct grid tests “at scale” • There are numerous other projects focused on various aspects of grids and networks in support of HENP physics…

  20. UltraLight: Exploring Future Networks for e-Science • UltraLight is a program to explore the integration of cutting-edge network technology with the grid computing and data infrastructure of HEP/Astronomy • The program intends to explore network configurations ranging from common shared infrastructure (current IP networks) through dedicated point-to-point optical paths • A critical aspect of UltraLight is its integration with two driving application domains in support of their national and international eScience collaborations: LHC-HEP and eVLBI-Astronomy • The collaboration includes: Caltech, Florida Int. Univ., MIT, Univ. of Florida, Univ. of Michigan, UC Riverside, BNL, FNAL, SLAC, UCAID/Internet2

  21. The Move to OGSA and then Managed Integration Systems [Figure: increased functionality and standardization plotted against time. Stages along the time axis: custom solutions; Globus Toolkit (X.509, LDAP, FTP, …; de facto standards; GGF: GridFTP, GSI); Open Grid Services Architecture (Web services + …; GGF: OGSI, … (+ OASIS, W3C); multiple implementations, including Globus Toolkit); Web Services Resource Framework (stateful, managed); app-specific services and ~integrated systems.]

  22. Managing Global Systems: Dynamic Scalable Services Architecture MonALISA: http://monalisa.cacr.caltech.edu

  23. Grid Analysis Environment CLARENS: Web Services Architecture • Analysis Clients talk standard protocols to a simple API • The secure Clarens portal hides the complexity • Key features: Global Scheduler, Catalogs, Monitoring, and Grid-wide Execution service • The network underlies and enables this model [Figure: several Analysis Clients connect over HTTP, SOAP, XML/RPC to a Grid Services Web Server. Components shown behind it: Scheduler; Catalogs; Fully-Abstract, Partially-Abstract and Fully-Concrete Planners; Metadata; Virtual Data; Replica; Applications; Data Management; Monitoring; Grid Execution Priority Manager; Grid Wide Execution Service.]
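To make "clients talk standard protocols to a simple API" concrete, here is a minimal sketch of what an XML-RPC call looks like on the wire, using only Python's standard `xmlrpc.client` module. The method name `catalog.lookup` and its argument are hypothetical placeholders; they are not taken from the Clarens API, and no server is contacted.

```python
import xmlrpc.client

# Hypothetical method name and argument, for illustration only
# (not an actual Clarens service call).
METHOD = "catalog.lookup"
ARGS = ("example-dataset",)


def build_request(method, args):
    """Encode a method call as the XML body an XML-RPC client would POST over HTTP."""
    return xmlrpc.client.dumps(tuple(args), methodname=method)


def parse_request(payload):
    """Decode an XML-RPC request body back into (method name, parameters)."""
    params, method = xmlrpc.client.loads(payload)
    return method, params


if __name__ == "__main__":
    payload = build_request(METHOD, ARGS)
    print(payload)                 # the XML document sent to the web server
    print(parse_request(payload))  # what the server side would decode
```

The point of the sketch is that the client needs nothing beyond an HTTP connection and a generic XML encoder, which is why a web-services portal can hide the scheduler, catalogs and execution services behind one endpoint.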

  24. Conclusions • Networks form the critical basis for the future of e-Science • LHC physics will depend heavily on globally distributed resources => the NETWORK is critical! • Future requirements for grids and networking in support of HENP physics are an open question, which will need investigation in order to define, develop and deploy the needed infrastructure in a timely manner.

  25. For More Information… • HENP Internet2 SIG: henp.internet2.edu • Global Grid Forum: www.ggf.org • International Virtual Data Grid Laboratory: www.ivdgl.org • Grid Physics Network: www.griphyn.org • UltraLight: ultralight.caltech.edu Questions?
