820 likes | 828 Views
Explore the importance of data center management, the challenges faced by facilities, and the solutions for effective planning and design. Learn about tools and strategies for capacity planning, power effectiveness, and resilience in data center operations.
E N D
PTS Data Center Solutions Data Center Decisions Presentation 2008
Capacity Planning and Modeling Tools for Data Center Design and Management Pete Sacco PTS Data Center Solutions, Inc. (201) 337-3833 x101 psacco@ptsdcs.com
Agenda Why is data center management important? What can be achieved with effective data center management? The data center facility lifecycle Data center facility lifecycle tools
Why Is DCM Important? IT requirements continually outpace facility capacity leading to ever increasing availability failure rates We are running out of the skilled labor needed to design and operate data center facilities Data center operators everywhere are running out of space, power and cooling simultaneously The costs of operating a data center are rising dramatically as the cost of both raw commodities and energy rise Power densities on a per rack basis are on the rise
DCI -- By 2015, the talent pool of qualified senior level technical and management data center professionals will shrink by 45% AFCOM -- 71% will need to physically expand their data center and/or outsource some applications in the next 10 years due to business growth, aging facilities and new technology DCI -- Over the next five years power failures and limits on power availability will halt data center operations at more than 90% of all companies The Grim Reality
The Grim Reality • ARI -- 2/3 were not planning to build new data centers due to the cost and time it takes to complete such a project • AFCOM -- 81.2% of members have experienced a data center failure in the last five years • DOE -- 100% probability of another ‘August 14,2003’ wide area outage in the next five years
What are the greatest facility problems in data centers? 29% Excessive Heat 29% Insufficient Power 21% Insufficient Floor Space 6% Poor Location 3% Excessive Operating Costs 13% None of the Above AFCOM Survey Results
DCD’s Rack Power Density Survey Source: DatacenterDynamics 2007 Survey
Better predictability of space, power, cooling capacity and redundancy limits means more time to plan on ways to mitigate their affect Increased real-time availability of IT operations as a result of an enhanced understanding of the present state of the power and cooling infrastructure and environment Reduced operating cost from energy usage effectiveness and efficiency as well as operator effectiveness from the use of automated tool sets What Can Be Achieved With DCM?
Power Usage Effectiveness And Saving PUE = 1.7 If a Water-Side Economizer is installed, can achieve a PUE = 1.4
Data Center Management (DCM) An Adaptive Strategy
Traditional Data Center Management IT Assets Facility Assets Access Control Emergency Power Servers Network Storage Software Applications IT Services Liebert SiteScan Network Management Systems Building Management Systems Many Others Sensors SNMP ALC Andover Cooling System Environmental Systems
A process to provide facility and support infrastructure capacity, redundancy predictability and adaptability throughout the data center facility lifecycle: 3D Integrated Space, Power and Cooling Capacity and Redundancy Modeling Graphical Layout and Elevations Room and Rack Space Capacity and Redundancy Planning Room and Rack Power Capacity and Redundancy Planning Room and Rack Cooling Capacity and Redundancy Planning (CFD) Room and Rack Cooling Capacity and Redundancy Planning (non-CFD) Energy Efficiency Analysis (CFD) Energy Efficiency Analysis (non-CFD) Capacity Management
Planning Data Center Planning Narrow the Key Design Criteria
Data Center Planning Tools • CFD Modeling: • Cooling Approach Comparison • Computer Room Airflow Modeling • Based on RLU Density Assumptions
Design Data Center Design
Data Center Design Tools • CFD Modeling: • Cooling Approach Comparison • Computer Room Airflow Modeling • Based on RLU Density Assumptions • AutoCAD: • Schematics and Specifications • Construction Documents • Building Information Modeling: • Visual Coordination Modeling • Aesthetic Evaluation
Design / Build versus Bid and Specification Same process regardless of large or small, new construction or retrofit, complete or partial Construction tools include: MS Project or other Project Management (PM) tools Online Project Management Tools Integrated Communication Technologies Integrated Back Office Software Systems Data Center Construction Construction & Commissioning
Data Center Commissioning Construction & Commissioning
Installation of voice and data structured cabling, pathway and cable management Installation of IT infrastructure including servers and storage, network and security and voice and communications systems Construction tools include: MS Project or other PM tools Online Project Management Tools Integrated Communication Technologies Integrated Back Office Software Systems IT Implementation IT Implementation
Provides organization and ease management of both IT and support infrastructure Should include: Physical location, space requirement and service area requirement Unique ID tag Configuration and ratings information Document and data management (SOW, contacts, etc.) Event history and alerts (start-up, maintenance, etc.) Asset Management Asset Management
For IT devices power ratings data should include: Nameplate Rating Overstated value including peak operation and low-line conditions Design Rating Value from which power protection, distribution and cooling infrastructure sizing can be calculated -- typically 50% to 75% of nameplate rating Real-Time Values Average, weekly peak and weekly minimum Asset Management Asset Management
Asset Management Tools • Spreadsheets or Custom Database: • Inexpensive • High management • Visual Organizers (e.g., Visio or NetZoom) • Less Expensive • High Management • Low Functionality • Asset Management Software: • More Expensive • High Functionality • Medium Learning Curve to Maintain
Real-Time Monitoring • Honeywell • Johnson Controls • Siemens • TAC • Tyco • CA Unicenter • CiscoWorks • HP OpenView • IBM Tivoli • Nagios • Microsoft Operations Manager • WhatsUp Gold • APC InfraStruXure Central • Avocent’s DSView 3 • Emerson’s Aperture VISTA • Epicenter’s Synatix • Modius’ OpenData Enterprise DCiM • Raritan’s CommandCenter Real-Time Monitoring
Building Management System (BMS) is a computer based control system installed in buildings that controls and monitors the building’s mechanical and electrical equipment such as air handling and cooling plant systems, lighting, power systems, fire systems and security systems. Real-Time Monitoring -- BMS Real-Time Monitoring
A Network Management System (NMS) is a combination of hardware and software used to monitor and administer a network as opposed to controlling it. An NMS is commonly a suite of software applications which allow large-scale system and network management of an organization's IT infrastructure. Real-Time Monitoring -- NMS Real-Time Monitoring
A Data Center Monitoring System (DCMS) uses a combination of hardware and software to monitor and administer a data center or computer room facility. Real-Time Monitoring -- DCMS Real-Time Monitoring
Real-Time Monitoring Tools • Power Strips and Branch Circuit Monitoring • By cabinet versus by device • Temperature and humidity • RLU inlet temperature • UPS, PDU and CRAC • SNMP data • IT infrastructure • IPMI data
Data Center Maintenance Computerized Maintenance Monitoring Software (CMMS) is an application that helps organizations increase asset life, track maintenance costs, prevent and predict equipment failures, improve labor productivity, reduce costly equipment downtimes, minimize investments in inventory and lower the total cost of maintenance. Maintenance • Modules • Assets • Work Orders • Purchase Orders • Preventative Maintenance • Inventory • Procedures • Tasks • Projects • Repair Centers • Shops • Labor • Crafts • Requesters • Features • Multi-Site Asset Tracking • Multi-Site Work Orders • Work Order Scheduling / Work Manager • Preventive Maintenance • Procedure Library (300+ Procedures) • Multiple Stockroom Inventory Tracking • Generate Purchase Orders • Project Management Contract Management • Accounts / Budgeting • Categories • Tools Failures • Classifications • Specifications • Companies • Stock Rooms • Tool Rooms • Contacts • Documents • Bulletins • Access Groups • Members • Training Accounts • Hierarchical Classifications • Document Library / HTML • Document Editor • Access Groups / Security • Integrated Report Writer • Print, Email, or Fax Reports
Data Center Maintenance Tools Maintenance
Predictive Analysis Predictive Analysis • CFD Model • Based on RLU density assumptions 5 • CFD Model Updates • Operational baseline model based on real-time data used for assessing changing conditions 1 • Power and Cooling Infrastructure Design • Support infrastructure capacity and redundancy constraints designed and constructed in accordance with established key design criteria 4 • Real-Time Load Data • Actual loads compared to design rating provides capacity and redundancy planning data 2 3 • IT and Support Infrastructure Inventory • Device design load ratings established in coordination with facility constraints
Available DC Facility Lifecycle Tools • Data center lifecycle management tools presently fall into five classes: • Monitoring Tools • Network Management • Building Management • Data Center Monitoring • Asset Management Tools • CFD Modeling Tools • KVM Tool • Maintenance Management Tools • No single vendor encompasses all the attributes necessary for effective management • The available tool sets overlap on many of the attributes in delivering on their core capability
An IT-ready, scalable monitoring system which collects, organizes and distributes critical alerts, surveillance video and key information Provides a unified view of complex power, cooling, security and environmental infrastructure from anywhere on the network Available in three appliance platforms: up to 525 devices, up to 2,025 devices, up to 4,025 devices Real-time monitoring, user-defined reports and graphs and instant fault notification and escalation enable quick assessment and resolution of critical infrastructure events APC’s InfraStruXure Central
Monitors All equipment, all locations, via secure browser All performance data, continuously, in real-time All data, correlations, alerts, analytics, trends and reports Risk conditions, via auditable alarm workflow engine All system layers, for security, reliability and scaling Actionable management info, via dashboard and reports MODIUS’ OpenData Enterprise DCiM Measures Manages Notifies Integrates Informs
Nassoura Technology Associates’ dcTrack 2.0 • Data Center Visualization • View color-coded AutoCAD floor plans in real-time • Multi-site Support • Manage multiple data center sites • Asset Management • Track IT infrastructure assets • IP and Network Management • Manage logical relationships including VLANs
Nassoura Technology Associates’ dcTrack 2.0 (continued) • Servers and Host Devices • Track locations via elevation drawings as well as connectivity • Data Cabling • Track and manage all cable infrastructure, routes and ports • Data and Power Connectivity • Track connectivity as well as real-time power monitoring • Change Management • Maintain a defined change management process • Administration • Search, customization support and security
Nassoura Technology Associates’ dcTrack 2.0 (continued) Floor Plan IP Address and VLAN Elevations Real-Time Monitoring