1 / 29

BIG Data

BIG Data. Presented: January 2013. Together we build the right solution. Agenda. Introductions. Dennis J Perlot: Founder & CTO, Theia Solutions Over 25 years experience providing award winning, innovative IT solutions Smithsonian Innovators Award Global Innovation Award

vevina
Download Presentation

BIG Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. BIG Data Presented: January 2013 Together we build the right solution

  2. Agenda

  3. Introductions Dennis J Perlot: Founder & CTO, Theia Solutions • Over 25 years experience providing award winning, innovative IT solutions • Smithsonian Innovators Award • Global Innovation Award • Artificial Intelligence/Machine Learning • Technology Community Advocate • Speaker/ Technology Evangelist Megan Cocuzzo: Director, Business Intelligence • Over 15 years experience leveraging “BIG DATA” to deliver innovative financial and resource optimization strategies and tools • Financial Planning & Analysis • Capacity & Resource Planning • Opportunity and Risk Assessment • Capital Funding • Six Sigma Black Belt Professional • ISO 9000 Quality System Auditor

  4. Theia Solutions LLC “Together we build the right solution” • Socially responsible technology services • Application Development • Data Optimization • Cloud Hosting • Data Analytics • Why Theia? • We put people first • Partnerships not just contracts • Innovative solutions

  5. What is “Big Data”? • Data sets that can not be processed with traditional tools such as relational databases, requiring “massively parallel” approaches. • What is considered "big data" varies depending on the organization and the applications that are used to process and analyze the data set in its domain. • Traditional tools can not handle the 3 V’s: A visualization created by IBM of Wikipedia edits. At multiple terabytes in size, the text and images of Wikipedia are a classic example of big data.

  6. The data explosion!!!

  7. Just how BIG… • 1000 Megabytes = 1 Gigabyte • 1000 Gigabytes = 1 Terabyte • 1000 Terabytes = 1 Petabyte [where most corporations are] • 1000 Petabytes = 1 Exabyte • 1000 Exabytes = 1 Zettabyte [where Facebook and Google are] • 1000 Zettabytes = 1 Yottabyte • 1000 Yottabytes = 1 Brontobyte

  8. Where does it come from? • Web logs and blogs • eCommerce • Mobile - 4.5 billion phones • Sensors – temp, vibration, etc. • Smartphones • 400 million worldwide • Over 50% of US cell users

  9. How fast is it generated? • eCommerce – 56 million plus transactions in Q3 2012 • RFID – location reporting • Large Hadron Collider: 700MB to 1 TB per second • Cell phone location tracking • Must consider data in motion vs. data at rest

  10. Big Data Trends and Tools

  11. Consider the electricity model • Do you build a power plant? • Do you run wires to your home? • Do you buy transformers, etc. • Let someone else worry about all that and just pay for what you use. • This is cloud computing • Pay for what you use • Rapid elasticity • Location transparent resources

  12. Cloud Offerings • Infrastructure as a Service (IaaS) “… servers, servers, get your servers here” • Platform as a Service (PaaS) “… just give me a place for my application and data” • Software as a Service (SaaS) “… like Salesforce.com”

  13. On-Premises Separation of Responsibilities Infrastructure (as a Service) Software (as a Service) Platform (as a Service) You manage Applications Applications Applications Applications You manage Data Data Data Data You manage Runtime Runtime Runtime Runtime Middleware Middleware Middleware Middleware Other Manages Other Manages O/S O/S O/S O/S Virtualization Virtualization Virtualization Virtualization Other Manages Servers Servers Servers Servers Storage Storage Storage Storage IaaS PaaS SaaS Networking Networking Networking Networking

  14. Is it Secure? • Microsoft Azure Platform • SAS 70 Type 1 and Type 2 (now SSAE 16) • ISO 27001 • Safe Harbor • HIPPA • SOX • PCI DSS • Over 250 internal controls • More guards than engineers at most facilities

  15. Who is using the cloud today? Who is NOT using the cloud today……..

  16. For your information…. • 1 billion: Windows Live ID authentications each day • 3 to 4 billion: junk emails filtered daily • 2 billion: queries each month on Bing • 100 million plus: Windows Update users • 6 Regional Data Centers : 2 each in US, Europe, Asia • 400,000 plus: square footage in each datacenter

  17. Azure Data Centers

  18. What the scoop? • Breaks problem down into smaller “chunks” • Why is it called Hadoop? • Doug Cutting was trying to think of a name for his “map reduce” system • His son said “Why don’t you name it after my toy elephant?

  19. Comparison Hadoop Cluster Traditional Data Center

  20. Who is using? • Amazon/A9 • Facebook • Fox interactive media • Google • IBM’s Watson • New York Times • J.P. Morgan • Rackspace • eBay • Yahoo! • More at http://wiki.apache.org/hadoop/PoweredBy

  21. Next Steps & Recommendations • Monitor Hadoop in marketplace • Revise thinking on problems “Why not record every mouse click?” “If we capture it, we can process it” • Think about “recommender” apps • More is better!

  22. Who Are They? • Computer skills • Understands Relational Databases • Write SQL queries • Linking internal and external data • Statistics skills • Design “experiments” • Create analytical models • Top Job on LinkedIn

  23. Why BIG Data Matters and the importance of data agility The next frontier for innovation, competition, and productivity

  24. How is BIG data creating value?

  25. How is BIG data creating value?

  26. Theia Solutions LLC Data Analytics Offerings • Our process begins with an end to end assessment and documentation of your current capabilities and data structure • No two organizations are alike • No two data sets are alike • We partner with you to develop a data strategy to exceed your goals in the form of a strategic roadmap • The key drivers to operational health vary as do the regulatory and compliance needs of each organization in each market/sector

  27. Theia Solutions LLC Data Analytics Offerings

  28. Theia Solutions LLC • So, no matter what your need, Theia Solutions can help you get there • Experienced, agile, specialized teams • Innovative Ideas, Old School Values • Long Terms Partnership with Clients

  29. Questions www.TheiaSolutionsLLC.com Dennis.Perlot@theiasolutionsllc.com Megan.Cocuzzo@theiasolutionsllc.com

More Related