1 / 136

a brief introduction to Data Visualization

a brief introduction to Data Visualization. Jeffrey Heer Stanford University Joe Hellerstein UC Berkeley. The Instructors. Jeffrey Heer Asst Professor, Stanford Computer Science Joe Hellerstein Professor, UC Berkeley. Set A. Set B. Set C. Set D. Summary Statistics Linear Regression

lafleur
Download Presentation

a brief introduction to Data Visualization

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. a brief introduction toData Visualization Jeffrey HeerStanford University Joe HellersteinUC Berkeley

  2. The Instructors Jeffrey Heer Asst Professor, Stanford Computer Science Joe Hellerstein Professor, UC Berkeley

  3. Set A Set B Set C Set D Summary Statistics Linear Regression uX = 9.0 σX = 3.317 Y2 = 3 + 0.5 X uY = 7.5 σY = 2.03 R2 = 0.67 [Anscombe 73]

  4. Set B Set A Y Set D Set C Y X X

  5. 1826(?) Illiteracy in France, Pierre Charles Dupin

  6. cabspotting.org

  7. Wikipedia History Flow (IBM)

  8. Attention “What information consumes is rather obvious: it consumes the attention of its recipients. Hence a wealth of information creates a poverty of attention, and a need to allocate that attention efficiently among the overabundance of information sources that might consume it.” ~Herb Simon as quoted by Hal Varian Scientific American September 1995

  9. Tutorial Topics 1What makes visualizations effective? 2 Deconstruct and evaluate visualizations 3 Supporting interactivity and scale 4 Engaging web users in social data analysis

  10. How do we designeffective visualizations?

  11. Goals of visualization research 1Understand how visualizations convey information What do people perceive/comprehend? How do visualizations correspond with mental models? 2Develop principles and techniques for creating effective visualizations and supporting analysis Amplify perception and cognition Strengthen tie between visualization and mental models

  12. Visualization Reference Model Data Visual Form Task Raw Data Data Tables Visual Structures Views Data Transformations Visual Encodings View Transformations

  13. Nominal, Ordinal and Quantitative N - Nominal (labels) • Fruits: Apples, oranges, … O - Ordinal (rank-ordered) • Quality of meat: Grade A, AA, AAA Q - Interval (location of zero arbitrary) • Dates: Jan, 19, 2006; Location: (LAT 33.98, LONG -118.45) • Like a geometric point. Cannot compare directly • Only differences (i.e. intervals) may be compared Q - Ratio (zero fixed) • Physical measurement: Length, Mass, Temp, … • Counts and amounts • Like a geometric vector, origin is meaningful S. S. Stevens, On the theory of scales of measurements, 1946

  14. Example: U.S. Census Data People: # of people in group Year: 1850 – 2000 (every decade) Age: 0 – 90+ Sex: Male, Female Marital Status: Single, Married, Divorced, …

  15. Example: U.S. Census People Year Age Sex Marital Status 2348 data points

  16. All Marital Status 2000 Year 1990 1980 1970 60+ 40-59 Age 20-39 Sum along Marital Status 0-19 Single Married Divorced Widowed Sum along Age Marital Status All Ages All Years Sum along Year

  17. All Marital Status 2000 Year 1990 Roll-Up 1980 1970 60+ Drill-Down 40-59 Age 20-39 Sum along Marital Status 0-19 Single Married Divorced Widowed Sum along Age Marital Status All Ages All Years Sum along Year

  18. Visualization Reference Model Data Visual Form Task Raw Data Data Tables Visual Structures Views Data Transformations Visual Encodings View Transformations

  19. Jacques Bertin Visual language is a sign system Images perceived as a set of signs Sender encodes information in signs Receiver decodes information from signs Sémiologie Graphique, 1967

  20. C B A Bertin’s Semiology of Graphics • A, B, C are distinguishable • B is between A and C. • BC is twice as long as AB.  Encode quantitative variables "Resemblance, order and proportion are the three signifieds in graphics.” - Bertin

  21. Visual encoding variables Position (x 2) Size Value Texture Color Orientation Shape

  22. Visual encoding variables Position Length Area Volume Value Texture Color Orientation Shape Transparency Blur / Focus …

  23. Playfair 1786

  24. Playfair 1786 x-axis: year (Q) y-axis: currency (Q) color: imports/exports (N, O)

  25. Wattenberg 1998 http://www.smartmoney.com/marketmap/

  26. Wattenberg 1998 rectangle size: market cap (Q) rectangle position: market sector (N), market cap (Q) color hue: loss vs. gain (N, O) color value: magnitude of loss or gain (Q)

  27. Minard 1869: Napoleon’s march

  28. Single axis composition + =

  29. Mark composition y-axis: temperature (Q) + x-axis: longitude (Q) / time (O) = temp over space/time (Q x Q)

  30. Mark composition y-axis: longitude (Q) + x-axis: latitude (Q) + width: army size (Q) = army position (Q x Q) and army size (Q)

  31. longitude (Q) latitude (Q) army size (Q) temperature (Q) latitude (Q) / time (O)

  32. Minard 1869: Napoleon’s march Depicts at least 5 quantitative variables. Any others?

  33. Graphical Perception

  34. Which best encodes quantities? Position Length Area Volume Value (Brightness) Color Hue Orientation (Angle) Shape

  35. Detecting Brightness Which is brighter?

  36. Detecting Brightness (128, 128, 128) (144, 144, 144) Which is brighter?

  37. Detecting Brightness Which is brighter?

  38. Detecting Brightness (134, 134, 134) (128, 128, 128) Which is brighter?

  39. Just Noticeable Difference JND (Weber’s Law) Ratios more important than magnitude Most continuous variation in stimuli perceived in discrete steps

  40. Steps in font size Sizes standardized in 16th century aa aaaa a a a a a a a a a a 6 7 8 9 10 11 12 14 16 18 21 24 36 48 60 72

  41. Information in color and value Value is perceived as ordered • Encode ordinal variables (O) • Encode continuous variables (Q) [not as well] Hue is normally perceived as unordered  Encode nominal variables (N) using color

  42. Compare area of circles

  43. Steven’s Power Law p < 1 : underestimate p > 1 : overestimate [graph from Wilkinson 99, based on Stevens 61]

  44. Graduated sphere map

  45. Apparent magnitude scaling [Cartography: Thematic Map Design, Figure 8.6, p. 170, Dent, 96] S = 0.98A0.87 [from Flannery 71]

  46. How many 3’s 1281768756138976546984506985604982826762 9809858458224509856458945098450980943585 9091030209905959595772564675050678904567 8845789809821677654876364908560912949686 [based on slide from Stasko]

  47. How many 3’s 1281768756138976546984506985604982826762 9809858458224509856458945098450980943585 9091030209905959595772564675050678904567 8845789809821677654876364908560912949686 [based on slide from Stasko]

More Related