1 / 16

Importing and Exporting DataShop Data pslcdatashop

Importing and Exporting DataShop Data http://pslcdatashop.org Slides current to DataShop version 4.1.8. Brett Leber Interaction Designer. Is your data right for DataShop?. It might be if it… was produced by an intelligent tutoring system

kelvin
Download Presentation

Importing and Exporting DataShop Data pslcdatashop

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Importing and Exporting DataShop Data http://pslcdatashop.org Slides current to DataShop version 4.1.8 • Brett Leber • Interaction Designer

  2. Is your data right for DataShop? It might be if it… • was produced by an intelligent tutoring system • follows a student action, tutor response sequence (untutored actions OK) • is primarily textual • encodes some notion of “steps” What kind of data do you have?

  3. Benefits of importing your data DataShop offers: • Web-based visualization and analysis tools for exploring your data • Secure storage and backup • A location on the web where anyone you want can access your data • Web services for programmatic access

  4. How do I get data in? • Directly/Real-time • Some tutors are logging directly to the PSLC logging database • CTAT-based tutors (when configured correctly), can log to disk or to the logging database over the internet • Indirectly • Other tutors are logging to their own file formats or their own databases • These data require a conversion process • Many studies are in this category

  5. XML vs. tab-delimited format XML • Richer description than tab-delimited • More fields • Problem start time • Problem description • Problem tutor flag • More verbose • Requires some familiarity with XML • Not especially readable Tab-delimited • More concise • Can edit in Excel • More easily shareable • Less rich than XML • Missing problem start time, description, and tutor flag

  6. Tutor Message Format <context_message context_message_id="02CE3AE5-F6D5-9177-913F-C34730F1096C" name="START_PROBLEM"> • <meta> • <user_id>student01</user_id> • <session_id>08xz013</session_id> • <time>2010/02/22 06:43:47.002</time> • <time_zone>US/Eastern</time_zone> • </meta> <dataset> <name>Learn a Language Fall 2007</name> <level type="unit"> <name>Learning Logging</name> <problem><name>Translating Tech Talk</name></problem> </level> </dataset> </context_message>

  7. Tutor Message Format • <tool_message • context_message_id ="02CE3AE5-F6D5-9177-913F-C34730F1096C"> • <meta> • <user_id>student01</user_id> • <session_id>08xz013</session_id> • <time>2010/02/22 06:45:48.014</time> • <time_zone>US/Eastern</time_zone> • </meta> • <semantic_event • transaction_id="B503948-9164-DD83-EBB2-1589FD38D435" • name="ATTEMPT" /> • <event_descriptor> • <selection>_level0.VideoPlayerInstance1.sliderButtonName</selection> • <selection type="media_file">mymovie.flv</selection> • <selection type="clip_length">00:08:00.0</input> • <action>cue</action> • <input type="start_cue">00:04:34.8</input> • <input type="stop_cue">00:05:42.2</input> • </event_descriptor> • </tool_message>

  8. Tutor Message Format • <tutor_message • context_message_id ="02CE3AE5-F6D5-9177-913F-C34730F1096C"> • <meta> • <user_id>student01</user_id> • <session_id>08xz013</session_id> • <time>2010/02/22 06:43:56.367</time> • <time_zone>US/Eastern</time_zone> • </meta> • <semantic_event • transaction_id="B503948-9164-DD83-EBB2-1589FD38D435" • name="RESULT" /> • <event_descriptor> • <selection>_level0.VideoPlayerInstance1.sliderButtonName</selection> • <selection type="media_file">mymovie.flv</selection> • <selection type="clip_length">00:08:00.0</input> • <action>cue</action> • <input type="start_cue">00:04:34.8</input> • <input type="stop_cue">00:05:42.2</input> • </event_descriptor> • <action_evaluation>INCORRECT</action_evaluation> • <tutor_advice>Your answer is not correct. Select only the portion of the video where the man it talking about his family.</tutor_advice> • <skill> • <name>family_words</name> • <category>video_portion_selection</category> • </skill> • </tutor_message>

  9. Same thing in tab-delimited And so on

  10. Tools: XML vs. tab-delimited format XML • Java Logging Library • Log in XML to disk or to a logging server • http://pslcdatashop.org/about/libraries.html • Flash Logging Library • Log to a logging server • http://ctat.pact.cs.cmu.edu/index.php?id=logging-flash • Build a tutor with CTAT without programming • Can log to disk or to a logging server • http://ctat.pact.cs.cmu.edu • Convert to XML via your own program • Transform existing log data into valid Tutor Message Format • Validate your XML with a tool we’ve created • http://pslcdatashop.web.cmu.edu/xmlvalidator.html Tab-delimited • DataShop Import Tool • Verify your import file with our Verification Tool • http://pslcdatashop.web.cmu.edu/importverify.html

  11. Documentation For XML: • Guide to the Tutor Message Format:http://pslcdatashop.org/dtd/guide/ For tab-delimited format: • http://pslcdatashop.org/about/importverify.html To learn about terminology: • http://pslcdatashop.org/help?page=terms To learn about existing DataShop output formats: • http://pslcdatashop.org/help?page=export

  12. Case Study: Chinese Writing Study Fall 2009 http://www.learnlab.org/research/wiki/index.php/Perfetti_-_Read_Write_Integration • Researchers presented the DataShop team with their data, which was a tabular format unlike the DataShop format. • DataShop team consulted with the research team to see which DataShop-required fields were missing and which new fields were extra. • DataShop team and researchers arrived at definitions of problems, steps, and knowledge components. • DataShop requires a correct/incorrect tagging of each attempt, so correctness was determined by a threshold (eg, 0.5) • DataShop consultant (Alida) wrote a converter to convert from this tabular format to XML, and imported into DataShop.

  13. Future of importing and the format • Push-button import • Richer, more-flexible format • Multimedia (audio) • Dialogue data

  14. Exporting from DataShop • From the website: • By transaction • By student-step • By student-problem • From web services: • By transaction • By student-step

  15. Exporting from DataShop • Log in to the web application. • Choose a dataset. • Click “Export” tab. • Choose a level of granularity (transaction, step, or problem). • Choose a sample. • Click export button. Tip: “All Data” sample is cached for transaction export, so choosing that sample results in fastest export.

  16. Questions?

More Related