280 likes | 426 Views
Bibliometric Analysis with Sci2: Choose Your Own Adventure. Laura Ridenour School of Library and Information Science, Indiana University. Overview. P aper citation network using Sci2 Gathering data Examining and understanding file format Adjusting program parameters for speed
E N D
Bibliometric Analysis with Sci2:Choose Your Own Adventure Laura Ridenour School of Library and Information Science, Indiana University
Overview • Paper citation network using Sci2 • Gathering data • Examining and understanding file format • Adjusting program parameters for speed • Using the Network Analysis Toolkit • Visualizing a network • Exporting data for use in other programs
Gathering Data • ISI file format • Text file with special formatting • Downloadable from Web of Science • 500 records at a time from WoS • Other sources include the Scholarly Database (http://sdb.cns.iu.edu/search/), but data from this source works slightly differently than shown in this tutorial • Can also use any data with nodes and edges, as long as it’s a clean data set
Required tools • Sci2 - https://sci2.cns.iu.edu/user/welcome.php • Must create an account in order to download • A text editor • Helpful: • Oxygen, Gephi
Allocating Additional Memory • In the Sci2 directory, open the .ini file for Sci2 and change the settings to the following: • -vmargs • -Xms30m • -Xmx1028m • Save and close the file, re-open Sci2 • Note: you may need to adjust the memory parameters slightly depending on system capabilities and settings
Using Multiple ISI Files • Strip out all headers but the beginning and end tag – can do this manually, or with a script
Sci2 • File > Load > (select your file and click ‘Open’) > choose ‘ISI flat format’ > click ‘Select’
In the Data Manager • Right-click the loaded file, select “view” to check and/or modify the file • If you modify the file, be sure to save the file to a permanent location and reload it into the data manager
Using the Network Analysis Toolkit (NAT) • Choose “Network Analysis Toolkit”
Examining the Output • Checking the NAT results can save time. If the network is particularly densely connected, using a faster system may save time
Visualization • Select the extracted network to visualize from the Data Manager
Visualization (cont’d) • Visualization > Networks > GUESS
Choosing a layout (GUESS) • Layout > GEM (or whichever layout you would like to use)
Play!! • Take some time to play with different properties in the network. Feel free to use the interpreter,it uses jython(examples on next slide, from CNS wiki)
Exporting Data • From the Data Manager, right click the appropriate graph file and select “Save”
Exporting Data (cont’d) • Choose GraphML for importing a graph into Gephi • After saving, change the file extension from .XML to .graphml to make the file usable by Gephi • Choose .net for Pajek
Additional Resources • http://wiki.cns.iu.edu/ • http://gephi.org/