SOCNET is a service of INSNA, the professional association for social
network researchers (http://www.insna.org). To unsubscribe, send
an email message to
I thought many of you on this list would find of great interest our latest paper out today, which represents one of the first pilot large-scale content analyses of JSTOR, DTIC, and the Internet Archive. The hope is that this paper will serve as a blueprint and template for others and inspire, seed, and enable a new wave of large-scale internet and literature content analysis research and to open the door to new disciplinary applications like socio-cultural and area studies work.
For those interested in working with academic literature collections like JSTOR, government document repositories like DTIC, or the open web via the Internet Archive, this paper provides a blueprint for how to work with the collections, their nuances, artifacts, and strengths, lessons learned (for example how to work with the Internet Archive's 1.6-billion PDF archive in the absence of fulltext search or metadata), and general workflows.