***** To join INSNA, visit http://www.insna.org ***** <<<-------- Jeff Yan-------->>> > We are researching the questioning and replying relation in a big Internet > software technology forum. We have access to the log files of the forum, > and can get information of each question discussed. > > The dataset is huge. Over 100,000 questions were posted in the forum > during > past three months and several hundards thousands people participated in > the discussion. > > We will get a txt file, containing the long list of question ID, sender, > and repliers. For each question generally there are 5 to 30 repliers. > The information of one question is recorded in one row. > > Our problem is we cannot find a way to convert this dataset into a > network matrix that can be analyzed by Pajek. Are there any computer > program which can help to create and manipulate huge network matrix? > Anybody who has experience? > > We would appreciate and acknowledge any suggestions and relevant work or > references. You should use sparse network descriptions - list of existing arcs/edges or lists of neighbors. For Pajek see page 8 in http://uk.cambridge.org/catalogue/catalogue.asp?isbn=0521602629 or slides 20, 21, 24 in http://vlado.fmf.uni-lj.si/pub/networks/doc/seminar/nicta01.pdf You don't need to specify the coordinates of vertices. For a conversion program see http://vlado.fmf.uni-lj.si/pub/networks/pajek/howto/text2pajek.htm Vlado -- Vladimir Batagelj, University of Ljubljana, Department of Mathematics Jadranska 19, PO Box 2964, 1111 Ljubljana, Slovenia http://vlado.fmf.uni-lj.si _____________________________________________________________________ SOCNET is a service of INSNA, the professional association for social network researchers (http://www.insna.org). To unsubscribe, send an email message to [log in to unmask] containing the line UNSUBSCRIBE SOCNET in the body of the message.