Here is a summary of semantic network analysis and related approaches
that have addressed the underlying issue you are getting at, each with
its own operationalizations:

The essence of semantic network analysis is rather
straightforward (Danowski, 1988). Text is analyzed to determine some measure
of the extent to which words are related, which indicates something about
their meaning.  One measure of this relationship is the extent to which word
pairs co-occur within a given meaning unit.  Then, this measure of
relatedness across a set of words is used to group, cluster, or scale the
words (or some subset, such as the more frequently used words).  These
clusters can be interpreted directly, used to derive more quantitative
measures for other analyses, or used as a basis for formal content analysis.
Network approaches have been applied to the study of semantic memory and
association processes (Chang, 1986; Collins & Quillian, 1969; Flores-d'Arcais
& Schreuder, 1987), information retrieval algorithms and systems (Savoy,
1992), citation analysis (Callon, Courtial, Turner, & Bauin, 1983; Danowski
& Martin, 1979; Lievrouw, Rogers, Lowe, & Nadel, 1987; Rice & Crawford,
1992), content analysis of traditional and CMC media (Cuilenburg,
Kleinnijenhuis, & de Ridder, 1986; Danowski, 1982), and responses to
open-ended survey questions (Carley & Palmquist, 1992; Rice & Danowski,
1993). Semantic network analysis has also been applied to understanding
the positioning of candidates and issues in presidential debates (Doerfel &
Marsh, 2003) and the structure of interests in the International
Communication Association (Doerfel & Barnett, 1999), among other topics.
These and other prior studies provide the underlying arguments about
representing cognition and meaning through content associations.
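
To make the co-occurrence idea concrete, here is a minimal sketch in Python. It is only an illustration under my own assumptions, not the procedure of any of the studies cited above: each string is treated as one "meaning unit", words are split on whitespace, and the function name and the min_freq parameter are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(units, min_freq=1):
    """Count how many meaning units each word pair appears in together.

    Assumption: a "meaning unit" is simply one string in `units`, and
    tokenization is a plain lowercase whitespace split.
    """
    tokenized = [set(u.lower().split()) for u in units]
    # Number of units each word appears in (used to keep frequent words only).
    freq = Counter(w for words in tokenized for w in words)
    keep = {w for w, n in freq.items() if n >= min_freq}
    pairs = Counter()
    for words in tokenized:
        # Sort so that each pair is always stored in the same order.
        for a, b in combinations(sorted(words & keep), 2):
            pairs[(a, b)] += 1
    return pairs


units = [
    "network analysis of text",
    "semantic network of words",
    "text analysis",
]
pairs = cooccurrence_counts(units, min_freq=2)
for (a, b), n in sorted(pairs.items()):
    print(a, b, n)
```

The resulting pair counts form a weighted word-by-word network that could then be clustered or scaled, as described above.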

The following are some references I retrieved from the Ingenta, using author
> Dear All,
> I'm carrying out a social network study on language use in an electronic
> community. Data are drawn from the email archive containing all the
> messages exchanged among community members.
> The study hypothesizes a positive relationship between network similarity
> and  similarity in language use.
> I organized my data as follows:
> -First, I divided the archive into 60 topic-specific email subsets (groups
> of emails on the same topic);
> -Second, for each of the 60 email subsets I built a two-mode matrix (ROWS
> = community members who sent or received at least one email on that
> topic; COLUMNS = emails sent on that topic).
> -Third, I computed with UCINET VI a measure of similarity among the columns
> of those 60 two-mode matrices. So, I got 60 square mail-x-mail matrices,
> where xij = value of network similarity between email i and email j. I
> call those matrices "Network-Similarity Matrices".
> -Fourth, I have another 60 square mail-x-mail matrices (one for each
> topic-specific subset), where xij = value of similarity between the TEXT
> of email i and the TEXT of email j. I call those matrices "Text-Similarity
> Matrices".
> Now, in order to test the relationship hypothesized above, I would like to
> do the following:
> -Build two block-diagonal matrices. The first should have all the
> "network similarity matrices" on the diagonal and structural zeros
> elsewhere. The second should be built the same way, with the "text
> similarity matrices" on the diagonal.
> -Run a QAP regression using those two big block-diagonal matrices as inputs.
> May I kindly ask your opinion on these last two steps of my analysis? Do
> they make sense to you and what kind of weaknesses do you notice? Do you
> know other studies adopting a similar approach?
> Thanks a lot,
> Best Regards,
> José De Fatima Garrois
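
The last two steps in the message above can be sketched roughly as follows. This is a hedged illustration, not UCINET's procedure: it computes a QAP-style correlation rather than a full QAP regression, it permutes email labels only within each topic block (my assumption, so that structural zeros never mix with real cells), and all function names are hypothetical.

```python
import random


def pearson(x, y):
    """Plain Pearson correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def block_diagonal(blocks):
    """Place square matrices on the diagonal, zeros everywhere else."""
    size = sum(len(b) for b in blocks)
    big = [[0.0] * size for _ in range(size)]
    offset = 0
    for b in blocks:
        for i, row in enumerate(b):
            for j, v in enumerate(row):
                big[offset + i][offset + j] = v
        offset += len(b)
    return big


def within_block_cells(blocks, perms=None):
    """Flatten the off-diagonal cells of every block, optionally after
    relabelling the rows and columns of block k with perms[k]."""
    cells = []
    for k, b in enumerate(blocks):
        n = len(b)
        p = perms[k] if perms else list(range(n))
        cells.extend(b[p[i]][p[j]] for i in range(n) for j in range(n) if i != j)
    return cells


def qap_correlation(net_blocks, text_blocks, n_perm=1000, seed=0):
    """Observed correlation plus a one-sided permutation p-value."""
    rng = random.Random(seed)
    y = within_block_cells(text_blocks)
    observed = pearson(within_block_cells(net_blocks), y)
    hits = 0
    for _ in range(n_perm):
        # Shuffle email labels separately inside each topic block.
        perms = [rng.sample(range(len(b)), len(b)) for b in net_blocks]
        if pearson(within_block_cells(net_blocks, perms), y) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Toy data: two topics, with 3 and 2 emails respectively.
net_blocks = [[[0, 1, 2], [1, 0, 3], [2, 3, 0]], [[0, 5], [5, 0]]]
text_blocks = [[[0, 1, 2], [1, 0, 3], [2, 3, 0]], [[0, 5], [5, 0]]]
big_net = block_diagonal(net_blocks)
observed, p_value = qap_correlation(net_blocks, text_blocks, n_perm=200)
print(observed, p_value)
```

The sketch builds the big block-diagonal matrix only to show its shape. The test itself uses just the within-block cells, because the off-block structural zeros carry no information about the hypothesized relationship.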
