The following code creates a adjacency matrix. For the example, students were asked to name up to five other students in their class that they considered to be friends. We start with a dataset that contains an id number for each case, as well as five variables that contain the id numbers of the friends each student identified. The code produces a square matrix with id numbers as row and column headings. The rows are which student was nominating friends, the columns are for nominees. A one indicates that the student for that row named the student for that column as a friend.
clear
input id friend1 friend2 friend3 friend4 friend5
44006 45611 55610 74158 55898 .
45611 44006 45623 45621 74158 55898
45621 71643 45611 . . .
45623 59642 71643 45611 73452 55610
55610 45623 45611 44006 55898 71643
55898 74158 55610 45621 . .
59642 55898 71643 45621 45611 74158
71643 55898 73452 59642 45611 44006
73452 71643 45623 . . .
74158 55898 45611 59642 45621 44006
end
* place variables to be in the resulting matrix in one matrix
* you will need to replace friend* with the appropriate variable names
mkmat friend*, mat(F)
* place the case id in another matrix
* you will need to replace id with the name of your id variable
mkmat id, mat(id)
* get number of rows in dataset
scalar define n = _N
* create fid, which is each case's row in the dataset, the data must remain
* (or be sorted) in the order defined by this fid
gen fid = _n
mata
/* get matrices from Stata into Mata*/
a=st_matrix("F")
id = st_matrix("id")
/* make an n by n matrix of zeros, where n is number of rows in dataset (cases)*/
b = J(st_numscalar("n"),st_numscalar("n"), 0)
/* loop through the rows and columns of the matrix b, setting the value in a cell
equal to 1 if the two cases are associated */
for(i = 1; i <= rows(a); i++) {
for(j = 1; j<=cols(a); j++) {
for(k=1; k<=rows(id); k++) {
if (st_data(k,"id")==a[i,j]) {
b[i, k]=1
}
}
}
}
/* add id as first column */
c = id,b
/* add id as first row with a dot in front */
c = (.\id)'\c
/* make this combined matrix accessible in Stata as "adj" */
st_matrix("adj", c)
/* exit mata */
end
mat list adj
/* matrix adj should look like this
adj[11,11]
c1 c2 c3 c4 c5 c6 c7 c8 c9 c10 c11
r1 . 44006 45611 45621 45623 55610 55898 59642 71643 73452 74158
r2 44006 0 1 0 0 1 1 0 0 0 1
r3 45611 1 0 1 1 0 1 0 0 0 1
r4 45621 0 1 0 0 0 0 0 1 0 0
r5 45623 0 1 0 0 1 0 1 1 1 0
r6 55610 1 1 0 1 0 1 0 1 0 0
r7 55898 0 0 1 0 1 0 0 0 0 1
r8 59642 0 1 1 0 0 1 0 1 0 1
r9 71643 1 1 0 0 0 1 1 0 1 0
r10 73452 0 0 0 1 0 0 0 1 0 0
r11 74158 1 1 1 0 0 1 1 0 0 0
*/
preserve
set obs `=_N+1'
svmat adj, names(col)
* you may want to change temp.txt to another file name, you may also
* want to add a file path
outsheet c* using temp.txt, replace nonames
restore
The file temp.txt will contain the following matrix. Note that the rows are occupied by cases (as they were in the original dataset), while the columns correspond to the id numbers listed in the variables friend1-friend5. The matrix is not necessarily symmetric.
44006 45611 45621 45623 55610 55898 59642 71643 73452 74158 44006 0 1 0 0 1 1 0 0 0 1 45611 1 0 1 1 0 1 0 0 0 1 45621 0 1 0 0 0 0 0 1 0 0 45623 0 1 0 0 1 0 1 1 1 0 55610 1 1 0 1 0 1 0 1 0 0 55898 0 0 1 0 1 0 0 0 0 1 59642 0 1 1 0 0 1 0 1 0 1 71643 1 1 0 0 0 1 1 0 1 0 73452 0 0 0 1 0 0 0 1 0 0 74158 1 1 1 0 0 1 1 0 0 0
