在igraph中删除网络中的弱相关性(顶点和边) [英] Delete weak correlations from network in igraph (vertices and edges)

查看:226
本文介绍了在igraph中删除网络中的弱相关性(顶点和边)的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我需要根据相关矩阵绘制网络。
我的一小部分数据:

I need to plot a network from a correlation matrix. A small subset of my data:

Taxon                                                          CD1         CD2
Actinomycetaceae;g__Actinomyces                        0.072998825 0.031399459
Coriobacteriaceae;g__Atopobium                         0.040946468 0.002703265
Corynebacteriaceae;g__Corynebacterium                  0.002517201 0.006446247
Micrococcaceae;g__Rothia                               0.001174694 0.002703265
Porphyromonadaceae;g__Porphyromonas                    0.023326061 0.114368892
Prevotellaceae;g__Prevotella                           0.252894781 0.102308172
Flavobacteriaceae;g__Capnocytophaga                    0.001174694 0.029320025
Aerococcaceae;g__Abiotrophia                           0.002013761 0.003327095
Carnobacteriaceae;g__Granulicatella                    0.042960228 0.049490539
Gemellaceae;g__Gemella                                 0.027857023 0.067165731
Streptococcaceae;g__Streptococcus                      0.220506796 0.182782283
ClostridialesFamilyXI.IncertaeSedis;g__                0.000000000 0.000623830
ClostridialesFamilyXIII.IncertaeSedis;g__Mogibacterium 0.006880349 0.002495321
Lachnospiraceae;Other                                  0.000335627 0.000831774
Clostridia                                             0.004363148 0.002079434
Lachnospiraceae;g__Oribacterium                        0.003524081 0.002079434
Peptostreptococcaceae;g__Peptostreptococcus            0.000167813 0.005198586
Veillonellaceae;Other                                  0.001342507 0.001455604
Veillonellaceae;g__Veillonella                         0.047323376 0.082553545
Fusobacteriaceae;g__Fusobacterium                      0.009229737 0.010813059
Fusobacteriaceae;g__Leptotrichia                       0.092465179 0.076523186
Neisseriaceae;g__Neisseria                             0.013592885 0.027656477
Pasteurellaceae;g__Haemophilus                         0.014431952 0.092534831
SR1;c__;f__;g__                                        0.000000000 0.002079434
TM7;c__TM7-3;f__;g__                                   0.065782849 0.018299023
Erysipelotrichaceae;g__Bulleidia                       0.007551603 0.004366812
Bacteroidia                                            0.000000000 0.000415887
Porphyromonadaceae;g__Tannerella                       0.000671254 0.002079434
Flavobacteriaceae                                      0.002013761 0.001247661
Bacilli                                                0.002181574 0.002911208
Clostridia;f__;g__                                     0.000671254 0.002703265
ClostridialesFamilyXIII.IncertaeSedis;g__Eubacterium   0.003020641 0.002079434
Lachnospiraceae;g__Moryella                            0.003188454 0.000623830
Veillonellaceae;g__Selenomonas                         0.004866588 0.021834061
Fusobacteriaceae                                       0.000335627 0.001871491
Campylobacteraceae;g__Campylobacter                    0.001510321 0.001247661
Pasteurellaceae;g__Actinobacillus                      0.002852828 0.000207943
Burkholderiaceae;g__Lautropia                          0.000000000 0.002495321
Lactobacillaceae;g__Lactobacillus                      0.000000000 0.000000000
Staphylococcaceae;g__Staphylococcus                    0.000000000 0.000000000

这就是我所做的:

library(vegan)
library(psych)

mydata <- read.csv(file="L5_filt.txt", header=T, row.names=1, sep="\t")
mydata_t <- t(as.matrix(mydata))
graph.f<-graph.adjacency(cor.matrix$r, weighted=TRUE, mode="upper")

t.names <- colnames(cor.matrix)[as.numeric(V(t.graph)$name)]
graph.f = simplify(graph.f)

我只想绘制强相关性(> +0.6和< -0.6)
我想要相对于正相关和负相关的边缘颜色不同!

I want to plot only strong correlations (>+0.6 and <-0.6) I want different colors for edges relative to positive and negative correlations!

E(graph.f)[weight < 0.6 & weight > -0.6]$width<-0
E(graph.f)[weight > 0.6]$width<-2.5
E(graph.f)[weight < -0.6]$width<-2.5
E(graph.f)[weight > 0.6]$color<-"red"
E(graph.f)[weight < -0.6]$color<-"green"

par(mai=c(1,1,0.1,0.15), mar=c(1, 0, 1, 1), mgp=c(2,1,0), mfrow=c(1,2), cex=0.7, lwd=0.5)
plot (graph.f, vertex.size=5, vertex.shape="circle", vertex.label.color="red", 
    vertex.label=t.names, vertex.label.cex=0.9, layout=layout.fruchterman.reingold)

结果确实接近我想要的结果,但是我不知道如何从图中删除具有弱相关性的顶点(我的意思是与

The result is really near to what I want, but I don't know how to delete from the plot the vertices with weak correlations (I mean the nodes related to the edges for which I set width=0) and the names of these vertices.

如何修改代码?

谢谢!

推荐答案

如果我正确理解了您的问题...

If I understand your question correctly...

首先删除所有符合条件的边。

First delete all edges that match your condition.

然后删除所有邻居为零的顶点。

Then delete all vertices with zero neighbours.

使用随机数据的可重现示例,其中我不想绘制小于0.1的相关性:

Reproducible example with random data, where I don't want to plot correlations less than 0.1:

set.seed(999);mydata=matrix(runif(24),ncol=2)
rownames(mydata)=LETTERS[1:12]
g=graph.adjacency(cov(t(mydata)),weighted=TRUE)
plot(g)

这是一个包含12个顶点的完整图形。

that's a complete graph of 12 vertices.

g=delete.edges(g, which(E(g)$weight <=.1)) # here's my condition.
plot(g)

留下一个带有一些分离顶点的瘦图。

that leaves a skinny graph with a few detached vertices.

g=delete.vertices(g,which(degree(g)<1))
plot(g)

可以清理。

这篇关于在igraph中删除网络中的弱相关性(顶点和边)的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆