使用R计算ggplot计算重叠密度图的面积 [英] calculate area of overlapping density plot by ggplot using R

查看:413
本文介绍了使用R计算ggplot计算重叠密度图的面积的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我怎样才能得到重叠密度曲线下的区域?

我如何用R解决问题? (这里有一个python的解决方案:



现在我们可以整合以获得重叠区域。



<$ p $ (MDensity,40,Intersect)$ value +
integrate(FDensity,Intersect) ,80)$ value
[1] 0.2952838


How can I get the area under overlapping density curves?

How can I solve the problem with R? (There is a solution for python here: Calculate overlap area of two functions )

set.seed(1234)
df <- data.frame(
  sex=factor(rep(c("F", "M"), each=200)),
  weight=round(c(rnorm(200, mean=55, sd=5),
                 rnorm(200, mean=65, sd=5)))
  )

(Source: http://www.sthda.com/english/wiki/ggplot2-density-plot-quick-start-guide-r-software-and-data-visualization )

ggplot(df, aes(x=weight, color=sex, fill=sex)) + 
 geom_density(aes(y=..density..), alpha=0.5)

"The points used in the plot are returned by ggplot_build(), so you can access them." So now, I have the points, and I can feed them to approxfun, but my problem is that i don't know how to subtract the density functions.

Any help greatly appreciated! (And I believe in high demand, there is no solution for this readily available.)

解决方案

I will make a few base R plots, but the plots are not actually part of the solution. They are just there to confirm that I am getting the right answer.

You can get each of the density functions and solve for where they intersect.

##  Create the two density functions and display
FDensity = approxfun(density(df$weight[df$sex=="F"], from=40, to=80))
MDensity = approxfun(density(df$weight[df$sex=="M"], from=40, to=80))
plot(FDensity, xlim=c(40,80), ylab="Density")
curve(MDensity, add=TRUE)

Now solve for the intersection

## Solve for the intersection and plot to confirm
FminusM = function(x) { FDensity(x) - MDensity(x) }
Intersect = uniroot(FminusM, c(40, 80))$root
points(Intersect, FDensity(Intersect), pch=20, col="red")

Now we can just integrate to get the area of the overlap.

integrate(MDensity, 40,Intersect)$value + 
    integrate(FDensity, Intersect, 80)$value
[1] 0.2952838

这篇关于使用R计算ggplot计算重叠密度图的面积的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆