透视变形矩形的比例 [英] proportions of a perspective-deformed rectangle

查看：311 发布时间：2016/12/26 10:34:19 image-processing geometry computer-vision reverseprojection projective-geometry

本文介绍了透视变形矩形的比例的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

假设由透视扭曲的矩形的2d图片：

我知道形状最初是一个矩形，但我不知道它的原始大小。

如果我知道这个图片中的角的像素坐标，我如何计算原始比例，即矩形的商（宽度/高度）？

（背景：目标是自动解除矩形文档的照片，边缘检测可能通过hough变换完成）

更新：

对于是否可能根据给定的信息来确定宽度：高度比率，已经进行了一些讨论。我天真的想法是，它必须是可能的，因为我可以想到没有办法，例如1：4矩形投影到上面描述的四边形。该比率显然接近1：1，因此应该有一种方法来确定它的数学。但我没有任何证据，这超出了我的直觉的猜测。

我还没有完全理解下面提出的论点，但我认为必须有一些隐含的假设，

然而，经过几个小时的搜索，我终于找到了一些与这个问题相关的论文。
我很难理解在那里使用的数学，到目前为止没有成功。特别是第一篇论文似乎正在讨论我想做什么，不幸的是没有代码示例和非常密集的数学。

Zhengyou Zhang ，Li-Wei He，Whiteboard scanning and image enhancement
http://research.microsoft.com/en-us/um/people/zhang/papers/tr03-39.pdf p.11

由于透视失真，矩形的图像看起来是一个四边形，但是，由于我们知道它是一个空间矩形，我们能够估计相机的焦距和矩形的宽高比。

ROBERT M. HARALICK从透视投影确定相机参数一个矩形
http://portal.acm.org/citation.cfm ？id = 87146

我们将展示如何使用3D未知大小和位置的矩形的2D透视投影空间以确定相对于矩形的计划的相机视角参数。

这是我尝试回答我的问题。 -Wei He，
白板扫描和图像增强 http://research.microsoft.com/en-us/um/people/zhang/papers/tr03-39.pdf

我在SAGE中操作了一段时间的方程式，并得到了c-style中的这个伪代码：

  
 //如果有问题：GPLv2或更高版本授权
 // legend：
 // sqr（x）= x * x 
 sqrt（x）= x的平方根
 
 //让m1x，m1y ... m4x，m4y是（x，y）像素坐标
 //检测到的四边形
 // ie（m1x，m1y）是第一个角的坐标，
 //（m2x，m2y）的第二个角等。 
 // let u0，v0是图像主点的像素坐标
 //对于普通相机，这将是图像的中心，
 // ie u0 = IMAGEWIDTH / 2; v0 = IMAGEHEIGHT / 2 
 //如果图像被不对称地裁剪，则此假设不成立
 
 //首先，转换图像，使主点为（0,0）
 //这使得以下方程更容易
 m1x = m1x  -  u0; 
 m1y = m1y  -  v0; 
 m2x = m2x  -  u0; 
 m2y = m2y  -  v0; 
 m3x = m3x-u0; 
 m3y = m3y  -  v0; 
 m4x = m4x  -  u0; 
 m4y = m4y  -  v0; 
 
 
 //临时变量k2，k3 
 double k2 =（（m1y  -  m4y）* m3x  - （m1x  -  m4x）* m3y + m1x * m4y  -  m1y * m4x ）/ 
（（m2y-m4y）* m3x-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x） 
 
 double k3 =（（m1y-m4y）* m2x-（m1x-m4x）* m2y + m1x * m4y-m1y * m4x）/ 
 （m3x-m4x）* m2y + m3x * m4y-m3y * m4x）; 
 
 // f_squared是摄像机的焦距，平方
 //如果k2 == 1或者k3 == 1那么这个方程是不可解的
 //如果焦距已知，则不需要该方程
 //在这种情况下赋值f_squared = sqr（focal_length）
 double f_squared = 
  - （（k3 * m3y  -  m1y）* * m2y-m1y）+（k3 * m3x-m1x）*（k2 * m2x-m1x））/ 
（（k3-1）*（k2-1） 
 
 //原始矩形的宽度/高度比例
 double whRatio = sqrt（
（sqr（k2-1）+ sqr（k2 * m2y  -  m1y）/ f_squared + sqr（k2 * m2x-m1x）/ f_squared）/ 
（sqr（k3-1）+ sqr（k3 * m3y  -  m1y）/ f_squared + sqr（k3 * m3x  -  m1x）/ f_squared）
）; 
 
 //如果k2 == 1且k3 == 1，则焦距方程不可解
 //但焦距不需要计算比值。 
 //我仍然在试图弄清楚在哪种情况下k2和k3变成1 
 //但是似乎是当矩形没有扭曲的透视，
 // ie直视上。然后方程是明显的：
 if（k2 == 1 && k3 == 1）whRatio = sqrt（
（sqr（m2y-m1y）+ sqr（m2x-m1x））/ 
 （sqr（m3y-m1y）+ sqr（m3x-m1x））
 
 
 //测试后，我发现上面的等式
 //实际上给出了height / width比例矩形，
 //不是宽度/高度比例
 //如果有人可以找到导致这个错误，
 //我会非常感激。
 // until then：
 whRatio = 1 / whRatio;

更新：确定的方程式：

以下是 SAGE < a>。
可以在 http：//www.sagenb。 org / home / pub / 704 / 。
（Sage在解方程中非常有用，可以在任何浏览器中使用，查看它）

 ＃计算透视失真的矩形的宽高比
 
＃
＃b：b＃b：b＃[zhang-single]：Single - 查看矩形的几何
＃应用于白板图片整理
＃by Zhenggyou Zhang 
＃http://research.microsoft.com/users/zhang/Papers/WhiteboardRectification.pdf 
 
＃四边形四个角的像素坐标（m1，m2，m3，m4）
＃查看[zhang-single]图1 
 m1x = var '）
 m1y = var（'m1y'）
 m2x = var（'m2x'）
 m2y = var（'m2y'）
 m3x = var 
 m3y = var（'m3y'）
 m4x = var（'m4x'）
 m4y = var（'m4y'）
 
图像的点
＃对于普通相机，这将是图像的中心，
＃ie u0 = IMAGEWIDTH / 2; v0 = IMAGEHEIGHT / 2 
＃如果图像被不对称地裁剪，这个假设不成立
 u0 = var（'u0'）
 v0 = var（'v0'）
 
＃像素长宽比;对于正常的相机像素是正方形，所以s = 1 
s = var（'s'）
 
＃四边形的同构坐标
 m1 = vector（[m1x，m1y， 1]）$ b $ b m2 = vector（[m2x，m2y，1]）$ b $ b m3 = vector（[m3x，m3y，1]）$ b $ b m4 = vector（[m4x，m4y，1] ）
 
 
＃以下等式稍后用于计算焦距
＃和矩形的宽高比。 
＃临时变量：k2，k3，n2，n3 
 
参见[zhang-single]方程11,12 
 k2_ = m1.cross_product（m4）.dot_product ）/ m2.cross_product（m4）.dot_product（m3）
 k3_ = m1.cross_product（m4）.dot_product（m2）/ m3.cross_product（m4）.dot_product（m2）
 k2 = var 'k2'）
 k3 = var（'k3'）
 
＃参见[zhang-single]公式14,16 
 n2 = k2 * m2  -  m1 
 n3 = k3 * m3  -  m1 
 
 
＃相机的焦距。 
f = var（'f'）
＃参见[zhang-single]方程21 
 f_ = sqrt（
 -1 /（
 n2 [2] * n3 [2] * s ^ 2 
）*（
（
 n2 [0] * n3 [0]  - （n2 [0] * n3 [2] + n2 [2] [0]）* u0 + n2 [2] * n3 [2] * u0 ^ 2 
）* s ^ 2 +（
 n2 [1] * n3 [1] * n3 [2] + n2 [2] * n3 [1]）* v0 + n2 [2] * n3 [2] * v0 ^ 2 
）
）
）$ b b 
 
＃标准针孔相机矩阵
＃参见[zhang-single]公式1 
 A = matrix（[[f，0，u0]，[0，s * f ，v0]，[0,0,1]]）
 
 
原始矩形的宽度/高度比例
＃参见[zhang-single] $ b whRatio = sqrt（
（n2 * A.transpose（）^（ -  1）* A ^（ -  1）* n2.transpose（））/ 
（n3 * ^（ -  1）* A ^（ -  1）* n3.transpose（））
）

在c代码中的简化等式由

  print打印简化等式确定，假设u0 = 0，v0 = 0，s = 1
 printk2：=，k2_ 
 printk3：=，k3_ 
 printf：=，f_（u0 = 0，v0 = 0 ，s = 1）
打印whRatio：=，whRatio（u0 = 0，v0 = 0，s = 1）
 
简化方程，假设u0 = 0，v0 = s = 1 
 k2：=（（m1y-m4y）* m3x-（m1x-m4x）* m3y + m1x * m4y-m1y * m4x）/（（m2y 
-m4y）* m3x- m2x-m4x）* m3y + m2x * m4y-m2y * m4x）
 k3：=（（m1y-m4y）* m2x-（m1x-m4x）* m2y + m1x * m4y-m1y * m4x）/ m3y 
-m4y）* m2x-（m3x-m4x）* m2y + m3x * m4y-m3y * m4x）
f：= sqrt（ - （（k3 * m3y-m1y）*（k2 * m2y- m1y）+（k3 * m3x-m1x）*（k2 * m2x 
-m1x））/（（k3-1）*（k2-1）））
 whRatio：= sqrt 1）^ 2 +（k2 * m2y-m1y）^ 2 / f ^ 2 +（k2 * m2x- 
 m1x）^ 2 / f ^ 2）/（（k3-1）^ 2 + * m3y  -  m1y）^ 2 / f ^ 2 +（k3 * m3x  -  
 m1x）^ 2 / f ^ 2））
 
打印一个方程中的一切：
 printwhRatio：=，whRatio（f = f _）（k2 = k2_，k3 = k3 _）（u0 = 0，v0 = 0，s = 1）
 
 b $ b whRatio：= sqrt（（（（m1y-m4y）* m2x-（m1x-m4x）* m2y + m1x * m4y- 
 m1y * m4x）/（（m3y-m4y）* m2x- （m1x-m4x）* m2y + m3x * m4y-m3y * m4x） -  
 1）*（（（m1y-m4y）* m3x-（m1x-m4x）* m3y + m1x * m4y- （（m1y-
 m4y）* m3x-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x）-1）*（（（m1y-
 m4y）* m3x- m4x）* m3y + m1x * m4y-m1y * m4x）* m2y /（（m2y-m4y）* m3x 
-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x） /（（m1y-m4y）* m2x-
（m1x-m4x）* m2y + m1x * m4y-m1y * m4x）* m3y / b m4x）* m2y + m3x * m4y-m3y * m4x）-m1y）*（（（m1y-m4y）* m3x-（m1x- 
 m4x）* m3y + m1x * m4y-m1y * m4x）* m2y /（m1y-m4y）* m3x-（m2x-m4x）* m3y 
 + m2x * m4y-m2y * m4x）-m1y）+ m2y + mby * m4x）* m3x /（（m3y-m4y）* m2x-（m3x-m4x）* m2y + m3x * m4y 
-m3y * m4x） （m2y-m4y）* m3x-（m2x-m4x）* m3y + m4x）* m2x /（（m2y-m4y）* m3x-（m2x-m4y）* m3x-（m1x-m4x）* m3y + m1x * m4y- m2x * m4y-m2y * m4x）
-m1x））+（（（m1y-m4y）* m2x-（m1x-m4x）* m2y + m1x * m4y- 
 m1y * m4x）/ （m1y-m4y）* m2x-（m3x-m4x）* m2y + m3x * m4y-m3y * m4x） -  
 1）* * m4y-m1y * m4x）/（（m2y-
 m4y）* m3x-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x） m4y）* m3x-（m1x-m4x）* m3y + m1x * m4y-m1y * m4x）* m2x /（（m2y-m4y）* m3x 
-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x）* m1x * m3x /（（m3y-m4y）* m4x）* m3x * m4x * m4x * m2x-（m3x-
 m4x）* m2y + m3x * m4y-m3y * m4x）-m1y）*（（（m1y-m4y）* m3x-（m1x- 
 m4x）* m3y + m1x * m4y-m1y * m4x）* m2y /（（m2y-m4y）* m3x-（m2x-m4x）* m3y 
 + m2x * m4y-m2y * m4x） m2x-（m3x-m4x）* m2y + m3x * m4y 
-（m3x-m4x）* m2x  - （m1x-m4x）* m2y + 
 m1x * m4y-m1y * m4x）* m3x / m3y * m4x）* m2x /（（m2y-m4y）* m3x-（m1y-m4y）* m3x-（m1x-m4x）* m3y + m1x * m4y- （m2x-m4x）* m3y + m2x * m4y-m2y * m4x）
-m1x）） - （（m1y-m4y）* m3x-（m1x-m4x）* m3y + m1x * m4y- 
 m1y * m4x）/（（m2y-m4y）* m3x-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x） * m2x-（m1x-m4x）* m2y + m1x * m4y- 
 m1y * m4x）/（（m3y-m4y）* m2x-（m3x-m4x）* m2y + m3x * m4y- （m2y-mby）* m3x  - （m2x-m4x）* m3x  - （m1x-m4x）* m3y + m1x * m4y-m1y * m4x） （m1x-m4x）* m3y + m2x * m4y-m2y * m4x）-1 *（（m1y-
 m4y）* m2x-（m1x-m4x）* m2y + m1x * m4y-m1y * m4x）* m3y / （（m3y-m4y）* m2x 
-（m3x-m4x）* m2y + m3x * m4y-m3y * m4x）-m1y）^ 2 /（（（（m1y-m4y）* m2x- 
 （m1x-m4x）* m2y + m1x * m4y-m1y * m4x）* m3y /（（m3y-m4y）* m2x-（m3x- 
 m4x）* m2y + m3x * m4y- ）*（（m2y-m4y）* m3x-（m2x-m4x）*（m1y-m4y）* m3x-（m1x-
 m4x）* m3y + m1x * m4y- m3y 
 + m2x * m4y-m2y * m4x）-m1y）+（（（m1y-m4y）* m2x-（m1x-m4x）* m2y + 
 m1x * m4y-m1y * m4x）* m3x /（（m1y-m4y）* m2x-（m3x-m4x）* m2y + m3x * m4y 
-m3y * m4x）-m1x）* m3y + m1x * m4y-
 m1y * m4x）* m2x /（（m2y-m4y）* m3x-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x）
  -  m1x） +（（m1y-m4y）* m2x-（m3x-m4x）* m2x-（m3x-m4x）* m2y + m1x * m4y- 
 m1y * m4x） * m4y-m3y * m4x） -  
 1）*（（（m1y-m4y）* m3x-（m1x-m4x）* m3y + m1x * m4y-m1y * m4x）/ m4y）* m3x-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x）-1 *（（m1y-
 m4y）* m2x-（m1x-m4x）* m2y + m1x * m4y -  m1y * m4x）* m3x /（（m3y-m4y）* m2x 
-（m3x-m4x）* m2y + m3x * m4y-m3y * m4x）-m1x）^ 2 / ）* m2x-
（m1x-m4x）* m2y + m1x * m4y-m1y * m4x）* m3y /（（m3y-m4y）* m2x-（m3x- 
 m4x）* m2y + m3x * m4y-m1y * m4x）* m2y /（（m2y-m4y）* m3y * m4x）* m1y * m3x-（m2x-m4x）* m3y 
 + m2x * m4y-m2y * m4x）-m1y）+（（（m1y-m4y）* m2x-（m1x-m4x）* m2y + 
 m1x * m4y * m4x）* m3x /（（m3y-m4y）* m2x-（m3x-m4x）* m2y + m3x * m4y 
-m3y * m4x） m3x-（m2x-m4x）* m3y + m1x * m4y- 
 m1y * m4x）* m2x /（（m2y-m4y）* m3x-（m2x-m4x）* m3y + m2x * m4y-m2y * m4x） 
-m1x）） - （（（m1y-m4y）* m2x-（m1x-m4x）* m2y + m1x * m4y- 
 m1y * m4x）/（（m3y-m4y）* m2x- m3x  -  m4x）* m2y + m3x * m4y  -  m3y * m4x） -  
 1）^ 2））

  
＃一些测试：
＃ - 选择一个随机矩形，
＃ 
＃ - 在上面的等式中插入角，
＃ - 检查长宽比是否正确。 
 
 from sage.plot.plot3d.transform import rotate_arbitrary 
 
 #redundandly随机旋转矩阵
 rand_rotMatrix = \ 
 rotate_arbitrary（（uniform（-5 ，均匀（-5,5），均匀（-5,5）），均匀（-5,5））* \\ $ 
 rotate_arbitrary（（uniform（-5,5） 5,5,5），uniform（-5,5）），uniform（-5,5））* \ 
 rotation_arbitrary（（uniform（-5,5），uniform（-5,5），uniform -5,5）），uniform（-5,5））
 
 #random translation vector 
 rand_transVector = vector（uniform（-10,10），uniform（-10,10 ），uniform（-10,10）））。transpose（）
 
 #random矩形参数
 rand_width = uniform（0.1,10）
 rand_height = uniform ）
 rand_left = uniform（-10,10）
 rand_top = uniform（-10,10）
 
 #random焦距和主点
 rand_f = uniform 0.1,100）
 rand_u0 = uniform（-100,100）
 rand_v0 = uniform（-100,100）
 
＃同质标准针孔投影，见[zhang-single] $ b hom_projection = A * rand_rotMatrix.augment（rand_transVector）
 
＃在平面z = 0中构造一个随机矩形，然后随机投影
 rand_m1hom = hom_projection * vector（（rand_left，rand_top ，0,1））。transpose（）
 rand_m3hom = hom_projection * vector（（rand_left + rand_width） ，rand_top，0,1））。transpose（）
 rand_m4hom = hom_projection * vector（（rand_left + rand_width，rand_top + rand_height，0,1））。transpose（）
 
 #change类型从1×3矩阵到向量
 rand_m1hom = rand_m1hom.column（0）
 rand_m2hom = rand_m2hom.column（0）
 rand_m3hom = rand_m3hom.column（0）
 rand_m4hom = rand_m4hom。 [2] 
 rand_m2hom = rand_m2hom / rand_m2hom [2] 
 rand_m2hom = rand_m2hom / rand_m2hom [2] 
 
 
 
 rand_m4hom = rand_m4hom / rand_m4hom [2] 
 
 #substitute f，u0，v0的随机值
 rand_m1hom = rand_m1hom（f = rand_f，s = 1，u0 = rand_u0， 
 rand_m3hom = rand_m3hom（f = rand_f，s = 1，u0 = rand_u0，v0 = rand_v0）
 rand_m2hom = rand_m2hom（f = rand_f，s = 1，u0 = rand_u0，v0 = rand_v0） rand_v0）
 rand_m4hom = rand_m4hom（f = rand_f，s = 1，u0 = rand_u0，v0 = rand_v0）
 
＃打印随机选择的值
 printground truth：f =，rand_f，; ratio =，rand_width / rand_height 
 
＃替换等式中的所有变量：
 printcalculated：f =，\ 
 f_（k2 = k2_，k3 = k3 _）（s = 1，u0 = rand_u0，v0 = rand_v0）（
 m1x = rand_m1hom [0]，m1y = rand_m1hom [1]，
 m2x = rand_m2hom [ ]，
 m3x = rand_m3hom [0]，m3y = rand_m3hom [1]，
 m4x = rand_m4hom [0]，m4y = rand_m4hom [1]，
） 1 / ratio =，\ 
 1 / whRatio（f = f _）（k2 = k2_，k3 = k3 _）（s = 1，u0 = rand_u0，v0 = rand_v0）（
 m1x = rand_m1hom [0]，m1y = rand_m1hom [1]，
 m2x = rand_m2hom [0]，m2y = rand_m2hom [1]，
 m3x = rand_m3hom [0]，m3y = rand_m3hom [1] b m4x = rand_m4hom [0]，m4y = rand_m4hom [1]，
）
 
 printk2 =，k2_（
 m1x = rand_m1hom [0]，m1y = rand_m1hom [1]，
 m2x = rand_m2hom [0]，m2y = rand_m2hom [1]，
 m3x = rand_m3hom [0]，m3y = rand_m3hom [1]，
 m4x = rand_m4hom [0] ，m4y = rand_m4hom [1]，
），; k3 =，k3_（
 m1x = rand_m1hom [0]，m1y = rand_m1hom [1]，
 m2x = rand_m2hom [0]，m2y = rand_m2hom [1]，
 m3x = rand_m3hom [ 0]，m3y = rand_m3hom [1]，
 m4x = rand_m4hom [0]，m4y = rand_m4hom [1]，
）
 
注意： 
＃实际上是高/宽比，
＃不是宽/高比
＃这与[zhang-single] 
＃如果任何人都可以找到导致此错误，我会感激
 
地面实况：f = 72.1045134124554; ratio = 3.46538779959142 
计算：f = 72.1045134125; 1 / ratio = 3.46538779959 
 k2 = 0.99114614987; k3 = 1.57376280159

Given a 2d picture of a rectangle distorted by perspective:

I know that the shape was originally a rectangle, but I do not know its original size.

If I know the pixel coordinates of the corners in this picture, how can I calculate the original proportions, i.e. the quotient ( width / height ) of the rectangle?

(background: the goal is to automatically undistort photos of rectangular documents, edge detection will probably be done with hough transform)

UPDATE:

There has been some discussion on whether it is possible at all to determine the width:height ratio with the information given. My naive thought was that it must be possible, since I can think of no way to project for example a 1:4 rectangle onto the quadrangle depicted above. The ratio appears clearly close to 1:1, so there should be a way to determine it mathematically. I have however no proof for this beyond my intuitive guess.

I have not yet fully understood the arguments presented below, but I think there must be some implicit assumption that we are missing here and that is interpreted differently.

However, after hours of searching, I have finally found some papers relevant to the problem. I am struggling to understand the math used in there, so far without success. Particularly the first paper seems to discuss exactly what I wanted to do, unfortunately without code examples and very dense math.

Zhengyou Zhang , Li-Wei He, "Whiteboard scanning and image enhancement" http://research.microsoft.com/en-us/um/people/zhang/papers/tr03-39.pdf p.11

"Because of the perspective distortion, the image of a rectangle appears to be a quadrangle. However, since we know that it is a rectangle in space, we are able to estimate both the camera’s focal length and the rectangle’s aspect ratio."
ROBERT M. HARALICK "Determining camera parameters from the perspective projection of a rectangle" http://portal.acm.org/citation.cfm?id=87146

"we show how to use the 2D perspective projection of a rectangle of unknown size and position in 3D space to determine the camera look angle parameters relative to the plans of the rectangle."

解决方案

Here is my attempt at answering my question after reading the paper

Zhengyou Zhang , Li-Wei He, "Whiteboard scanning and image enhancement" http://research.microsoft.com/en-us/um/people/zhang/papers/tr03-39.pdf

I manipulated the equations for some time in SAGE, and came up with this pseudo-code in c-style:


// in case it matters: licensed under GPLv2 or later
// legend:
// sqr(x)  = x*x
// sqrt(x) = square root of x

// let m1x,m1y ... m4x,m4y be the (x,y) pixel coordinates
// of the 4 corners of the detected quadrangle
// i.e. (m1x, m1y) are the cordinates of the first corner, 
// (m2x, m2y) of the second corner and so on.
// let u0, v0 be the pixel coordinates of the principal point of the image
// for a normal camera this will be the center of the image, 
// i.e. u0=IMAGEWIDTH/2; v0 =IMAGEHEIGHT/2
// This assumption does not hold if the image has been cropped asymmetrically

// first, transform the image so the principal point is at (0,0)
// this makes the following equations much easier
m1x = m1x - u0;
m1y = m1y - v0;
m2x = m2x - u0;
m2y = m2y - v0;
m3x = m3x - u0;
m3y = m3y - v0;
m4x = m4x - u0;
m4y = m4y - v0;


// temporary variables k2, k3
double k2 = ((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x) /
            ((m2y - m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) ;

double k3 = ((m1y - m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y - m1y*m4x) / 
            ((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) ;

// f_squared is the focal length of the camera, squared
// if k2==1 OR k3==1 then this equation is not solvable
// if the focal length is known, then this equation is not needed
// in that case assign f_squared= sqr(focal_length)
double f_squared = 
    -((k3*m3y - m1y)*(k2*m2y - m1y) + (k3*m3x - m1x)*(k2*m2x - m1x)) / 
                      ((k3 - 1)*(k2 - 1)) ;

//The width/height ratio of the original rectangle
double whRatio = sqrt( 
    (sqr(k2 - 1) + sqr(k2*m2y - m1y)/f_squared + sqr(k2*m2x - m1x)/f_squared) /
    (sqr(k3 - 1) + sqr(k3*m3y - m1y)/f_squared + sqr(k3*m3x - m1x)/f_squared) 
) ;

// if k2==1 AND k3==1, then the focal length equation is not solvable 
// but the focal length is not needed to calculate the ratio.
// I am still trying to figure out under which circumstances k2 and k3 become 1
// but it seems to be when the rectangle is not distorted by perspective, 
// i.e. viewed straight on. Then the equation is obvious:
if (k2==1 && k3==1) whRatio = sqrt( 
    (sqr(m2y-m1y) + sqr(m2x-m1x)) / 
    (sqr(m3y-m1y) + sqr(m3x-m1x))


// After testing, I found that the above equations 
// actually give the height/width ratio of the rectangle, 
// not the width/height ratio. 
// If someone can find the error that caused this, 
// I would be most grateful.
// until then:
whRatio = 1/whRatio;

Update: here is how these equations where determined:

The following is code in SAGE. It can be accessed online at http://www.sagenb.org/home/pub/704/. (Sage is really useful in solving equations, and useable in any browser, check it out)

# CALCULATING THE ASPECT RATIO OF A RECTANGLE DISTORTED BY PERSPECTIVE

#
# BIBLIOGRAPHY:
# [zhang-single]: "Single-View Geometry of A Rectangle 
#  With Application to Whiteboard Image Rectification"
#  by Zhenggyou Zhang
#  http://research.microsoft.com/users/zhang/Papers/WhiteboardRectification.pdf

# pixel coordinates of the 4 corners of the quadrangle (m1, m2, m3, m4)
# see [zhang-single] figure 1
m1x = var('m1x')
m1y = var('m1y')
m2x = var('m2x')
m2y = var('m2y')
m3x = var('m3x')
m3y = var('m3y')
m4x = var('m4x')
m4y = var('m4y')

# pixel coordinates of the principal point of the image
# for a normal camera this will be the center of the image, 
# i.e. u0=IMAGEWIDTH/2; v0 =IMAGEHEIGHT/2
# This assumption does not hold if the image has been cropped asymmetrically
u0 = var('u0')
v0 = var('v0')

# pixel aspect ratio; for a normal camera pixels are square, so s=1
s = var('s')

# homogenous coordinates of the quadrangle
m1 = vector ([m1x,m1y,1])
m2 = vector ([m2x,m2y,1])
m3 = vector ([m3x,m3y,1])
m4 = vector ([m4x,m4y,1])


# the following equations are later used in calculating the the focal length 
# and the rectangle's aspect ratio.
# temporary variables: k2, k3, n2, n3

# see [zhang-single] Equation 11, 12
k2_ = m1.cross_product(m4).dot_product(m3) / m2.cross_product(m4).dot_product(m3)
k3_ = m1.cross_product(m4).dot_product(m2) / m3.cross_product(m4).dot_product(m2)
k2 = var('k2')
k3 = var('k3')

# see [zhang-single] Equation 14,16
n2 = k2 * m2 - m1
n3 = k3 * m3 - m1


# the focal length of the camera.
f = var('f')
# see [zhang-single] Equation 21
f_ = sqrt(
         -1 / (
          n2[2]*n3[2]*s^2
         ) * (
          (
           n2[0]*n3[0] - (n2[0]*n3[2]+n2[2]*n3[0])*u0 + n2[2]*n3[2]*u0^2
          )*s^2 + (
           n2[1]*n3[1] - (n2[1]*n3[2]+n2[2]*n3[1])*v0 + n2[2]*n3[2]*v0^2
          ) 
         ) 
        )


# standard pinhole camera matrix
# see [zhang-single] Equation 1
A = matrix([[f,0,u0],[0,s*f,v0],[0,0,1]])


#the width/height ratio of the original rectangle
# see [zhang-single] Equation 20
whRatio = sqrt (
               (n2*A.transpose()^(-1) * A^(-1)*n2.transpose()) / 
               (n3*A.transpose()^(-1) * A^(-1)*n3.transpose())
              )

The simplified equations in the c-code where determined by

print "simplified equations, assuming u0=0, v0=0, s=1"
print "k2 := ", k2_
print "k3 := ", k3_
print "f  := ", f_(u0=0,v0=0,s=1)
print "whRatio := ", whRatio(u0=0,v0=0,s=1)

    simplified equations, assuming u0=0, v0=0, s=1
    k2 :=  ((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x)/((m2y
    - m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x)
    k3 :=  ((m1y - m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y - m1y*m4x)/((m3y
    - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x)
    f  :=  sqrt(-((k3*m3y - m1y)*(k2*m2y - m1y) + (k3*m3x - m1x)*(k2*m2x
    - m1x))/((k3 - 1)*(k2 - 1)))
    whRatio :=  sqrt(((k2 - 1)^2 + (k2*m2y - m1y)^2/f^2 + (k2*m2x -
    m1x)^2/f^2)/((k3 - 1)^2 + (k3*m3y - m1y)^2/f^2 + (k3*m3x -
    m1x)^2/f^2))

print "Everything in one equation:"
print "whRatio := ", whRatio(f=f_)(k2=k2_,k3=k3_)(u0=0,v0=0,s=1)

    Everything in one equation:
    whRatio :=  sqrt(((((m1y - m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y -
    m1y*m4x)/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) -
    1)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x)/((m2y -
    m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) - 1)*(((m1y -
    m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x)*m2y/((m2y - m4y)*m3x
    - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) - m1y)^2/((((m1y - m4y)*m2x -
    (m1x - m4x)*m2y + m1x*m4y - m1y*m4x)*m3y/((m3y - m4y)*m2x - (m3x -
    m4x)*m2y + m3x*m4y - m3y*m4x) - m1y)*(((m1y - m4y)*m3x - (m1x -
    m4x)*m3y + m1x*m4y - m1y*m4x)*m2y/((m2y - m4y)*m3x - (m2x - m4x)*m3y
    + m2x*m4y - m2y*m4x) - m1y) + (((m1y - m4y)*m2x - (m1x - m4x)*m2y +
    m1x*m4y - m1y*m4x)*m3x/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y
    - m3y*m4x) - m1x)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y -
    m1y*m4x)*m2x/((m2y - m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x)
    - m1x)) + (((m1y - m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y -
    m1y*m4x)/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) -
    1)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x)/((m2y -
    m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) - 1)*(((m1y -
    m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x)*m2x/((m2y - m4y)*m3x
    - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) - m1x)^2/((((m1y - m4y)*m2x -
    (m1x - m4x)*m2y + m1x*m4y - m1y*m4x)*m3y/((m3y - m4y)*m2x - (m3x -
    m4x)*m2y + m3x*m4y - m3y*m4x) - m1y)*(((m1y - m4y)*m3x - (m1x -
    m4x)*m3y + m1x*m4y - m1y*m4x)*m2y/((m2y - m4y)*m3x - (m2x - m4x)*m3y
    + m2x*m4y - m2y*m4x) - m1y) + (((m1y - m4y)*m2x - (m1x - m4x)*m2y +
    m1x*m4y - m1y*m4x)*m3x/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y
    - m3y*m4x) - m1x)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y -
    m1y*m4x)*m2x/((m2y - m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x)
    - m1x)) - (((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y -
    m1y*m4x)/((m2y - m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) -
    1)^2)/((((m1y - m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y -
    m1y*m4x)/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) -
    1)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x)/((m2y -
    m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) - 1)*(((m1y -
    m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y - m1y*m4x)*m3y/((m3y - m4y)*m2x
    - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) - m1y)^2/((((m1y - m4y)*m2x -
    (m1x - m4x)*m2y + m1x*m4y - m1y*m4x)*m3y/((m3y - m4y)*m2x - (m3x -
    m4x)*m2y + m3x*m4y - m3y*m4x) - m1y)*(((m1y - m4y)*m3x - (m1x -
    m4x)*m3y + m1x*m4y - m1y*m4x)*m2y/((m2y - m4y)*m3x - (m2x - m4x)*m3y
    + m2x*m4y - m2y*m4x) - m1y) + (((m1y - m4y)*m2x - (m1x - m4x)*m2y +
    m1x*m4y - m1y*m4x)*m3x/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y
    - m3y*m4x) - m1x)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y -
    m1y*m4x)*m2x/((m2y - m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x)
    - m1x)) + (((m1y - m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y -
    m1y*m4x)/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) -
    1)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y - m1y*m4x)/((m2y -
    m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x) - 1)*(((m1y -
    m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y - m1y*m4x)*m3x/((m3y - m4y)*m2x
    - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) - m1x)^2/((((m1y - m4y)*m2x -
    (m1x - m4x)*m2y + m1x*m4y - m1y*m4x)*m3y/((m3y - m4y)*m2x - (m3x -
    m4x)*m2y + m3x*m4y - m3y*m4x) - m1y)*(((m1y - m4y)*m3x - (m1x -
    m4x)*m3y + m1x*m4y - m1y*m4x)*m2y/((m2y - m4y)*m3x - (m2x - m4x)*m3y
    + m2x*m4y - m2y*m4x) - m1y) + (((m1y - m4y)*m2x - (m1x - m4x)*m2y +
    m1x*m4y - m1y*m4x)*m3x/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y
    - m3y*m4x) - m1x)*(((m1y - m4y)*m3x - (m1x - m4x)*m3y + m1x*m4y -
    m1y*m4x)*m2x/((m2y - m4y)*m3x - (m2x - m4x)*m3y + m2x*m4y - m2y*m4x)
    - m1x)) - (((m1y - m4y)*m2x - (m1x - m4x)*m2y + m1x*m4y -
    m1y*m4x)/((m3y - m4y)*m2x - (m3x - m4x)*m2y + m3x*m4y - m3y*m4x) -
    1)^2))


# some testing:
# - choose a random rectangle, 
# - project it onto a random plane,
# - insert the corners in the above equations,
# - check if the aspect ratio is correct.

from sage.plot.plot3d.transform import rotate_arbitrary

#redundandly random rotation matrix
rand_rotMatrix = \
           rotate_arbitrary((uniform(-5,5),uniform(-5,5),uniform(-5,5)),uniform(-5,5)) *\
           rotate_arbitrary((uniform(-5,5),uniform(-5,5),uniform(-5,5)),uniform(-5,5)) *\
           rotate_arbitrary((uniform(-5,5),uniform(-5,5),uniform(-5,5)),uniform(-5,5))

#random translation vector
rand_transVector = vector((uniform(-10,10),uniform(-10,10),uniform(-10,10))).transpose()

#random rectangle parameters
rand_width =uniform(0.1,10)
rand_height=uniform(0.1,10)
rand_left  =uniform(-10,10)
rand_top   =uniform(-10,10)

#random focal length and principal point
rand_f  = uniform(0.1,100)
rand_u0 = uniform(-100,100)
rand_v0 = uniform(-100,100)

# homogenous standard pinhole projection, see [zhang-single] Equation 1
hom_projection = A * rand_rotMatrix.augment(rand_transVector)

# construct a random rectangle in the plane z=0, then project it randomly 
rand_m1hom = hom_projection*vector((rand_left           ,rand_top            ,0,1)).transpose()
rand_m2hom = hom_projection*vector((rand_left           ,rand_top+rand_height,0,1)).transpose()
rand_m3hom = hom_projection*vector((rand_left+rand_width,rand_top            ,0,1)).transpose()
rand_m4hom = hom_projection*vector((rand_left+rand_width,rand_top+rand_height,0,1)).transpose()

#change type from 1x3 matrix to vector
rand_m1hom = rand_m1hom.column(0)
rand_m2hom = rand_m2hom.column(0)
rand_m3hom = rand_m3hom.column(0)
rand_m4hom = rand_m4hom.column(0)

#normalize
rand_m1hom = rand_m1hom/rand_m1hom[2]
rand_m2hom = rand_m2hom/rand_m2hom[2]
rand_m3hom = rand_m3hom/rand_m3hom[2]
rand_m4hom = rand_m4hom/rand_m4hom[2]

#substitute random values for f, u0, v0
rand_m1hom = rand_m1hom(f=rand_f,s=1,u0=rand_u0,v0=rand_v0)
rand_m2hom = rand_m2hom(f=rand_f,s=1,u0=rand_u0,v0=rand_v0)
rand_m3hom = rand_m3hom(f=rand_f,s=1,u0=rand_u0,v0=rand_v0)
rand_m4hom = rand_m4hom(f=rand_f,s=1,u0=rand_u0,v0=rand_v0)

# printing the randomly choosen values
print "ground truth: f=", rand_f, "; ratio=", rand_width/rand_height

# substitute all the variables in the equations:
print "calculated: f= ",\
f_(k2=k2_,k3=k3_)(s=1,u0=rand_u0,v0=rand_v0)(
  m1x=rand_m1hom[0],m1y=rand_m1hom[1],
  m2x=rand_m2hom[0],m2y=rand_m2hom[1],
  m3x=rand_m3hom[0],m3y=rand_m3hom[1],
  m4x=rand_m4hom[0],m4y=rand_m4hom[1],
),"; 1/ratio=", \
1/whRatio(f=f_)(k2=k2_,k3=k3_)(s=1,u0=rand_u0,v0=rand_v0)(
  m1x=rand_m1hom[0],m1y=rand_m1hom[1],
  m2x=rand_m2hom[0],m2y=rand_m2hom[1],
  m3x=rand_m3hom[0],m3y=rand_m3hom[1],
  m4x=rand_m4hom[0],m4y=rand_m4hom[1],
)

print "k2 = ", k2_(
  m1x=rand_m1hom[0],m1y=rand_m1hom[1],
  m2x=rand_m2hom[0],m2y=rand_m2hom[1],
  m3x=rand_m3hom[0],m3y=rand_m3hom[1],
  m4x=rand_m4hom[0],m4y=rand_m4hom[1],
), "; k3 = ", k3_(
  m1x=rand_m1hom[0],m1y=rand_m1hom[1],
  m2x=rand_m2hom[0],m2y=rand_m2hom[1],
  m3x=rand_m3hom[0],m3y=rand_m3hom[1],
  m4x=rand_m4hom[0],m4y=rand_m4hom[1],
)

# ATTENTION: testing revealed, that the whRatio 
# is actually the height/width ratio, 
# not the width/height ratio
# This contradicts [zhang-single]
# if anyone can find the error that caused this, I'd be grateful

    ground truth: f= 72.1045134124554 ; ratio= 3.46538779959142
    calculated: f=  72.1045134125 ; 1/ratio= 3.46538779959
    k2 =  0.99114614987 ; k3 =  1.57376280159

这篇关于透视变形矩形的比例的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

透视变形矩形的比例 [英] proportions of a perspective-deformed rectangle

问题描述

更新：

更新：确定的方程式：

UPDATE:

Update: here is how these equations where determined:

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

透视变形矩形的比例 [英] proportions of a perspective-deformed rectangle

问题描述

更新：

更新：确定的方程式：

UPDATE:

Update: here is how these equations where determined:

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭