Python/PIL affine transformation

3 answers

User
Answer 1 · edited 2024-05-23 22:51:11

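I think this should answer your question.
If it does not, consider that affine transformations can be concatenated into a single transformation.
So you can split the operation you want into:
Move the origin to the center of the image
Rotate
Move the origin back
Resize
You can even compute a single transformation out of these steps.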
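
As a minimal sketch of that concatenation, the snippet below builds the move, rotate, and move-back steps as 3x3 homogeneous matrices in plain Python and multiplies them into one matrix (the resize step is left out for brevity). The helper names `matmul`, `translation`, `rotation`, and `apply`, as well as the 100x60 image size and the 30 degree angle, are illustrative assumptions and not part of PIL.

```python
import math


def matmul(p, q):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def translation(x, y):
    return [[1, 0, x], [0, 1, y], [0, 0, 1]]


def rotation(theta):
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0], [-s, c, 0], [0, 0, 1]]


def apply(m, x, y):
    """Apply a 3x3 homogeneous matrix to the point (x, y)."""
    return (m[0][0] * x + m[0][1] * y + m[0][2],
            m[1][0] * x + m[1][1] * y + m[1][2])


w, h = 100, 60            # assumed image size
cx, cy = w / 2, h / 2

# The rightmost matrix is applied first: move the center to the origin,
# rotate, then move the origin back to the center.
combined = matmul(translation(cx, cy),
                  matmul(rotation(math.radians(30)), translation(-cx, -cy)))

# The image center is the fixed point of the combined transformation.
center_after = apply(combined, cx, cy)
```

Because the three steps collapse into one matrix, only a single 6-tuple would have to be handed to an image library afterwards.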

User
Answer 2 · edited 2024-05-23 22:51:11

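OK! I spent the whole weekend trying to understand this, and I think I have an answer that satisfies me. Thank you all for your comments and suggestions!
I started by looking at this:
affine transform in PIL python?
While I can see that its author is able to perform arbitrary similarity transformations, it does not explain why my code was not working, it does not explain the spatial layout of the image we need to transform, and it does not give a linear-algebra solution to my problem.
But I did notice in his code that he divides the rotation part of the matrix (a, b, d and e) by the scale, which struck me as odd. I went back to the PIL documentation, which I quote:
"im.transform(size, AFFINE, data, filter) => image
Applies an affine transform to the image, and places the result in a new image with the given size.
Data is a 6-tuple (a, b, c, d, e, f) which contain the first two rows from an affine transform matrix. For each pixel (x, y) in the output image, the new value is taken from a position (a x + b y + c, d x + e y + f) in the input image, rounded to nearest pixel.
This function can be used to scale, translate, rotate, and shear the original image."
So the parameters (a, b, c, d, e, f) do form a transform matrix, but it is the one that maps (x, y) in the destination image to (a x + b y + c, d x + e y + f) in the source image. They are not the parameters of the transform you want to apply, but of its inverse. That is:
weird
different from Matlab
but now, fortunately, fully understood by me
I am attaching my code: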
import math

from numpy import linalg, matrix
from PIL import Image


def rot_x(angle, ptx, pty):
    return math.cos(angle) * ptx + math.sin(angle) * pty


def rot_y(angle, ptx, pty):
    return -math.sin(angle) * ptx + math.cos(angle) * pty


angle = math.radians(45)
im = Image.open('test.jpg')
(x, y) = im.size

# Rotate the four corners to find the extent of the rotated image.
xextremes = [rot_x(angle, 0, 0), rot_x(angle, 0, y - 1), rot_x(angle, x - 1, 0), rot_x(angle, x - 1, y - 1)]
yextremes = [rot_y(angle, 0, 0), rot_y(angle, 0, y - 1), rot_y(angle, x - 1, 0), rot_y(angle, x - 1, y - 1)]
mnx = min(xextremes)
mxx = max(xextremes)
mny = min(yextremes)
mxy = max(yextremes)
print(mnx, mny)

# Forward transform: rotation plus a shift that moves the result into view.
T = matrix([[math.cos(angle), math.sin(angle), -mnx],
            [-math.sin(angle), math.cos(angle), -mny],
            [0, 0, 1]])
# PIL expects the inverse of the transform we want to apply.
Tinv = linalg.inv(T)
print(Tinv)
Tinvtuple = (Tinv[0, 0], Tinv[0, 1], Tinv[0, 2], Tinv[1, 0], Tinv[1, 1], Tinv[1, 2])
print(Tinvtuple)
im = im.transform((int(round(mxx - mnx)), int(round(mxy - mny))), Image.AFFINE, Tinvtuple, resample=Image.BILINEAR)
im.save('outputpython2.jpg')
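and the output from Python:
Let me state the answer to this question once more as a final summary:
PIL requires the inverse of the affine transformation you want to apply.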
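
To make the direction of the mapping concrete, here is a small pure-Python emulation of the rule quoted from the documentation. It is only a sketch: it works on a nested list of pixel values with nearest-pixel rounding, and `affine_sample` is a hypothetical helper, not a PIL function. Note that moving the content one pixel to the right requires c = -1, the inverse of the shift you actually want.

```python
def affine_sample(src, size, coeffs, fill=0):
    """For each output pixel (x, y), look up the source position
    (a*x + b*y + c, d*x + e*y + f), rounded to the nearest pixel."""
    a, b, c, d, e, f = coeffs
    out_w, out_h = size
    src_h, src_w = len(src), len(src[0])
    out = []
    for y in range(out_h):
        row = []
        for x in range(out_w):
            sx = int(round(a * x + b * y + c))
            sy = int(round(d * x + e * y + f))
            inside = 0 <= sx < src_w and 0 <= sy < src_h
            row.append(src[sy][sx] if inside else fill)
        out.append(row)
    return out


src = [[1, 2, 3],
       [4, 5, 6]]

# To shift the content right by one pixel, each output pixel must read
# from one pixel to its LEFT, hence c = -1 rather than +1.
shifted_right = affine_sample(src, (3, 2), (1, 0, -1, 0, 1, 0))
```

Here `shifted_right` holds the source values moved one column to the right, with the fill value in the first column.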

User
Answer 3 · edited 2024-05-23 22:51:11

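I would like to expand a bit on the answers by carlosdc and Ruediger Jungbeck, to present a more practical Python code solution along with some explanation.

First, it is absolutely true that PIL uses inverse affine transformations, as stated in carlosdc's answer. However, there is no need to use linear algebra to compute the inverse of the original transformation; it can easily be expressed directly. I will use scaling and rotating an image about its center as the example, as in the code linked to in Ruediger Jungbeck's answer, but it is fairly straightforward to extend this to shearing as well.

Before looking at how to express the inverse affine transformation for scaling and rotating, consider how we would find the original transformation. As hinted at in Ruediger Jungbeck's answer, the transformation for the combined operation of scaling and rotating is found as the composition of the elementary operators for scaling an image about the origin and rotating an image about the origin.

However, since we want to scale and rotate the image about its own center, and the origin (0, 0) is defined by PIL to be the upper left corner of the image, we first need to translate the image so that its center coincides with the origin. After applying the scaling and rotation, we also need to translate the image back in such a way that the new center of the image (which may differ from the old center after scaling and rotating) ends up in the center of the image canvas.

So the original "standard" affine transformation we are after is the composition of the following elementary operators: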

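Find the current center $(c_x, c_y)$ of the image, and translate the image by $(-c_x, -c_y)$ , so that the center of the image is at the origin $(0, 0)$ .
Scale the image about the origin by some scale factor $(s_x, s_y)$ .
Rotate the image about the origin by some angle $\theta$ .
Find the new center $(t_x, t_y)$ of the image, and translate the image by $(t_x, t_y)$ so that the new center ends up in the center of the image canvas.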

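To find the transformation we are after, we first need to know the transformation matrices of the elementary operators, which are as follows: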

Translation by $(x, y)$ : $\displaystyle \begin{bmatrix} 1 & 0 & x\\ 0 & 1 & y\\ 0 & 0 & 1 \end{bmatrix}$
Scaling by $(s_x, s_y)$ : $\displaystyle \begin{bmatrix} s_x & 0 & 0\\ 0 & s_y & 0\\ 0 & 0 & 1 \end{bmatrix}$
Rotation by $\theta$ : $\displaystyle \begin{bmatrix} \cos(\theta) & \sin(\theta) & 0\\ -\sin(\theta) & \cos(\theta) & 0\\ 0 & 0 & 1 \end{bmatrix}$

Then, our composite transformation can be expressed as:

$\displaystyle T = \begin{bmatrix} 1 & 0 & t_x\\ 0 &1 & t_y\\ 0 & 0 & 1 \end{bmatrix}$ $\displaystyle \begin{bmatrix} \cos(\theta) & \sin(\theta) & 0\\ -\sin(\theta) & \cos(\theta) & 0\\ 0 & 0 & 1 \end{bmatrix}$ $\displaystyle \begin{bmatrix} s_x & 0 & 0\\ 0 & s_y & 0\\ 0 & 0 & 1 \end{bmatrix}$ $\displaystyle \begin{bmatrix} 1 & 0 & -c_x\\ 0 &1 & -c_y\\ 0 & 0 & 1 \end{bmatrix}$

which is equal to

$T=\begin{bmatrix}s_x\cos(\theta)&s_y\sin(\theta)&t_x-c_x s_x\cos(\theta)-c_y s_y\sin(\theta)\\-s_x\sin(\theta)&s_y\cos(\theta)&t_y+c_x s_x \sin(\theta)-c_y s_y\cos(\theta)\\0&0&1\end{bmatrix}$

or

$T=\begin{bmatrix}a&b&t_x-c_x a-c_y b\\d&e&t_y-c_x d-c_y e\\0&0&1\end{bmatrix}$

where

$a=s_x\cos(\theta),\qquad b=s_y\sin(\theta),\qquad d=-s_x\sin(\theta),\qquad e=s_y\cos(\theta)$ .
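
As a sanity check on the closed form, the sketch below multiplies the four elementary matrices numerically and compares the product with the formula for $T$. The numeric values for the scale, angle, and centers are arbitrary assumptions, and `matmul` is a hypothetical helper written for this example.

```python
import math


def matmul(p, q):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


# Arbitrary example values (assumptions, not taken from the answers).
sx, sy, theta = 0.8, 1.2, math.radians(10)
cx, cy, tx, ty = 50.0, 30.0, 47.0, 44.0
cos_t, sin_t = math.cos(theta), math.sin(theta)

translate_to_origin = [[1, 0, -cx], [0, 1, -cy], [0, 0, 1]]
scale = [[sx, 0, 0], [0, sy, 0], [0, 0, 1]]
rotate = [[cos_t, sin_t, 0], [-sin_t, cos_t, 0], [0, 0, 1]]
translate_to_canvas = [[1, 0, tx], [0, 1, ty], [0, 0, 1]]

# Same order as in the composite expression: the rightmost factor acts first.
product = matmul(translate_to_canvas,
                 matmul(rotate, matmul(scale, translate_to_origin)))

# Closed form with a, b, d, e as defined in the text.
a, b = sx * cos_t, sy * sin_t
d, e = -sx * sin_t, sy * cos_t
closed_form = [[a, b, tx - cx * a - cy * b],
               [d, e, ty - cx * d - cy * e],
               [0, 0, 1]]

max_error = max(abs(product[i][j] - closed_form[i][j])
                for i in range(3) for j in range(3))
```

With these values `max_error` stays at floating-point rounding level, which is what one would expect if the expansion is right.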

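Now, to find the inverse of this composite affine transformation, we just need to compose the inverses of the elementary operators in reverse order. That is, we want to

Translate the image by $(-t_x, -t_y)$ .
Rotate the image about the origin by $-\theta$ .
Scale the image about the origin by $\left(\frac{1}{s_x}, \frac{1}{s_y}\right)$ .
Translate the image by $(c_x, c_y)$ .

This results in the transformation matrix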

$T^{-1}=\begin{bmatrix}a&b&c_x-t_x a-t_y b\\d&e&c_y-t_x d-t_y e\\0&0&1\end{bmatrix}$

where

$a=\frac{\cos(-\theta)}{s_x},\qquad b=\frac{\sin(-\theta)}{s_x},\qquad d=-\frac{\sin(-\theta)}{s_y},\qquad e=\frac{\cos(-\theta)}{s_y}$ .
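
One way to convince yourself that this really is the inverse is a round trip: push a point through the forward coefficients, then through the inverse coefficients, and check that it comes back. The following is a sketch under assumed example values; `forward_coeffs`, `inverse_coeffs`, and `apply` are hypothetical helper names.

```python
import math


def forward_coeffs(sx, sy, theta, cx, cy, tx, ty):
    """First two rows of T, using the closed form of the forward transform."""
    a, b = sx * math.cos(theta), sy * math.sin(theta)
    d, e = -sx * math.sin(theta), sy * math.cos(theta)
    return (a, b, tx - cx * a - cy * b, d, e, ty - cx * d - cy * e)


def inverse_coeffs(sx, sy, theta, cx, cy, tx, ty):
    """First two rows of T^{-1}, using the closed form of the inverse."""
    a, b = math.cos(-theta) / sx, math.sin(-theta) / sx
    d, e = -math.sin(-theta) / sy, math.cos(-theta) / sy
    return (a, b, cx - tx * a - ty * b, d, e, cy - tx * d - ty * e)


def apply(coeffs, x, y):
    a, b, c, d, e, f = coeffs
    return (a * x + b * y + c, d * x + e * y + f)


# Arbitrary example values (assumptions).
params = (0.8, 1.2, math.radians(10), 50.0, 30.0, 47.0, 44.0)
point = (12.0, 7.0)

there = apply(forward_coeffs(*params), *point)
back = apply(inverse_coeffs(*params), *there)
```

The 6-tuple returned by `inverse_coeffs` has the same layout, (a, b, c, d, e, f), as the data argument described in the PIL documentation quoted earlier in the thread.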

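This is exactly the same as the transformation used in the code linked to in Ruediger Jungbeck's answer. It can be made more convenient by reusing the technique carlosdc used in their post to calculate $(t_x, t_y)$ and translate the image by $(t_x, t_y)$ : apply the rotation to all four corners of the image, then calculate the distance between the minimum and maximum X and Y values. However, since the image is rotated about its own center, there is no need to rotate all four corners, because each pair of opposite corners is rotated "symmetrically".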
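
The symmetry argument can be checked numerically. The sketch below compares the extent obtained by rotating all four corners about the center with the shortcut |cos θ|·w + |sin θ|·h (and the analogous expression for the height). Function names and the 160x90 size are assumptions made for this illustration.

```python
import math


def rotated_size_four_corners(w, h, theta):
    """Rotate all four corners about the center and measure the extent."""
    cx, cy = w / 2, h / 2
    xs, ys = [], []
    for x, y in [(0, 0), (w, 0), (0, h), (w, h)]:
        dx, dy = x - cx, y - cy
        xs.append(math.cos(theta) * dx + math.sin(theta) * dy)
        ys.append(-math.sin(theta) * dx + math.cos(theta) * dy)
    return max(xs) - min(xs), max(ys) - min(ys)


def rotated_size_closed_form(w, h, theta):
    """Shortcut that relies on opposite corners being mirror images."""
    return (abs(math.cos(theta) * w) + abs(math.sin(theta) * h),
            abs(math.sin(theta) * w) + abs(math.cos(theta) * h))


size_a = rotated_size_four_corners(160, 90, math.radians(10))
size_b = rotated_size_closed_form(160, 90, math.radians(10))
```

Both functions should agree up to rounding error, so the cheaper closed form is enough to size the output canvas.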

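Here is a rewritten version of carlosdc's code that has been modified to use the inverse affine transformation directly, and which also adds scaling: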

from PIL import Image
import math


def scale_and_rotate_image(im, sx, sy, deg_ccw):
    # Paste the input onto an opaque white RGBA canvas of the same size.
    im_orig = im
    im = Image.new('RGBA', im_orig.size, (255, 255, 255, 255))
    im.paste(im_orig)

    w, h = im.size
    # The inverse transform uses -theta, so negate the angle once here.
    angle = math.radians(-deg_ccw)

    cos_theta = math.cos(angle)
    sin_theta = math.sin(angle)

    scaled_w, scaled_h = w * sx, h * sy

    # Bounding box of the scaled and rotated image.
    new_w = int(math.ceil(math.fabs(cos_theta * scaled_w) + math.fabs(sin_theta * scaled_h)))
    new_h = int(math.ceil(math.fabs(sin_theta * scaled_w) + math.fabs(cos_theta * scaled_h)))

    # (cx, cy): center of the input image; (tx, ty): center of the output canvas.
    cx = w / 2.
    cy = h / 2.
    tx = new_w / 2.
    ty = new_h / 2.

    # Coefficients of the inverse transform, in the order PIL expects.
    a = cos_theta / sx
    b = sin_theta / sx
    c = cx - tx * a - ty * b
    d = -sin_theta / sy
    e = cos_theta / sy
    f = cy - tx * d - ty * e

    return im.transform(
        (new_w, new_h),
        Image.AFFINE,
        (a, b, c, d, e, f),
        resample=Image.BILINEAR
    )


im = Image.open('test.jpg')
im = scale_and_rotate_image(im, 0.8, 1.2, 10)
im.save('outputpython.png')

This is what the result looks like (scaled with (sx, sy) = (0.8, 1.2) and rotated 10 degrees counter-clockwise):
