连接到AWS Aurora集群时偶尔出现``名称解析暂时失败'' [英] Occasional 'temporary failure in name resolution' while connecting to AWS Aurora cluster

查看：100 发布时间：2021/4/3 19:15:07 python amazon-web-services aws-lambda amazon-rds amazon-rds-aurora

本文介绍了连接到AWS Aurora集群时偶尔出现``名称解析暂时失败''的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在运行Amazon Web Services RDS Aurora 5.6数据库集群.有几个lambda在谈论这些数据库实例，它们都是用python编写的.现在一切都运行良好，但是突然之间，从几天前开始，Python代码有时开始引发以下错误:

I am running an Amazon Web Services RDS Aurora 5.6 database cluster. There are a couple of lambda's talking to these database instances, all written in python. Now everything was running well, but then suddenly, since a couple of days ago, the python code sometimes starts throwing the following error:

[ERROR] InterfaceError:2003:无法连接到"CLUSTER-DOMAIN:3306"上的MySQL服务器(-3名称解析暂时失败)

[ERROR] InterfaceError: 2003: Can't connect to MySQL server on 'CLUSTER-DOMAIN:3306' (-3 Temporary failure in name resolution)

这种情况每1000个新连接中就有1个发生.有趣的是，最近两天我没有涉及到整个服务(因为它开始发生).所有lambda都使用官方的MySQL连接器客户端，并在每次初始化时使用以下代码段进行连接:

This happens in 1 every 1000 or so new connections. What is interesting that I haven't touched this whole service in the last couple of days (since it started happening). All lambdas are using the official MySQL-connector client and connect on every initialization with the following snippet:

import mysql.connector as mysql
import os

connection = mysql.connect(user=os.environ['DATABASE_USER'],
                         password=os.environ['DATABASE_PASSWORD'],
                         database=os.environ['DATABASE_NAME'],
                         host=os.environ['DATABASE_HOST'],
                         autocommit=True)

为了排除这是Python MySQL客户端中的问题，我添加了以下内容来解析主机:

To rule out that this is a problem in the Python MySQL client I added the following to resolve the host:

import os
import socket

host = socket.gethostbyname(os.environ['DATABASE_HOST'])

在这里我有时也会出现以下错误:

Also here I sometimes get the following error:

[ERROR] gaierror:[Errno -2]名称或服务未知

[ERROR] gaierror: [Errno -2] Name or service not known

现在，我怀疑这与DNS有关，但是由于我只是在使用群集终结点，因此我无能为力.有趣的是，我最近在不同的区域也遇到了完全相同的问题，设置相同(Aurora 5.6集群，python中的lambda连接到该集群)，并且在此发生了相同的情况.

Now I suspect this has something to do with DNS, but since I'm just using the cluster endpoint there is not much I can do about that. What is interesting is that I also recently encountered exactly the same problem in a different region, with the same setup (Aurora 5.6 cluster, lambda's in python connecting to it) and the same happens there.

我尝试重新启动集群中的所有计算机，但是问题似乎仍然出现.这真的是DNS问题吗?我该如何阻止这种情况的发生?

I've tried restarting all the machines in the cluster, but the problem still seems to occur. Is this really a DNS issue? What can do I to stop this from happening?

不是解决方案

最初，我认为创建指向群集的CNAME可能是一个好主意，但是现在我不确定缓存Aurora DNS查询结果是否明智.造成这种情况的原因很多，在《 Aurora连接管理手册》 :

除非您使用智能数据库驱动程序，否则您将依赖于DNS记录更新和DNS传播以进行故障转移，实例扩展和负载在整个Aurora副本中保持平衡.目前，Aurora DNS区域使用5秒的短生存时间(TTL).确保您的网络和客户端配置不会进一步增加DNS缓存TTL

Unless you use a smart database driver, you depend on DNS record updates and DNS propagation for failovers, instance scaling, and load balancing across Aurora Replicas. Currently, Aurora DNS zones use a short Time-To-Live (TTL) of 5 seconds. Ensure that your network and client configurations don’t further increase the DNS cache TTL

Aurora的群集和读取器端点抽象了角色更改(主实例升级/降级)和拓扑更改(添加和删除实例)发生在数据库集群中

Aurora's cluster and reader endpoints abstract the role changes (primary instance promotion/demotion) and topology changes (addition and removal of instances) occurring in the DB cluster

我希望这会有所帮助！

这篇关于连接到AWS Aurora集群时偶尔出现``名称解析暂时失败''的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

连接到AWS Aurora集群时偶尔出现``名称解析暂时失败'' [英] Occasional 'temporary failure in name resolution' while connecting to AWS Aurora cluster

问题描述

推荐答案

不是解决方案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

连接到AWS Aurora集群时偶尔出现``名称解析暂时失败'' [英] Occasional &#39;temporary failure in name resolution&#39; while connecting to AWS Aurora cluster

问题描述

推荐答案

不是解决方案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

连接到AWS Aurora集群时偶尔出现``名称解析暂时失败'' [英] Occasional 'temporary failure in name resolution' while connecting to AWS Aurora cluster

登录关闭