用于数据分析的特定Postgresql服务器配置 [英] Specific postgresql server configuration for data analysis purposes
问题描述
如果您将Postgresql数据库专门用于数据科学部门和数据分析,是否有使用postgresql.conf文件调整服务器性能的技巧?还是性能调整本身是与目的无关的,并且您将对它进行任何操作都没有真正的区别,因为一切都与提取数据有关?
Is there any tips on tuning server's performance using postgresql.conf file in case you use a postgresql database specifically for data science department and data analysis purposes? Or performance tuning itself is purpose-agnostic and there is no real difference what you will do with it since 'it is all about extracting data'?
这是一个相当晦涩的问题我没有找到答案(在关于数据科学的大量文章中)。
It's a rather obscure question i didn't find an answer for (in miriads of articles on data science topic).
推荐答案
尽管这是一个非常笼统的问题问题,我会尽力给您一个或两个提示:
Though this is a very general question, I'll try my best to give you a hint or two:
您可以先评估需求的轮廓,例如:
You could first asses the outlines of your requirements, such as:
- 我们在谈论大数据块吗? (缓冲区大小)
- 从多少个客户端执行查询? (允许连接)
- 您是否正在使用postgresql的内部函数?
- 您是否需要永久备份或复制表或数据库周围的数据?
- 等,等等..
- are we talking about big-data chunks? (buffer-sizes)
- from how many clients are queries performed? (allowed connections)
- are you using postgresql's internal functions?
- do you need permanent backups or copy tables or db's around?
- etc.,etc.,..
我建议您阅读< a href = https://www.postgresql.org/docs/9.6/static/runtime-config-resource.html rel = nofollow noreferrer>资源消耗和查询计划以及服务器配置
。
I would recommend you read the article from the official docs on resource consumption and the docs on query-planning as well as server configuration in general.
如果您在阅读文档后仍无法得出正确的方法,我可以推荐 pg论坛。经验丰富的用户 akretschmer是postgresql-pro,如果您以详细而有意义的方式提出问题,可能会为您提供帮助;)
In case you can't derive a proper approach after reading the docs I can recommend the pg-forum. The experienced user 'akretschmer' is a postgresql-pro and might be able to help you if you formulate your question in a detailed and meaningful way ;)
这篇关于用于数据分析的特定Postgresql服务器配置的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!