如果浮点数是16字节对齐的，是否可以将浮点数直接转换为m128？ [英] Is it possible to cast floats directly to m128 if they are 16 byte aligned?

查看：130 发布时间：2020/6/3 22:15:54 c++ c alignment sse intrinsics

本文介绍了如果浮点数是16字节对齐的，是否可以将浮点数直接转换为__m128？的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

如果浮点数是16字节对齐的，将浮点数直接转换为 __ m128 是否安全/可行/建议？

Is it safe/possible/advisable to cast floats directly to __m128 if they are 16 byte aligned?

我注意到使用 _mm_load_ps 和 _mm_store_ps 来包装原始阵列会增加大量开销。

I noticed using _mm_load_ps and _mm_store_ps to "wrap" a raw array adds a significant overhead.

我应该注意哪些潜在的陷阱？

What are potential pitfalls I should be aware of?

编辑：

使用加载和存储指令实际上没有开销，我混合了一些数字，这就是为什么我可以获得更好的性能的原因。即使您能够在 __ m128 实例中使用原始内存地址进行一些令人讨厌的处理，当我运行测试时，它也需要花费TWICE AS LONG来完成，而没有 _mm_load_ps 指令，可能会退回到某些故障安全代码路径。

There is actually no overhead in using the load and store instructions, I got some numbers mixed and that is why I got better performance. Even thou I was able to do some HORRENDOUS mangling with raw memory addresses in a __m128 instance, when I ran the test it took TWICE AS LONG to complete without the _mm_load_ps instruction, probably falling back to some fail safe code path.

如果浮点数是16字节对齐的，是否可以将浮点数直接转换为m128？ [英] Is it possible to cast floats directly to m128 if they are 16 byte aligned?

问题描述

推荐答案

相关文章

C/C++开发最新文章

热门教程

热门工具

登录关闭

如果浮点数是16字节对齐的，是否可以将浮点数直接转换为__m128？ [英] Is it possible to cast floats directly to __m128 if they are 16 byte aligned?

问题描述

推荐答案

相关文章

C/C++开发最新文章

热门教程

热门工具

登录 关闭

如果浮点数是16字节对齐的，是否可以将浮点数直接转换为m128？ [英] Is it possible to cast floats directly to m128 if they are 16 byte aligned?

登录关闭