我一直试图弄清楚这些说明到底发生了什么,但无法理解它们。我可以看到 PSHUFLW 指令作用于 XMM 寄存器的前 16 个字节,但无法弄清楚。我已阅读英特尔® 架构指令集扩展编程参考中的描述,但似乎无法理解。我是一个非常直观的人,所以任何帮助将不胜感激。我已经使用代码运行了一些测试
xxm0 = 00000000000000000000000000003E2D
PSHUFLW xmm0, xmm0, N
并有以下结果
N = 0, output = 00000000000000003E2D3E2D3E2D3E2D
N = 1, output = 00000000000000003E2D3E2D3E2D0000
N = 2, output = 00000000000000003E2D3E2D3E2D0000
N = 3, output = 00000000000000003E2D3E2D3E2D0000
N = 4, output = 00000000000000003E2D3E2D00003E2D
N = 5, output = 00000000000000003E2D3E2D00000000
N = 6, output = 00000000000000003E2D3E2D00000000
N = 7, output = 00000000000000003E2D3E2D00000000
N = 8, output = 00000000000000003E2D3E2D00003E2D
N = 9, output = 00000000000000003E2D3E2D00000000
N = 10, output = 00000000000000003E2D00003E2D3E2D
N = 11, output = 00000000000000003E2D00003E2D0000
N = 12, output = 00000000000000003E2D00003E2D0000
N = 13, output = 00000000000000003E2D00003E2D0000
N = 14, output = 00000000000000003E2D000000003E2D
N = 15, output = 00000000000000003E2D000000000000
N = 16, output = 00000000000000003E2D000000000000
N = 17, output = 00000000000000003E2D000000000000
N = 18, output = 00000000000000003E2D000000003E2D
N = 19, output = 00000000000000003E2D000000000000
N = 20, output = 00000000000000003E2D00003E2D3E2D
我想知道这些说明是如何工作的,也许还有一个与此类似的未来说明的视觉指南。我只倒车了几个星期,所以我很新鲜。感谢您的任何帮助,您可以提供。