What is the /d2vzeroupper MSVC compiler optimization flag doing? I was reading through this Compiler Options Quick Reference Guide for EPYC CPUs from AMD: https://developer.amd.com/wordpress/media/2020/04/Compiler%20Options%20Quick%20Ref%20Guide%20for%20AMD%20EPYC%207xx2%20Series%20Processors.pdf For MSVC, to "Optimize for 64-bit AMD processors", they recommend enabling /favor:AMD64 /d2vzeroupper. What /favor:AMD64 does is clear; it is documented in the MSVC docs. But ..
I was introduced to SIMD instruction sets just recently and, as one of my pet projects, thought about using them to implement memcpy and see if it performs better than the standard memcpy. What I observe is that the standard memcpy always performs better than my SIMD-based custom memcpy. I expected SIMD to have some advantage here. ..
I am vectorizing code that uses more than 50 AVX2 and SSE instructions overall, including gather, shuffle, pack, unpack, extract, cast, etc. By profiling, I noticed that a single call to _mm256_permute4x64_epi64 makes the code 10 times slower, while the rest of the code performs well. The architecture that I am testing on ..
I have an application, developed in C++ in Visual Studio on Windows, with an Intel CPU. This application is in use on multiple machines, at multiple locations, all with Intel CPUs. Lately, it was installed on a PC with an AMD CPU. On the AMD machine, in a certain function, the application freezes and crashes. ..
I am trying to install Windows on an ASUS TUF A15 laptop with an AMD card. https://eshop.asus.com/es-ES/portatil-asus-tuf-gaming-a15-fa506iv-hn337.html?___store=eur_es_es&fee=55&fep=9142&gclid=Cj0KCQjwhvf6BRCkARIsAGl1GGg8Fj84Gc7h2_31YvkfFnUvvhKss9GRTPvGByx1avDWdS2lXF9UBH0aAkt6EALw_wcB The issue is that in the BIOS the computer says no storage device is present, and I can't find any option for CSM either. When trying to install, I get "We couldn’t find any drivers, to get storage ..