compiler-optimization Archives

C++, Programming

IT Nursery

Why can lambdas be better optimized by the compiler than plain functions?

In his book The C++ Standard Library (Second Edition), Nicolai Josuttis states that lambdas can be better optimized by the compiler than plain ...

June 3, 2022

gcc, Programming

How to see which flags -march=native will activate?

I’m compiling my C++ app using GCC 4.3. Instead of manually selecting the optimization flags I’m using -march=native, which in theory should add ...

June 3, 2022

C++, Programming

What is &&& operation in C

#include <stdio.h> volatile int i; int main() { int c; for (i = 0; i < 3; i++) { c = i &&& ...

June 1, 2022

How to compile Tensorflow with SSE4.2 and AVX instructions?

This is the message received from running a script to check if Tensorflow is working: I tensorflow/stream_executor/dso_loader.cc:125] successfully opened CUDA library libcublas.so.8.0 locally ...

May 16, 2022

Why does the Rust compiler not optimize code assuming that two mutable references cannot alias?

As far as I know, reference/pointer aliasing can hinder the compiler’s ability to generate optimized code, since they must ensure the generated binary ...

May 10, 2022

Why do we use volatile keyword? [duplicate]

I have never used it but I ...

May 5, 2022

Why does GCC generate 15-20% faster code if I optimize for size instead of speed?

I first noticed in 2009 that GCC (at least on my projects and on my machines) has a tendency to generate noticeably faster ...

May 3, 2022

Swift Beta performance: sorting arrays

I was implementing an algorithm in Swift Beta and noticed that the performance was very poor. After digging deeper I realized that one ...

April 16, 2022

Replacing a 32-bit loop counter with 64-bit introduces crazy performance deviations with _mm_popcnt_u64 on Intel CPUs

I was looking for the fastest way to popcount large arrays of data. I encountered a very weird effect: Changing the loop variable ...

April 12, 2022

Why doesn’t GCC optimize a*a*a*a*a*a to (a*a*a)*(a*a*a)?

I am doing some numerical optimization on a scientific application. One thing I noticed is that GCC will optimize the call pow(a,2) by ...

April 10, 2022