GCC only does this extra stack alignment in main
; that function is special. You won't see it if you look at code-gen for any other function, unless you have a local with alignas(32)
or something.
GCC is just taking a defensive approach with -m32
, by not assuming that main
is called with a properly 16B-aligned stack. Or this special treatment is left over from when -mpreferred-stack-boundary=4
was only a good idea, not the law.
The i386 System V ABI has guaranteed/required for years that ESP+4 is 16B-aligned on entry to a function. (i.e. ESP must be 16B-aligned before a CALL instruction, so args on the stack start at a 16B boundary. This is the same as for x86-64 System V.)
The ABI also guarantees that new 32-bit processes start with ESP aligned on a 16B boundary (e.g. at _start
, the ELF entry point, where ESP points at argc, not a return address), and the glibc CRT code maintains that alignment.
As far as the calling convention is concerned, EBP is just another call-preserved register. But yes, compiler output with -fno-omit-frame-pointer
does take care to push ebp
before other call-preserved registers (like EBX) so the saved EBP values form a linked list. (Because it also does the mov ebp, esp
part of setting up a frame pointer after that push.)
Perhaps gcc is defensive because an extremely ancient Linux kernel (from before that revision to the i386 ABI, when the required alignment was only 4B) could violate that assumption, and it's only an extra couple instructions that run once in the life-time of the process (assuming the program doesn't call main
recursively).
Unlike gcc, clang assumes the stack is properly aligned on entry to main. (clang also assumes that narrow args have been sign or zero-extended to 32 bits, even though the current ABI revision doesn't specify that behaviour (yet). gcc and clang both emit code that does in the caller side, but only clang depends on it in the callee. This happens in 64-bit code, but I didn't check 32-bit.)
Look at compiler output on http://gcc.godbolt.org/ for main and functions other than main if you're curious.
I just updated the ABI links in the x86 tag wiki the other day. http://x86-64.org/ is still dead and seems to be not coming back, so I updated the System V links to point to the PDFs of the current revision in HJ Lu's github repo, and his page with links.
Note that the last version on SCO's site is not the current revision, and doesn't include the 16B-stack-alignment requirement.
I think some BSD versions still don't require / maintain 16-byte stack alignment.