Creating a 64-bit kernel
Difficulty level |
---|
Medium |
Kernel Designs |
---|
Models |
Other Concepts |
This page or section is a work in progress and may thus be incomplete. Its content may be changed in the near future. |
The factual accuracy of this article or section is disputed. |
Prerequisites
Make sure that you have the following done before proceeding:
- Have built a cross-compiler for the x86_64-elf target.
- Read up on long mode and how to initialize/use it (if you intend on not using a 64-bit aware bootloader or rolling your own).
- Decide now on how to load your kernel - your own bootloader, GRUB (with separate loader executable), GRUB2 (elf64 + 32-bit bootstrap code), or a 64-bit capable bootloader such as Limine or BOOTBOOT.
The Main Kernel
The kernel should run in a uniform environment. Let's make this simple for now...
kernel.c
void kernel_main(void)
{
/* What goes here is up to you */
}
Compiling
Compile each source file like any piece of C code, just remember to use the cross-compiler and the proper options. Linking will be done later...
x86_64-elf-gcc -ffreestanding -mcmodel=large -mno-red-zone -mno-mmx -mno-sse -mno-sse2 -c foo.c -o foo.o
The -mcmodel=large argument enables us to run the kernel at any 64-bit virtual memory address we want. In fact, using the 'large' code model is discouraged due to its inefficiency, but it can be fine as a start. Check the SysV AMD64 ABI document for extra details.
You will need to instruct GCC not to use the the AMD64 ABI 128-byte 'red zone', which resides below the stack pointer, or your kernel will be interrupt unsafe. Check this thread on the forums for extra context.
We disable SSE floating point ops. They need special %cr0 and %cr4 setup that we're not ready for. Otherwise, several #UD and #NM exceptions will be triggered.
Linking
The kernel will be linked as an x86_64 executable, to run at a virtual higher-half address. We use a linker script:
link.ld
ENTRY(_start) SECTIONS { . = KERNEL_VMA; .text : AT(ADDR(.text) - KERNEL_VMA) { _code = .; *(.text) *(.rodata*) . = ALIGN(4096); } .data : AT(ADDR(.data) - KERNEL_VMA) { _data = .; *(.data) . = ALIGN(4096); } .eh_frame : AT(ADDR(.eh_frame) - KERNEL_VMA) { _ehframe = .; *(.eh_frame) . = ALIGN(4096); } .bss : AT(ADDR(.bss) - KERNEL_VMA) { _bss = .; *(.bss) /* * You usually need to include generated COMMON symbols * under kernel BSS section or use gcc's -fno-common */ *(COMMON) . = ALIGN(4096); } _end = .; /DISCARD/ : { *(.comment) } }
Feel free to edit this linker script to suit your needs. Set ENTRY(...) to your entry function, and KERNEL_VMA to your base virtual address.
You can link the kernel like this:
x86_64-elf-gcc -ffreestanding <other options> -T <linker script> <all object files> -o <kernel executable> -nostdlib -lgcc
Note: Obviously there is no bootstrap assembly yet, which is the hard part of starting out, and you can't link without it.
Loading
Before you can actually use your kernel, you need to deal with the hard job of loading it. Here are your four options:
With your own boot loader
This method is the simplest (since you write all the code), though it requires the most work.
I won't give any code, but the basic outline is:
- Set up a stable environment
- Do Protected Mode readying stuff (GDT, IDT, A20 gate, etc.)
- Enter Protected Mode (or skip this step and enter long mode directly)
- Parse kernel ELF headers (if kernel is separate from executable)
- Set up Long Mode readying stuff (PAE, PML4, etc.) - Remember to set up the higher-half addressing!
- Enter Long Mode by far jump to the kernel entry point in (virtual) memory
With a 64 bit aware loader
Open Source boot loaders written for long mode kernels already exist, you don't have to reinvent the wheel. Unlike GRUB (which does not support switching to long mode), bootloaders such as Limine or BOOTBOOT can load your 64-bit kernel directly by doing all the things listed in the previous section (and more). It saves you the struggle to write and properly link bootstrap code or to implement your own boot loader entirely from scratch. Therefore using a 64-bit aware bootloader is a nice and easy, reasonably bullet-proof, choice for beginners.
With legacy GRUB
Note: The advise in this section is bit questionable in its current form. See Creating a 64-bit kernel using a separate loader
This requires the use of GRUB or another multiboot1-compliant loader. This may be the most error free of the four, but creating a multiboot-compatible kernel properly has its own set of pitfalls.
A quick rundown:
- Set up a stable environment
- Read the multiboot information struct to see where GRUB loaded your kernel (look at the module section)
- Parse kernel ELF headers
- Set up Long Mode readying stuff (PAE, PML4, etc.) - Remember to set up the higher-half addressing!
- Enter Long Mode by far jump to the kernel entry point
Note that this code has to be stored in a elf32 format and must contain the multiboot1-header.
Also remember to set the text section to start at 0x100000 (-Ttext 0x100000) when linking your loader.
Set up GRUB to boot your loader as a kernel in its own right, and your actual kernel as a module. Something like this in menu.lst:
title My Kernel kernel --type=multiboot <loader executable> module <kernel executable>
With a 32-bit bootstrap in your kernel
This requires the use of any ELF64-compatible loader that loads into protected-mode (GRUB2, or patched GRUB Legacy). This may be the simplest in the long run, but is more difficult to set up. Note that GRUB2, which implements Multiboot 2, does not support switching into long mode.
First, create an assembly file like the following, which will set up virtual addressing and long mode:
bootstrap.S
.section .text .code32 multiboot_header: (only needed if you're using multiboot) bootstrap: (32-bit to 64-bit code goes here) (jump to 64-bit code)
Then, add the following to your original linker file:
link.ld
... ENTRY(bootstrap) ... SECTIONS { . = KERNEL_LMA; .bootstrap : { <path of bootstrap object> (.text) } . += KERNEL_VMA; .text : AT(ADDR(.text) - KERNEL_VMA) { _code = .; *(EXCLUDE_FILE(*<path of bootstrap object>) .text) *(.rodata*) . = ALIGN(4096); } ...
The above edits allow the linker to link the bootstrap code with physical addressing, as virtual addressing is set up by the bootstrap. Note that in this case, KERNEL_VMA will be equivalent to 0x0, meaning that text would have a virtual address at KERNEL_LMA + KERNEL_VMA instead of just at KERNEL_VMA. Change '+=' to '=' and your bootstrap code if you do not want this behaviour.
Compile and link as usual, just remember to compile the bootstrap code as well!
Set up GRUB2 to boot your kernel (depends on your bootloader) with grub.cfg:
menuentry "My Kernel" { multiboot <kernel executable> }
With Visual C++
The technique for creating a 64 bit kernel with a 32 bit bootstrap is similar to GCC. You need to create an assembly bootstrap with nasm (masm may work, but the author uses nasm). Note that this stub must be assembled to a 64 bit object file (-f win64). Your stub then has a BITS 32 directive. Note that, although nasm will not complain about this, Microsoft link will. It complains about address relocations, due to the memory model settings (/LARGEADDRESSAWARE, which is required for /DRIVER). As such, you need a method of generating the correct 32 bit code, while fooling link into generating a 64 bit relocation. Here is a macro for you:
;Encode 32 bit moves without the ADDR32 issue
%macro mov_abs32 2
%if %1==eax
db 0xB8
%elif %1==ebx
db 0xBB
%elif %1==ecx
db 0xB9
%elif %1==edx
db 0xBA
%elif %1==edi
db 0xBF
%elif %1==esi
db 0xBE
%elif %1==ebp
db 0xBD
%elif %1==esp
db 0xBC
%else
%error "Unknown register"
%endif
dq %2+(0x90909090 << 32)
%endmacro
;This translates as mov %1, %2 NOP NOP NOP NOP
;Example usage:
;mov_abs32 eax, (gdtr-KADDR_OFFSET)
;lgdt [eax]
Possible Problems
You may experience some problems. Fix them immediately or risk spending a lot of time debugging later...
My kernel is way too big!
Try each of the following, in order:
- Try linking your kernel with the option "-z max-page-size=0x1000" to force the linker to use 4kb pages.
- Make sure you're compiling with the -nostdlib option (equivalent to passing the both -nodefaultlibs and -nostartfiles options).
- You can try changing the OUTPUT_FORMAT to elf64-little.
- Try cross-compiling the latest version of binutils and gcc.
Kernel Virtual Memory
(This section is based on notes by Travis Geiselbrecht (geist) at the osdev IRC channel)
Long mode provides essentially an infinite amount of address space. An interesting design decision is how to map and use the kernel address space. Linux approaches the problem by permanently mapping the -2GB virtual region 0xffffffff80000000 -> 0xffffffffffffffff to physical address 0x0 upwards. Kernel data structures, which are usually allocated by kmalloc() and the slab allocator, reside above the 0xffffffff80000000 virtual base and are allocate from the physical 0 -> 2GB zone. This necessitates the ability of 'zoning' the page allocator, asking the page allocator to returning a page frame from a specific region, and only from that region. If a physical address above 2GB needs to accessed, the kernel temporarily map it to its space in a temporary mapping space below the virtual addresses base. The Linux approach provides the advantage of not having to modify the page tables much which means less TLB shootdowns on an SMP system.
Another approach is to treat the kernel address space as any other address space and dynamically map its regions. This provides the advantage of simplifying the page allocator by avoiding the need of physical memory 'zones': all physical RAM is available for any part of the kernel. An example of this approach is mapping the kernel to 0xfffffff800000000 as usual. Below that virtual address you put a large mapping for the entire physical address space, and use the virtual 0xfffffff800000000 -> 0xffffffffffffffff region above kernel memory area as a temporary mappings space.
See Also
Articles
Forum Threads
- Creating a 64-bit Kernel Tutorial about this article
- Linker-script writers beware: COMMON Symbols on the obscure 'COMMON' symbols and their effect on BSS
- Long-mode Kernels and the AMD64 ABI 'Red Zone' on the 'red zone' and its major effect on interrupt handling
- Leaving long mode to protected mode
- Switching from long mode to compatibility mode