User:Bellezzasolo/CFS

From OSDev Wiki
Jump to navigation Jump to search
Filesystems
Virtual Filesystems

VFS

Disk Filesystems
CD/DVD Filesystems
Network Filesystems
Flash Filesystems

Chai File System (CFS)

CFS64

Introduction

CFS64 is my file system (Bellezzasolo's). It uses 64 bit addressing, which is the 64. It currently has a (slightly buggy) Win32 formatter and a grub module.
It is currently at version 1.01
I intend to create CFS64 2.0. As a result, these fields will be changing. I intend to improve the data structures used for access performance. Furthermore, Journaling is planned. I am also considering multiple-disk volumes.
Note: all fields are little-endian

Github repository: CFS64
CFS64.jpg

Features

  • User security
  • Flags
  • Inode style tables
  • Runtime-variable size
  • Tree-based

Limitations

There are few limitations. Here are the only ones:

  • One is the 64 bit addressing, which will (one day) be changed in CFS128; but for now it's plenty
  • File and Directory Name lengths.

If you spot any more, add them here

Structure

CFS64 is structured in the follwing way:

  • MASTER Table - logical sector 0 of the partition. Bootloader and essential data
  • Cluster Availability Table - a bit representing every cluster on the partition, in logical order
  • FS Table - CAT for fs entries. i.e. cluster is partially used, not wholly.
  • Entries - unordered entries, which point to respective parts. Will need defragmenting from time to time. No dedicated zone for file data, so no limits.

For error checking multiple copies of the tables can be kept

Detail

MASTER Table

Name Offset (from the start of the MASTER Table) Length (in bytes) Meaning
Jump 0 2 A jump to the bootloader if bootable, 0 otherwise. Normally disassembles to JMP SHORT (0x)3F
Creation Date 2 8 The creation time and date, encoded in TeraTime. TeraTime is Unix time, except the epoch is 1st Jan 2011
Creator ID 10 4 The creator's ID. This is normally 0, but could be set to a range of values for future use
Disk Size (Clusters) 14 8 The number of clusters on the partition. Need I say any more?
Bytes per Cluster 22 2 The number of bytes in a cluster. Could be the same as the disk, but then again might not be :)
Number of Tables 24 1 The number of Copies of the tables. 1 if no backup, another number for backups. 0 is invalid
Major Version 25 2 The major FS version. Can be any 2 byte value
Minor Version 27 1 The minor FS version. Is the .xy after the major version. Can only be between 0 and 99 inclusive
FS Table Pointer 28 8 The first cluster of the FS Table. This is a Cluster Availability Table for FS entries.
Root Table Pointer 36 8 The cluster where the root directory's file entry resides. It will be at offset 0
FS ID 44 8 The FS string. Only "CFS64 " is acceptable
Volume Label 52 11 The volume label. 11 characters only (ASCII)
Disks in Volume 63 2 The number of partitions making up this logical volume. Add one to this value. TFS64 2.0+
Disk index 65 2 The unique identifier of this disk within the volume. TFS64 2.0+
Bootloader 67 (or 0x43) 443 The bootloader. Duh! :)
Boot Signature 510 2 The old 0xAA55

Cluster Availability Table (CAT)

This is just an array of bits. Each bit represents a cluster on the partition. If it's set to 1, the custer is used. Otherwise it's free. The bit's offset from the start of the CAT is the cluster at that offset from the start of the partition. FS data is included (even itself!).

Offset (bytes) Clusters
0 0-7
1 8-15
10000 80000-80007
10001 80008-80015

And in each byte:

Byte n
Bit 7 6 5 4 3 2 1 0
Cluster n*8 + 7 n*8 + 6 n*8 + 5 n*8 + 4 n*8 + 3 n*8 + 2 n*8 + 1 n*8 + 0

FS Pointer

A FSPointer is a structure that addresses filesystem objects.

Name Offset Size Meaning
Cluster 0 8 The cluster on the disk of this object
Byte 8 2 The byte offset within the cluster of the object

FSPointerEx

A FSPointer is a structure that addresses filesystem objects. The Ex version is designed to handle multiple disks.

Name Offset Size Meaning
Disk 0 2 The disk identifier for the disk on which described FS object resides.
Cluster 2 8 The cluster on the disk of this object
Byte 10 2 The byte offset within the cluster of the object

File Entry

A file entry describes a filesystem object such as a directory or a file.

Name Offset Size Meaning
Type 0 1 The type of entry this is. 0xD1 - Directory, 0xF1 - file
Checksum 1 1 A CRC32 checksum of the file entry
File Name 2 482 The UTF-16 filename of the object. 241 characters.
Size 484 8 The size of the object - bytes for a file, entries for a directory.
Attributes 492 10 An FSPointer to attributes inode
Entries 502 10 An FSPointer to respective inode (file or directory)

Directory Inode

The directory inodes are pointed to by a directory entry. They are similar to Ext inodes in function. The structure is as follows:

Name Offset Size Meaning
Type 0 1 A type field. 0xD0 = Directory inode
Checksum 1 1 A CRC32 checksum of the inode.
Entries 2 490 Array of 49 FSPointers, point to file entries of directory contents
Backlink 492 10 FSPointer back to previous element (either inode or entry)
Next Inode 502 10 FSPointer to next inode if needed (cluster is zero if unused)

DirectoryEx Inode

The Ex version supports CFS64 2.0 features. The structure is as follows:

Name Offset Size Meaning
Type 0 1 A type field. 0xD2 = DirectoryEx inode
Checksum 1 1 A CRC32 checksum of the inode.
Entries 2 480 Array of 40 FSPointerExs, point to file entries of directory contents
Reference Count 4 482 Reference count for this inode - traditional hard links
Reserved 2 486 Reserved for future use. Set to 0
Backlink 488 12 FSPointerEx back to previous element (either inode or entry)
Next Inode 500 12 FSPointerEx to next inode if needed (cluster is zero if unused)

File Inode

The file entry is pointeed to by the file table like an Ext inode. The structure is as follows:

Name Offset Size Meaning
Type 0 1 0xF0 = File Inode
Checksum 1 1 A CRC32 checksum of the inode
Clusters 2 488 An array of 61 cluster addresses.
Backlink 490 10 An FSPointer to the previous inode in this chain (or the file entry if the first inode)
Next 500 10 An FSPointer to the next inode. CLuster is 0 if this is the last inode.
Reserved 510 2 Reserved, must be 0

Atttributes Inode

This contains file metadata, including timestamps and security.

Name Offset Size Meaning
Type 0 1 0xA0 for an attributes inode
Checksum 1 2 A CRC32 checksum of the inode
Owner 2 16 The UUID of the owner.
Created 18 8 Teratime creation timestamp
Modified 26 8 Teratime last modification timestamp
Accessed 34 8 Teratime last accessed timestamp
Flags 42 1 File flags - see later for description
Security Entries 43 432 An array of 24 security entries - see next table
Security Inode 475 10 An FSPointer to a security inode, if more entries are required.
Extended Attributes 485 10 An FSPointer to the xattr inode, if present
Backlink 495 10 Points back to file entry
Reference Count 505 4 CFS64 2.0+ - used to keep track of hard links to a file.
Reserved 509 3 Padding bytes, must be 0. May be used in future versions.

Security Entry

Name Offset Size Meaning
UUID 0 16 The UUID of the user or group this relates to
Group 16 1 1 if the UUID is a group, 0 if a user.
Permissions 17 1 Bit 0: r, Bit 1: w, Bit 2: x. Unix-style permissions. other bits reserved.

Security Inode

More security entries if required

Name Offset Size Meaning
Type 0 1 0x50 For Security inode
Checksum 1 1 CRC32 checksum of inode
Entries 2 486 27 security entries
Backlink 488 10 FSPointer to previous inode
Next 498 10 FSPointer to next inode - cluster is 0 if this is the last.
Reserved 508 4 Reserved, may be used in future. Must be 0 for now.

xattr Entry

Name Offset Size Meaning
Type 0 1 0xA8 for xattr table
Checksum 1 2 CRC32 Checksum of the inode
Name 2 256 256 chars for the name (UTF-8). Not UTF-16 as this is internal - the name of the attribute.
Data 258 224 224 Bytes of the xattr data - format is dependent on name.
Inode 482 10 FSPointer to the next inode, if required for the xattr. Cluster is 0 if unused.
Backlink 492 10 Backlink to previous xattr entry, or attributes inode.
Next Entry 502 10 The next xattr: THIS IS A DIFFERENT ATTRIBUTE. The next inode is in the inode field.

xattr Inode

Name Offset Size Meaning
Type 0 1 0xAF for xattr inode
Checksum 1 2 CRC32 Checksum of the inode
Data 2 490 490 bytes of xattr data.
Backlink 492 10 FSPointer to previous inode or entry
Next 502 10 Points to next inode. Cluster is 0 if this is the last inode.

CF64 v2.0 features

Overview

CFS64 v2.0 adds support for some additional features. In particular, it supports journaling, virtual volumes spread across multiple partitions, and support for hard links and symbolic links. Data storage now supports extents and indexes. In particular, it does this in a manner compatible with CFS64 v1, by making use of the xattr mechanism regularly. A non-bootable CFS64 v1 partition can be mounted as v2 without any changes. If a CFS64 v2 partition doesn't use the new inode structures, it could be mounted as a read only v1 disk. Indeed, harder links will function correctly in CFS64 v1 read-only mode.

Multiple Disk Metadata

Metadata regarding multiple disk volumes is stored as xattr's on the root directory. Each disk will have its own root directory, merged into one under the virtual volume. As the root directory exists, the xattr list of disks "DiskVolumeSet" can be stored. This consists of a flat list of the disks, with length as described in the MASTER table.

Name Offset Size Meaning
Disk Index 0 2 The unique index for the disk within this volume
Disk ID 2 16 The UUID of the disk

Hard Links - detail

Hard links are implemented by pointing a directory entry to the file entry for the file/directory to be hard linked. This differs from traditional hard links, which point to the inode. Should the traditional mode be used, the extended inode must be used. The side effect of native hard links (harder links) is that, should the file have its name changed, this will be reflected in the original directory, and also the linked directories. Note that this means consistency checks should be made - the driver should check that there is not a name conflict in any directory. Although there is no limitation of the FS against files with the same name, this is dangerous territory.

Journaling

CFS64 stores a journal file as an xattr ("FileSystemJournal") of the root directory. The structure is as follows:

Header
Name Offset Size Meaning
Mode & flags 0 1 The journaling mode. 0 = disabled, 1 = metadata-only, 2 = complete
Checksum 1 1 A CRC32 checksum of the header
Entries 2 8 The number of transactions recorded in the journal.
Entry
Name Offset Size Meaning
Type 0 1 The type of write. 0 = availability bitmap, 1 = file system structure, 2 = block write.
Checksum 1 1 A CRC32 checksum of the entry - disregard if invalid. Transaction was not completed.
Address 2 8/10 The address of the write. If type=0, this is the block represented. If type=1, this is FSPointer. If type=2, this is block address.
Data 10/12 1/512/4096 The data to be written. Type=0, bit0=CAT value, bit1=FSTable value. Rest reserved. Type=1, this is file system data structure. Type=2, this is the block value.

C structures

 typedef uint64_t teratime;
 typedef uint16_t wchar_t;
 typedef uint16_t char16_t;

//MASTER table
struct CFS64_Master {
	uint8_t bldrjmp[2];		//JMP +0x40 (0x42)
	teratime creation;
	uint32_t creator;
	uint64_t disksize;		//Clusters
	uint16_t bytespercluster;
	uint8_t tablecopies;
	uint16_t majorversion;
	uint8_t minorversion;
	uint64_t FStabpointer;
	uint64_t roottablepointer;
	char FSID[8];			//CFS64
	char16_t vollabel[11];
	uint8_t bootloader[436];
	uint16_t bootsig;		//0xAA55
}__attribute__ ((packed));

enum CreatorID {
	CREATOR_ID_CHAIOS,
	CREATOR_ID_WINFORMATTER,	//us
	CREATOR_ID_LINUX,
	CREATOR_ID_HANDCODED = 0xCAFEBABE,
	CREATOR_ID_THIRD_PARTY = 0xDEADBEEF,
	CREATOR_ID_INCOMPATIBLE = 0xFFFFFFFF
};

//File tables
struct FSPointer {
	uint64_t Cluster;
	uint16_t byte;
}__attribute__ ((packed));

struct CFS64_FileEntry {
	uint8_t type;		//0xF1 for file, 0xD1 for directory,... 0 for free
	uint8_t checksum;
	wchar_t fileName[241];
	uint64_t size;
	struct FSPointer attrbiutes;
	struct FSPointer entries;
}__attribute__ ((packed));

enum FileEntryType {
	SecurityInode = 0x50,
	DirectoryInode = 0xD0,
	DirectoryEntry = 0xD1,
	AttributeInode = 0xA0,
	ExtendedAttributeHeader = 0xA8,
	ExtendedAttributeInode = 0xAF,
	FileInode = 0xF0,
	FileEntry = 0xF1
};

struct CFS64_FileInode {
	uint8_t type;		//0xF1 for directory
	uint8_t checksum;
	uint64_t clusters[61];
	struct FSPointer backlink;
	struct FSPointer nextInode;
	uint16_t reserved;
}__attribute__ ((packed));

struct CFS64_DirInode {
	uint8_t type;		//0xD1 for directory
	uint8_t checksum;
	struct FSPointer entries[49];
	struct FSPointer backlink;
	struct FSPointer nextInode;
}__attribute__ ((packed));

//Now attributes

struct UUID {
    uint32_t Data1;
    uint16_t Data2;
    uint16_t Data3;
    uint8_t Data4[ 8 ];
}__attribute__ ((packed));

struct CFS64_SecurityEntry {
	struct UUID userID;
	uint8_t isgroup;
	struct {
		uint8_t canRead : 1;
		uint8_t canWrite : 1;
		uint8_t canExecute : 1;
	}permissions;
}__attribute__ ((packed));

struct CFS64_Flags {
	uint8_t DeleteOnly : 1;
	uint8_t Hidden : 1;
	uint8_t ContentsHidden : 1;
	uint8_t NonPrivelegedRead : 1;
	uint8_t ModifyOnly : 1;
	uint8_t NonPrivilegedWrite : 1;
	uint8_t NonPrivilegedExecute : 1;
	uint8_t SystemFile : 1;
}__attribute__ ((packed));

struct CFS64_Attributes {
	uint8_t type;
	uint8_t checksum;
	struct UUID ownerID;
	teratime created;
	teratime modified;
	teratime accessed;
	struct CFS64_Flags flags;
	struct CFS64_SecurityEntry securityentry[24];
	struct FSPointer securityInode;
	struct FSPointer xattr;
	struct FSPointer backlink;
	uint8_t reserved[7];
}__attribute__ ((packed));

struct CFS64_Security {
	uint8_t type;
	uint8_t checksum;
	struct CFS64_SecurityEntry entries[27];
	struct FSPointer backlink;
	struct FSPointer nextentry;
	uint32_t reserved;
}__attribute__ ((packed));

struct CFS64_xattr_entry {
	uint8_t type;
	uint8_t checksum;
	char name[256];		//type of attribute, so we don't need unicode
	uint8_t data[224];
	struct FSPointer inode;
	struct FSPointer backlink;
	struct FSPointer nextentry;
}__attribute__ ((packed));

struct CFS64_xattr_inode {
	uint8_t type;
	uint8_t checksum;
	uint8_t data[490];
	struct FSPointer backlink;
	struct FSPointer nextentry;
}__attribute__ ((packed));

Inodes

The "Inode entries" referenced to before are pointers to clusters. For directory inodes these reference both cluster and byte, and therefore are 10 bytes long (64bit cluster and 16 bit byte references). File inodes only need cluster references, and therefore the entries are 8 bytes long.

Indirection

Inodes point to other inodes as a doubly linked list, with backlinks and forward links. This means no size limits like ext2 - but seek times are greater. The structure is optimized for sequential reading.

Unused Inodes

Unused inodes are set to 0.

Special Fields

Here are decriptions of the bit-flags fields:

Flags

Flags are the general attributes, such as read-only, hidden and similar.

7 6 5 4 3 2 1 0
SF NX NW MO NR CH HI DO

Meanings:

  • DO: Delete Only (Read only field modify protect)
  • HI: Hidden
  • CH: Contents Hidden (dir-only)
  • NR: Non-Privileged Read - read ability for people without permissions
  • MO: Modify Only (Read-only deletion protect)
  • NW: Non-Privileged Write - write version of NR
  • NX: Non-Privileged Execute - execute version of NW
  • SF: System File - ultimate protection. Cannot be changed under ChaiOS (my, Bellezzasolo's OS). Drivers should be installed to a special directory.
Flags that apply to all users
  • DO
  • HI
  • CH
  • MO
  • SF
Other flag's behaviour

The NR,NW and NX flags only apply to non-privileged users, i.e those who don't have their permissions bit set.

Project Status

Implemented for grub, however in active development. Here are the components required and their status:

  • Formatter - Buggy (Win32)
  • Grub module - Slightly buggy
  • Driver - Not Started. Waiting for HDD and formatter