The NACHOS code you have been given is capable of executing user
application programs, but in an extremely limited way. In particular, at most
one user application process can run at a time, and the only system call that
has been implemented is the halt
system call that shuts down
NACHOS. In this assignment, you will correct some of these deficiencies and turn
NACHOS into a multiprogramming operating system with a working set of basic
system calls.
In the first part of this assignment, you are to implement the
Tasks
Fork()
, Yield()
, Exit()
,
Exec()
, and Join()
system calls that act as follows:
Fork(func)
system call creates a new user-level (child) process,
whose address space starts out as an exact copy of that of the caller (the
parent), but immediately the child abandons the program of the parent and
starts executing the function supplied by the single argument. Notice this
definition is slightly different from the one in the syscall.h file in Nachos.
Yield()
call is used by a process executing in user mode to
temporarily relinquish the CPU to another process.
Exit(int)
call takes a single argument, which is an integer
status value as in Unix. The currently executing process is terminated. For
now, you can just ignore the status value. Later you will figure out how to
get this value to an interested process.
Exec(filename)
system call spawns a new user-level thread
(process), but creates a new address space and begins executing a new program
given by the object code in the Nachos file whose name is supplied as an
argument to the call. It should return to the parent a SpaceId
which can be used to uniquely identify the newly created process.
Join()
call waits and returns only after a process with the
specified ID (supplied as an argument to that call) has finished.
In the second part of this assignment, you are to implement the file system
calls: Creat()
, Open()
, Read()
,
Write()
, and Close()
. The semantics of these calls are
specified in syscall.h
.
You should extend your file system implementations to handle the console as well
as normal files.
For this assignment, you will be working on
the version of NACHOS in the Test Programs
userprog
directory. You will also need to write some simple C test programs,
compile them using the MIPS cross compiler, and run them under NACHOS to test
that your modifications to NACHOS actually work.
In test
subdirectory of NACHOS, several sample test programs are provided.
The only one that will run properly on an unmodified NACHOS is the
halt program.
Additional background information is listed here.
In your test programs, you need to exercise the various
system calls while we also provide you the
sample test programs . Be sure to test each of the system calls, and to try
forking up to three processes (since each has a 1024 byte stack, that's all that
will fit in NACHOS' 4K byte physical memory) and have them yield back and forth
for awhile to make sure everything is working. Since the facility for I/O from
user program will also be implemented during this assignment, you may
initially have to rely on using debugging printout in the kernel to
track what is happening. Use the DEBUG
macro for this, and make
sure that debugging printout is disabled by default when you submit your code
for grading.
In order for us to see how your program works, some debugging information
must be added in your code. You should print out the following information: You have to provide a writeup in a file
Required Output Prints
System Call: [pid] invoked [call]
where [pid] is the
identifier of the process (SpaceID) and [call] is the name of one of the
system calls that you had to implement. Just give the name without parentheses
(e.g., Fork, Create, Exit).
Loaded
Program: [x] code | [y] data | [z] bss
where [x], [y] and [z] are
the sizes of the code, (initialized) data and bss (uninitialized) data
segments in bytes.
Process [pid] Fork: start at address [addr] with [numPage]
pages memory
where [pid] is the process identifier (SpaceID) of the
parent process, [addr] is the virtual address (in hexadecimal format) of the
function that the new (child) process starts to execute and [numPage] is the
number of pages that the new process gets.
Exec Program: [pid] loading [name]
where [pid] is the identifier of the parent process that
executes a new process and [name] is the name of the executable file that is
being loaded
Process [pid]
exits with [status]
where [pid] is the identifier of the exiting
process and [status] is the exit code.
Process [pid] Fork: start at address [addr] with [numPage] pages memory failed
Exec Program: [pid] loading [name] failed
What to
Submit
HW2_WRITEUP
in which you have to mention
which portions of the assignment you have been able to complete. And for the ones
that are incomplete, what is their current status. The
description of status for completed exercises or those that you
did not start should not be more than 1 line. However, when you
submit code that is not working properly (partial solution), make
sure that your documentation is very verbose so that the
TAs can understand what you have achieved and which parts are
missing. This is necessary to receive at least partial
credit. When you just submit something that does not work and give
no explanations, expect to receive no credit. Also include both group members' names.
Make sure that the writeup file is in the top level code directory.
You can turnin up to 3 times per project and not more than that!
The earlier versions will be discarded.
Note: only one turnin per group is accepted!
Here is an outline of the the major issues you will have to
deal with to make NACHOS into a multiprogrammed system:
Issues to
Consider
ExceptionHandler()
function in exception.cc
to determine which system call or exception occurred, and to transfer control
to an appropriate function. You might want to consider introducing ``stubs''
(functions with empty bodies or with debugging printout so you can tell when
they are called) for all the system calls right away, and then postpone their
actual implementation until a bit later. This strategy will help you
understand better how control is transferred from user mode to system mode
when a system call is executed.
AddrSpace::AddrSpace()
simply determines the amount of memory that will be required by the
application to be run and then allocates that much space contiguously starting
at address zero in physical memory. The page tables (which control the address
translation hardware) are set up so that the logical addresses (what the user
program sees) are identical to the physical addresses (where the data is
actually stored).
The above scheme is inadequate for running more than one application at a time. You will need to design and implement a scheme for allocating and freeing physical memory, and you will need to arrange to set up the page tables so that the logical address space seen by a user application is a contiguous region starting from address zero, even though the data is stored at different physical addresses. You will want to implement a memory management scheme that is flexible enough to extend to virtual memory later in the semester. We suggest implementing a C++ class with methods for allocating and freeing physical memory one page at a time. By setting up the page tables properly, you can give the user application a contiguous logical address space even though each page of actual data might be stored anywhere in physical memory.
Fork()
system call is the most difficult part of this assignment.
It is different from the system call Exec in that Fork will start a new
process that runs a user function specified by the argument of the call, while
Exec will start a process that runs a different executable file. The parameter
types for Fork()
and Exec()
also differ.
Fork(func)
takes an argument func
which is a pointer
to a function. The function must be compiled as part of the user program that
is currently running. By making this system call Fork(func)
, the
user program expects the following: a new thread will be generated for use by
the user program; and this thread will run func in an address space that is an
exact copy of the current one. This implementation of Fork makes it possible
to have and to access multiple entry points in an executable file.
To make the system call Fork(func)
work for the user program,
you will need to know how to find the entry point of the function that is
passed as the parameter. The parameter convention is determined by the
cross-compiler which produces executable code from the user source program.
Look at the file exception.cc to see that this entry point, which is an
address in the executable code's address space, is already loaded into
register 4 when the trap to the exception handler occurs. All you need to do
is to insert code into the exception handler (or call a new function of your
own) which does the following: set up an address space which is a copy of the
address space of the current thread, and load the address that is in register
4 into the program counter. After these steps, use Thread::Fork() to create a
new thread, initialize the MIPS registers for the new process, and have both
the new and old processes return to user mode. The parent should return to
user mode by returning from the exception handler, the child process should
continue to run from the address that is now in the program counter, which is
the entry point of the function. To implement Fork, you will need to introduce
modifications to the AddrSpace
class in addrspace.cc
so that you can make a ``clone'' of a running user application program. We
suggest adding a function that will create a new address space as an exact copy of the
original. You will have to allocate additional physical memory for this copy,
set up the page tables properly for the new address space, and copy the data
from the old address space to the new. Once the physical memory has been
allocated and the page tables set up, you will use Thread::Fork()
to create a new kernel thread, initialize the MIPS registers for the new
process, and then have both the old and the new processes return to user mode.
The child process should continue by finishing the Fork()
system
call. The parent should return to user mode merely by returning from the
ExceptionHandler()
function.
Exit()
system call should work by calling
Thread::Finish()
, but only after deallocating any physical memory
and other resources that are assigned to the thread that is exiting.
Exec()
system call, you will need a method for
transferring data (the name of the executable, supplied as the argument to the
system call) between the user address space and the kernel. You are
not to use functions Machine::ReadMem()
and
Machine::WriteMem()
in machine/translate.cc.
Instead, you will have to code your own functions that take into account the
address translations described by the page tables to locate the proper
physical address for any given logical address. (Recall that strings in C are
stored as sequences of characters in successive memory locations, terminated
by a null character.)
Once the name of the executable has been copied into the kernel, and the file has been verified to exist, the executable file should be consulted to determine the amount of physical memory required for the new program. This physical memory should be allocated and initialized with data from the executable file, the page tables thread should be adjusted for the new program, the MIPS registers should be reinitialized for starting at the beginning of the new program, and control should return to user mode. File progtest.cc contains a sample for executing a binary program.
If you use ``machine->Run'' to execute a user program, it terminates the
current thread. Since Exec()
needs to return a space ID to the
caller, you should find a way to do that.
NOTE: The object code produced by the MIPS cross-compiler assumes that the data segment begins at the physical address immediately following the text segment. In particular, there is no page alignment, so that if the text segment ends in the middle of a page, then the data segment will start just after it and the page will contain both code and data.
Thread::Yield()
after
making sure to save any necessary state information about the currently
executing process.
diskBuffer
(defined in system.cc). All of your user-level file I/O must go through the
diskBuffer
.
-s
'' flag
to Nachos along with the ``-x
'' flag causes Nachos to single-step
while in user mode. This might be helpful for debugging and understanding.
Also, have a look at the file threads/utility.h
to see all the
code letters that can be supplied along with the ``-d
'' flag to
enable various kinds of debugging printout from Nachos.
The ``-d m
'' option prints out each MIPS instruction as it is executed, which is
very helpful for tracing problems with Fork()
and
The ``-rs s
'' forces random yielding when executing binary code. s is a random number
seed. This flag is helpful to improve the robustness of code, especially when synchronization is used.