Project 0: Implement a Basic Shell

Due April 10, 2015 : 11:59:59pm

Objective

Understand basic Unix system calls and use them for a basic shell implementation.

Specification (Weight 15)

This basic shell is a command-line interpreter that accepts input from the user and executes the commands. Like well-known shells such as bash or tcsh, this shell can execute commands, redirect the standard input or standard output of commands from or to files, and pipe the output of commands to other commands.

When the shell is ready to accept commands, it prints the prompt "Shell: " (without quotes). At this point, the user can type commands. Commands are alphanumeric tokens (e.g., ls, ps, cat) that represent programs that should be executed. This shell should search for these programs in the directories determined by the PATH environment variable. Commands can have arguments. Thus, tokens that follow a command (separated by white space) are treated as the arguments to this command (e.g., cat x).

In addition to executing commands with their arguments, your shell supports the standard Unix I/O redirection meta-characters '<' and '>', command pipelines with '|', and simple signal handling. More details are here.

In case of errors (e.g., invalid input, command not found, ...) your shell should display an error and wait for the next input.

To exit the shell, the user may type "exit" or Ctrl-D (pressing the D button while holding control).
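The prompt and exit behavior described above could be sketched as follows. This is a minimal sketch, not a required design: the function name read_command and its return convention are assumptions made for illustration. It relies on fgets(3) returning NULL at end of input, which is what Ctrl-D typed at the start of a line produces on a terminal.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: print the prompt and read one line from `in`.
 * Returns 1 if a command line was read into buf, or 0 if the shell
 * should exit (end of input, or the line is exactly "exit"). */
int read_command(FILE *in, char *buf, int size)
{
    printf("Shell: ");
    fflush(stdout);                  /* the prompt has no newline, so flush it */

    if (fgets(buf, size, in) == NULL)
        return 0;                    /* EOF: Ctrl-D or end of a test file */

    buf[strcspn(buf, "\n")] = '\0';  /* strip the trailing newline, if any */

    if (strcmp(buf, "exit") == 0)
        return 0;
    return 1;
}
```

A main loop would call read_command(stdin, buf, sizeof buf) repeatedly and stop as soon as it returns 0.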

Your shell is supposed to collect the exit codes of all processes that it spawns.
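One possible way to collect an exit code is with waitpid(2) and the status macros from <sys/wait.h>. The sketch below is illustrative only: the name collect_status is hypothetical, and reporting 128 plus the signal number for a child killed by a signal is a convention borrowed from bash, not something this assignment requires.

```c
#include <errno.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: wait for child `pid` and return its exit code.
 * Returns 128 + signal number if the child was killed by a signal
 * (an assumed convention), or -1 if waitpid itself failed. */
int collect_status(pid_t pid)
{
    int status;

    /* Retry if waitpid is interrupted by a signal. */
    while (waitpid(pid, &status, 0) == -1) {
        if (errno != EINTR)
            return -1;
    }

    if (WIFEXITED(status))
        return WEXITSTATUS(status);
    if (WIFSIGNALED(status))
        return 128 + WTERMSIG(status);
    return -1;
}
```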

Simplifications:

You may assume that white space separates all tokens, and that at most 3 commands are connected by pipes ('|').

You may assume that the maximum length of individual tokens never exceeds 32 characters, and that the maximum length of an input line never exceeds 512 characters.
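Under these simplifications, splitting an input line on white space is enough to obtain the tokens. The following is a minimal sketch using strtok(3); the names split_line and MAX_TOKENS are assumptions for illustration, and the bound of 256 follows from the 512-character line limit (every token needs at least one character plus a separator).

```c
#include <string.h>

#define MAX_TOKENS 256  /* assumed bound: a 512-char line holds at most 256 tokens */

/* Hypothetical helper: split `line` in place on spaces, tabs and newlines.
 * Stores up to max_tokens pointers into `line` in tokens[], followed by a
 * NULL terminator, so tokens[] needs room for max_tokens + 1 entries.
 * Returns the number of tokens found. */
int split_line(char *line, char *tokens[], int max_tokens)
{
    int n = 0;
    char *tok = strtok(line, " \t\n");

    while (tok != NULL && n < max_tokens) {
        tokens[n++] = tok;
        tok = strtok(NULL, " \t\n");
    }
    tokens[n] = NULL;
    return n;
}
```

Because the array is NULL-terminated, a run of tokens belonging to one command can later be passed to execvp once the '<', '>' and '|' tokens have been handled.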

How to Test

You can test your program on a sequence of commands by running "shell < testfile". An example test file:
ls > temp
cat < inputfile | sort | wc > temp
cat < temp > temp1
rm temp
ls | sort | uniq
exit

What to Submit

Your shell implementation should use the fork() system call and the execvp() system call (or one of its variants) to execute commands, and dup2() for I/O pipes and redirection. It should also use waitpid() or wait() to wait for a program to complete execution. You may also find the documentation on signals useful for collecting the status of processes that exit while running in the background.

You need to submit the following files under directory HW0/shell:

A file shell.c that implements the shell described above.
A file Makefile for compiling your code. The compiled executable should be named "shell". Our grading script will use your Makefile as follows, so make sure it provides the appropriate targets.
- make to compile your code.
- make run to run your code with our own test.
- make test to run your code with your test.
Test files that are used in your Makefile.
A file README.txt describing the design of your shell, including design considerations, implementation details, and limitations. Show the output of successful test results after running your tests. This will allow us to assign partial credit in case things do not work as expected.

Hints and Suggestions:

A quick guide on C UNIX/Linux processes and I/O redirection
Sample code for a simple shell program is here, which includes test files. To get full credit, make sure your code can execute these test command files correctly. A simple shell such as this needs a command-line parser to figure out what the user is trying to do. To read a line from the user, you may use fgets(3).
If a valid command has been entered, the shell should fork to create a new (child) process, and the child process should exec the command.
Before calling exec to begin execution, the child process may have to close stdin (file descriptor 0) or stdout (file descriptor 1), open the corresponding file or pipe (with open for files, and pipe for pipes), and use dup2(2) to make it the appropriate file descriptor. After calling dup2, close the old file descriptor.
The main challenge of calling execvp is to build the argument list correctly. If you use execvp, remember that the first argument in the array is the name of the command itself, and the last argument must be a null pointer.
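The fork, exec and wait sequence with a correctly built argument array might look like the sketch below. The function name run_command is hypothetical, and exiting with code 127 when execvp fails follows the usual shell convention for "command not found" rather than anything this assignment mandates.

```c
#include <stdio.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: run argv[0] with arguments argv[1..], searching PATH.
 * argv[0] must be the command name and the array must end with NULL.
 * Returns the child's exit code, 127 if the command could not be executed
 * (an assumed convention), or -1 on fork/wait failure. */
int run_command(char *const argv[])
{
    pid_t pid = fork();
    if (pid < 0) {
        perror("fork");
        return -1;
    }

    if (pid == 0) {
        execvp(argv[0], argv);  /* replaces the child on success */
        perror(argv[0]);        /* only reached if execvp failed */
        _exit(127);
    }

    int status;
    if (waitpid(pid, &status, 0) == -1)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

For the command "cat x", the array would be { "cat", "x", NULL }.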
The easiest way to redirect input and output is to follow these steps in order: (a) open (or create) the input or output file (or pipe); (b) close the corresponding standard file descriptor (stdin or stdout); (c) use dup2 to make file descriptor 0 or 1 correspond to your newly opened file; (d) close the newly opened file descriptor (without closing the standard file descriptor). Note that dup2 itself closes the target descriptor if it is open, so step (b) is harmless but not strictly necessary.
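The redirection steps above can be sketched as follows. The helper names are hypothetical, and the file mode 0644 and the use of O_TRUNC for '>' are assumptions that match the usual shell behavior. The sketch lets dup2 close the standard descriptor instead of closing it in a separate step. In a shell these helpers would be called in the child process, between fork and exec.

```c
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

/* Hypothetical helper: make target_fd (0 or 1) refer to the file at path.
 * Returns 0 on success, -1 on failure. */
int redirect_fd(const char *path, int flags, int target_fd)
{
    int fd = open(path, flags, 0644);      /* (a) open or create the file */
    if (fd < 0) {
        perror(path);
        return -1;
    }
    if (fd != target_fd) {
        if (dup2(fd, target_fd) < 0) {     /* (b)+(c) dup2 closes target_fd first */
            perror("dup2");
            close(fd);
            return -1;
        }
        close(fd);                         /* (d) drop the extra descriptor */
    }
    return 0;
}

/* "< path": read standard input from the file. */
int redirect_stdin(const char *path)
{
    return redirect_fd(path, O_RDONLY, STDIN_FILENO);
}

/* "> path": write standard output to the file, creating or truncating it. */
int redirect_stdout(const char *path)
{
    return redirect_fd(path, O_WRONLY | O_CREAT | O_TRUNC, STDOUT_FILENO);
}
```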
When executing a command line that requires a pipe, the pipe must be created before forking the child processes. Also, if there are multiple pipes, the command(s) in the middle may have both input and output redirected to pipes. Finally, be sure the pipe is closed in the parent process, so that termination of the process writing to the pipe will automatically close the pipe and send an EOF (end of file) to the process reading the pipe.
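For a single pipe, the ordering described above (create the pipe, fork both children, close both ends in the parent) might be sketched like this. The function name run_pipe is hypothetical, and the sketch handles only two commands. A three-command pipeline would need a second pipe, with the middle command redirecting both its input and its output.

```c
#include <stdio.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: run "left | right".
 * Returns the exit code of `right`, or -1 on error. */
int run_pipe(char *const left[], char *const right[])
{
    int fds[2];                       /* fds[0] = read end, fds[1] = write end */
    if (pipe(fds) < 0) {              /* create the pipe before forking */
        perror("pipe");
        return -1;
    }

    pid_t lpid = fork();
    if (lpid < 0) {
        perror("fork");
        close(fds[0]);
        close(fds[1]);
        return -1;
    }
    if (lpid == 0) {                  /* writer: stdout goes into the pipe */
        dup2(fds[1], STDOUT_FILENO);
        close(fds[0]);
        close(fds[1]);
        execvp(left[0], left);
        perror(left[0]);
        _exit(127);
    }

    pid_t rpid = fork();
    if (rpid < 0) {
        perror("fork");
        close(fds[0]);
        close(fds[1]);
        waitpid(lpid, NULL, 0);
        return -1;
    }
    if (rpid == 0) {                  /* reader: stdin comes from the pipe */
        dup2(fds[0], STDIN_FILENO);
        close(fds[0]);
        close(fds[1]);
        execvp(right[0], right);
        perror(right[0]);
        _exit(127);
    }

    /* The parent must close both ends, or the reader never sees EOF. */
    close(fds[0]);
    close(fds[1]);

    int status = 0;
    waitpid(lpid, NULL, 0);
    if (waitpid(rpid, &status, 0) < 0)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```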
Any pipe or file opened in the parent process may be closed as soon as the child is forked -- this will not affect the open file descriptor in the child.