CSCI 241 Documentation Standards

Put a documentation box like this at the top of each .cpp or .h file that you submit:

//***************************************************************************
//
//  file-name
//  CSCI 241 Assignment assignment-number
//
//  Created by name(s) and z-ID(s)
//
//***************************************************************************

//***************************************************************************
//
//  Student.h
//  CSCI 241 Assignment 2
//
//  Created by Ima Fake (z1234567) and Nick Soapdish (z1638482)
//
//***************************************************************************

Variable Names

Most variable names in your program should describe the quantity or item that they represent. For example, if a variable is to hold a student's test average, don't call it "sa"; call it "studentAverage". If a variable is to hold a person's name, don't call it "s" or "n"; call it "name" or better yet, "studentName" or "employeeName". If a variable is to hold the number of students in a class, don't name it "n" or even "num" or "count"; do name it "student_count" or "number_of_students".

The exception to this rule is for temporary variables that have no intrinsic meaning, especially index variables in for loops, e.g.:

for (int i = 0; i < 10; i++)
    ...

// Bad:

int sum=0,num_gizmos=0,gizmoID,square_of_gizmos=0,
i, j;
double avgGizmos, overallAvg, gizmoSDT,
this,that, theOtherThing;

// Good:
    
int sum = 0;
int num_gizmos = 0;
int square_of_gizmos = 0;
int gizmoID;
int i;
int j;

double average_gizmos,
       overall_average,
       gizmoSDT,
       thisThing,
       thatThing,
       theOtherThing;

We will not require that every variable be declared on a new line, but it is generally a good idea. Also, it's better to write a separate declaration for each variable (as shown with the int variables in the example above) rather than declaring multiple variables of the same type with a single statement. You're less likely to type a semicolon when you meant to type a comma (and vice versa), less likely to forget a critical asterisk or ampersand when declaring multiple pointers or references, and if you change the data type on a single variable you won't mess up your other declarations.

Variable Documentation

Well-chosen, descriptive variable names typically should not require any additional documentation. If you feel you need to document a variable declaration, you can do so with a single line comment like so:

int sum = 0;    // Sum of student test scores.

Line and Section Documentation

Line documentation consists of a comment about a single line of code. Often it is on the same line as the code:

average = (double) sum / count;    // Calculate the program average.

// Calculate the area of the triangle: semi is semiperimeter;
// s1, s2, and s3 are sides.
area = sqrt(semi * (semi - s1) * (semi - s2) * (semi - s3));

A moderate amount of documentation in the main body of your program may be advisable, but if you name your variables and functions with meaningful names, you should not need much. If you name variables well, many programs won't need any Line Documentation. The following line does not need documentation:

programAverage = (double) sumProgramPoints / numPrograms;

/* some comment */

// some comment

Section documentation tells the reader about a multi-line section of code that follows. It is usually placed before a loop or decision construct or a series of lines that do a single task. Examples:

// Loop to accept and process user-supplied glucose measurements.

// Decide if gizmos or widgets are to be used.
    
// Calculate and store averages and standard deviations of measurements.

Your programs should use a moderate amount of Section Documentation for functions (such as main()) that are long or have several distinct tasks. You should put a blank line before any Section Documentation and indent it the same amount as the code block that it describes. Obviously, use of Section documentation is partly a matter of personal judgment, but programs should include at least a minimal amount of Section Documentation.

Code Indentation

Class definition bodies, function and method definition bodies, loop bodies, decision bodies, etc. should be indented in a consistent fashion. Each nested structure must have its own level of indenting. In my opinion, three or four spaces is optimal. Two spaces is the absolute minimum for readability, while more than four spaces is pointless.

Braces

Class definition bodies, function and method definition bodies, loop bodies, decision bodies, etc. generally need to be delimited by braces. Here are two common styles of indentation, both of which are acceptable for this class:

// Allman style:
    
#include <iostream>
#include <iomanip>

using std::cout;
using std::endl;

int main()
{
    int i;

    for (i = 1; i <= 100; i++)
    {
        cout << setw(4) << i;

        if (i % 10 == 0)
        {
            cout << endl;
        }
    }

    return 0;
}

// Kernigan & Ritchie style:
    
#include <iostream>
#include <iomanip>

using std::cout;
using std::endl;

int main() {
    int i;

    for (i = 1; i <= 100; i++) {
        cout << setw(4) << i;

        if (i % 10 == 0) {
            cout << endl;
        }
    }

    return 0;
}

Both of the CSCI 241 instructors prefer Allman style. However, the most imoprtant thing is to pick one of these two styles and use it consistently throughout your programs.

Note that a loop or decision body that contains only a single statement does not need to be delimited by braces. That means the braces around the if body in the examples above are optional.

(What constitutes a single "statement" in C++ is sometimes confusing for students, since a "statement" is not necessarily a single line of code. For example, an if and its entire body is considered a single "statement" by the compiler. if you're in doubt, code the braces. In fact, it doesn't hurt to code the braces even if they're not strictly necessary - after all, you might later insert another statement into the loop or decision body and forget to add the braces. If you get in the habit of always coding them, that mistake won't happen.)

Indentation with Tabs vs. Spaces

Code can be indented using either tabs or spaces. Programmers disagree violently about which is superior; the following scene from the HBO show Silicon Valley S03E06 does a decent job of describing the pros and cons of the two methods:

Once again, I don't particularly care which method you use as long as you use it consistently. I tend to use tabs when writing my own code but spaces when writing example code for the courses I teach, because I want the code to look the same no matter what editor a student is using to view it.

The default tab stop in most Unix editors is set at eight spaces, so if you decide to indent with tabs you should really figure out how to change that setting in your editor.

Unfortunately, IDEs like Dev-C++ and Xcode often default to using a mix of tabs and spaces to perform their auto-indentation, which really combines the worst aspects of both methods of indentation. That's something you can (and should) change in your IDE's editor preferences, because otherwise your code will look like a ragged mess unless the person viewing it has set the tab stops in their editor to the same value you used when indenting.

Use of White Space

// Bad:
    
if( num==0){
num=num+5;
}

for(i=0;i<10;i++)
    { cout<<i<<endl;}
    
total=calculate_total(amount,tax_rate,tip);

// Good:
    
if (num == 0) {
    num = num + 5;
}
        
for (i = 0; i < 10; i++)
{
    cout << i << endl;
}

total = calculate_total(amount, tax_rate, tip);

This is also for your own good. More readable code will make it easier to spot the inevitable syntax errors that you will make.

Insert blank lines between logical sections of the program (or between functions) to help the reader find the main sections. Often each of these sections should have Section Documentation to explain its role in the program. For example:

#include <iostream>
#include <iomanip>
    
using std::cout;
using std::endl;
    
int main()
{
    int num1;
    int num2;
    int greatest_common_factor;

    // Obtain user input of two integers.
    cout << "Enter first number: ";
    cin >> num1;
    ...

    // Calculate Greatest Common Factor.
    ... code to do the calculation ...

    // Display the result.
    cout << "... ";
    ...

    return 0;
}

Identifier Naming Conventions

Programmers frequently need to come up with identifiers, names that label the identity of either a unique object or a unique class of objects. For example, things you might need to name in a C++ program include:

There are a number of common styles used when writing multi-word identifiers in C++:

As with many of the other code formatting conventions discussed in this document, there is no single right answer to the question, "Which style should I use?" It's more important to pick one and use it consistently in your programs. You should also try to avoid using a style in a way that no one else would typically use it - for example, don't use screaming snake case for class names, or upper camel case for variable or function names.

Function and Member Function Documentation

Each function or member function in your program should have a documentation box in the following format:

/**
 * This first line is a brief description.
 *
 * The rest of the lines are a more detailed description of the
 * function that outlines what it does and anything interesting about
 * how it does it.
 *
 * @param x Description of the first parameter.
 * @param y Description of the second parameter.
 * @param z Description of the third parameter.
 *
 * @return This is where you describe the possible return values.
 *
 * @note This is how you can add an optional note about the function that
 *    may be of interest to someone using it.
 *
 * @warning This is how you can add an optional warning to a user of the
 *    function suitable for noting things like 'This function is not thread
 *    safe' and so on.
 *
 * @bug This is how you can add an optional description of a known bug in the
 *    function such as: This only works for positive values of z.
 */
int f(int x, int y, int z);

Often the brief description for a function or method will be sufficient and the more detailed description can be omitted. If a function has no parameters or no return value, those items can be omitted as well. The other tags (@note, @warning, and @bug) will only be needed in rare cases.

When documenting parameters and return values, do not simply list the parameter or return value's data type. That information can easily be found by looking at the first line of the function or method or in its prototype. Your documentation should instead explain the purpose and meaning of the parameter or return value. It's okay to mention the data type, but it is not sufficient.

You can put your documentation for the methods of a class or struct in the .h or .cpp files but NOT BOTH. Never replicate anything! If you do so, your documentation will almost inevitably become inconsistent and contradictory over time.

/**
 * Searches an array of Student objects for a specified name.
 *
 * @param student_array An array of Student objects to search.
 * @param search_name A name for which to search.
 *
 * @return The index of the array element that contains search_name or
 *    -1 if the name is not found.
 *
 * @note Uses the binary search algorithm, so student_array must be sorted
 *    in ascending order by name.
 */
int search_for_name(const Student student_array[], const string& search_name)
{
    // Code for function
    . . .
}