1. The Human Visual System & Perception

Understanding how images form in the eye helps inform how we process digital images.


2. Image Acquisition & Digitization

Images are generated by the combination of an illumination source and the reflection of that energy by objects in a scene.

The Image Formation Model

A simple image can be expressed as a two-dimensional function f(x,y), which represents the amplitude (intensity) at specific spatial coordinates. It is characterized by illumination i(x,y) and reflectance r(x,y):

f(x,y)=i(x,y)r(x,y)

Digitizing the Signal

Because most sensors output a continuous waveform, creating a digital image requires two distinct processes:

  1. Image Sampling: Digitizing the spatial image coordinates.
  2. Image Quantization: Digitizing the image amplitude (intensity levels).

3. Representation, Resolution & Contrast

Once sampled and quantized, digital images are represented as a matrix where integer coordinate values map one-to-one with matrix rows and columns.

Resolution Types


4. Pixel Relationships & Distances

Neighborhoods

A pixel p at coordinates (x,y) has defined neighbor sets:

Distance Measures

For pixels p(x,y) and q(u,v), standard distance functions include:

Pasted image 20260226220239.png


5. Mathematical & Spatial Operations

Linear vs. Non-Linear Operations

An operator H producing output g(x,y) from input f(x,y) is linear if it satisfies:

H[aifi(x,y)+ajfj(x,y)]=aiH[fi(x,y)]+ajH[fj(x,y)]

Common Operations

Spatial Operations

Operations performed directly on the pixels can be classified by their scope:


• A few comments about implementing image arithmetic operations are in order before we leave this section. In practice, most images are displayed using 8 bits (even 24-bit color images consist of three separate 8-bit channels). Thus, we expect image values to be in the range from 0 to 255.

• When images are saved in a standard image format, such as TIFF or JPEG, conversion to this range is automatic.

• When image values exceed the allowed range, clipping or scaling becomes necessary. For example, the values in the difference of two 8-bit images can range from a minimum of -255 to a maximum of 255, and the values of the sum of two such images can range from 0 to 510.

• When converting images to eight bits, many software applications simply set all negative values to 0 and set to 255 all values that exceed this limit.


6. Image Transforms & Probabilities

Transform Domain

When spatial domain processing is insufficient, a 2-D linear transform can be applied. The forward transform is given by:

T(u,v)=x=0M1y=0N1f(x,y)r(x,y,u,v)

The image can be returned to the spatial domain via an inverse transform using the inverse transformation kernel s(x,y,u,v) .

Probabilistic Methods

Treating image intensities as random quantities allows for statistical analysis. The probability of an intensity level zk occurring is p(zk)=nkMN. This yields: