Max pooling is a downsampling technique in deep learning used to reduce the dimensionality of input data, such as images. It works by sliding a window over the input and taking the maximum value within that window. This process helps to reduce the computational load and enhances the detection of features in the input data.