.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "gallery/0401.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_gallery_0401.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_gallery_0401.py:


.. _gallery:0401:

Source Separation I
=======================

.. contents::
    :depth: 2
    :local:


In audio signal processing,
source separation is the process of isolating individual sounds
in an auditory mixture of multiple sounds.

Modeling the Source Separation Problem
-------------------------------------------

In this example we consider the problem
of separating two sources which have been
mixed together.
One is the sound of a guitar,
and the other is the sound of a piano.
The mixture is sensed at two
different places with different mixing
weights at each sensor. For each time
instant, the mixing process can be
described by the equation:

.. math::

    \bv = \bM \bu

where :math:`\bu` is a 2-vector containing
one sample each from the guitar and piano
sounds and :math:`\bv` is a 2-vector containing
the samples from the two different listening sensors.
The matrix :math:`\bM` is a :math:`2 \times 2`
mixing matrix.
The mixing matrix
is known to us in this problem.

Assume that the audio has been captured for :math:`m`
samples. We can collect the source audio in
an :math:`m \times 2` matrix :math:`\by` and the captured
audio from the two sensors in an :math:`m \times 2`
matrix :math:`\bb`, with the mixing relationship given by

.. math::

    \bb = \by \bM^T

Here the post-multiplication by the matrix
:math:`\bM^T` is equivalent to processing each
row of the input :math:`\by` by the mixing process
to generate the corresponding row of the output :math:`\bb`.
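
The equivalence between per-sample mixing and post-multiplication can be
checked numerically. The following is a minimal NumPy sketch; the mixing
weights in ``M`` and the random stand-in signals are assumptions chosen only
for illustration and are not the values used by the test problem.

```python
import numpy as np

rng = np.random.default_rng(0)

m = 5                                  # number of time samples (tiny, for illustration)
M = np.array([[0.6, 0.4],
              [0.3, 0.7]])             # hypothetical 2 x 2 mixing matrix
y = rng.standard_normal((m, 2))        # column 0: "guitar", column 1: "piano"

# Mix one time instant at a time: v = M u for each row u of y.
b_loop = np.stack([M @ y[t] for t in range(m)])

# Mix all time instants at once by post-multiplying with M^T.
b_matrix = y @ M.T

print(np.allclose(b_loop, b_matrix))   # True
```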

The Sensor
''''''''''''''''''

From the perspective of sparse reconstruction,
we treat the mixing process as compressive sampling.
Then we can write the mixing equation as the application
of a linear operator :math:`\Phi`:

.. math::

    \bb = \Phi (\by).

Beware that this linear operator processes the input
:math:`\by` row by row to generate the output :math:`\bb`.
Fortunately, the CR-Sparse linear operator architecture allows
us to process input data along any axis of choice. By default
all linear operators process data column by column, which
is akin to a matrix pre-multiplication. However, in this
problem, we will use a simple :math:`2 \times 2` matrix
post-multiplication as our linear operator for representing
the sensing process. 
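
As a rough illustration of such a row-wise operator, the sketch below pairs
a forward function with its adjoint and runs a dot-product test. The helper
``mixing_operator`` and its ``times``/``trans`` callables are hypothetical
names for this sketch; they are not the CR-Sparse API.

```python
import numpy as np

def mixing_operator(M):
    """Return (times, trans) callables for row-wise mixing by M.

    Hypothetical helper for illustration; not the CR-Sparse API.
    """
    def times(y):
        # Forward: each row u of y becomes (M u)^T, i.e. y @ M^T.
        return y @ M.T

    def trans(b):
        # Adjoint: each row v of b becomes (M^T v)^T, i.e. b @ M.
        return b @ M

    return times, trans

rng = np.random.default_rng(1)
M = np.array([[0.6, 0.4],
              [0.3, 0.7]])             # hypothetical mixing matrix
times, trans = mixing_operator(M)

y = rng.standard_normal((6, 2))
b = rng.standard_normal((6, 2))

# Dot-product test: <Phi(y), b> must equal <y, Phi^T(b)>.
lhs = np.sum(times(y) * b)
rhs = np.sum(y * trans(b))
print(np.isclose(lhs, rhs))            # True
```

Keeping the adjoint next to the forward map matters because iterative
solvers need both.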

An alternative design (as used in SPARCO) would have been
to flatten :math:`\by` to a vector of length :math:`2 m`
and then use a :math:`2 m \times 2 m` sensing matrix :math:`\Phi`
(obtained by computing the Kronecker product of :math:`\bM`
with an identity matrix :math:`\bI_m`) to construct a flattened
version of :math:`\bb`. Our post-multiplication version of
the linear operator is a more efficient implementation.
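
The two designs can be compared on a small example. This sketch assumes
column-major flattening (columns of :math:`\by` stacked on top of each other)
and a hypothetical mixing matrix; it only illustrates the identity, not how
SPARCO or CR-Sparse implement it.

```python
import numpy as np

rng = np.random.default_rng(2)
m = 4
M = np.array([[0.6, 0.4],
              [0.3, 0.7]])             # hypothetical mixing matrix
y = rng.standard_normal((m, 2))

# Post-multiplication design: only the 2 x 2 matrix is stored.
b = y @ M.T

# Flattened design: a dense (2m) x (2m) sensing matrix.
Phi = np.kron(M, np.eye(m))
y_flat = y.flatten(order="F")          # stack the columns of y
b_flat = Phi @ y_flat

print(Phi.shape)                                    # (8, 8)
print(np.allclose(b_flat, b.flatten(order="F")))    # True
```

A dense Kronecker matrix has :math:`4 m^2` entries, while the
post-multiplication needs only the four entries of :math:`\bM`.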


The Sparsifying Basis
''''''''''''''''''''''''

To create a suitable sparsifying basis for this signal,
we will first consider a forward transform as defined below:

#. Split the signal into overlapping windows of length :math:`w` 
   samples each.
#. Let the overlap between consecutive windows be :math:`l = w/2`
   samples (50% overlap).
#. For the last partial window, pad it with zeros as needed.
#. Compute the DCT of each window.
#. Concatenate the transforms together to form a representation
   of the signal.

Let there be :math:`b` such
overlapping windows over a signal of length :math:`m`. Then
the shape of this forward transform operator is :math:`w b \times m`.
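
The steps above can be sketched in plain NumPy as follows. The function
names ``dct_matrix`` and ``windowed_dct`` are hypothetical, the orthonormal
DCT-II is an assumed choice of DCT, and the rule used to count windows is an
assumed padding convention. CR-Sparse may differ in these details.

```python
import numpy as np

def dct_matrix(w):
    """Orthonormal DCT-II matrix of size w x w (rows are basis vectors)."""
    n = np.arange(w)
    k = n[:, None]
    C = np.sqrt(2.0 / w) * np.cos(np.pi * (n + 0.5) * k / w)
    C[0, :] /= np.sqrt(2.0)
    return C

def windowed_dct(y, w, l):
    """Forward transform: DCT of overlapping windows, concatenated.

    y: 1-D signal of length m; w: window length; l: overlap in samples.
    """
    hop = w - l
    m = y.shape[0]
    # Number of windows needed to cover the signal (assumed convention).
    b = max(1, int(np.ceil((m - w) / hop)) + 1)
    padded = np.zeros((b - 1) * hop + w)
    padded[:m] = y                      # zero-pad the last partial window
    C = dct_matrix(w)
    coeffs = [C @ padded[i * hop : i * hop + w] for i in range(b)]
    return np.concatenate(coeffs)

# A synthetic tone standing in for one audio channel.
y = np.sin(2 * np.pi * 0.05 * np.arange(14583))
x = windowed_dct(y, w=512, l=256)
print(x.shape)                          # (28672,)
```

With this convention a signal of 14583 samples yields 56 windows of 512
coefficients each, which agrees with the operator shape printed later in
this example.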

We define the adjoint (transpose) of the above forward transform as
the Windowed Discrete Cosine Basis for our music sounds. We denote
this basis by :math:`\Psi`. The representation equation is given by

.. math::

    \by = \Psi (\bx).

Here :math:`\bx` is a :math:`w b \times 2` representation matrix
whose columns are the representations of the corresponding columns of :math:`\by`.
In other words, the first column of :math:`\bx` is a :math:`wb` length
representation of the guitar signal and the second column is the
representation of the piano signal. This is an overcomplete
basis (processing input data column by column). Note that by design,
each column of :math:`\Psi` is unit length.
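
For a tiny case, the basis can be written out as a dense matrix to inspect
its shape and column norms. This is only a sketch under assumptions: the
orthonormal DCT-II is an assumed choice, and the signal length is picked so
that the windows fit exactly and no zero padding is needed.

```python
import numpy as np

def dct_matrix(w):
    """Orthonormal DCT-II matrix of size w x w (rows are basis vectors)."""
    n = np.arange(w)
    k = n[:, None]
    C = np.sqrt(2.0 / w) * np.cos(np.pi * (n + 0.5) * k / w)
    C[0, :] /= np.sqrt(2.0)
    return C

w, hop, b = 4, 2, 3                     # window length, hop (50% overlap), window count
m = (b - 1) * hop + w                   # 8 samples, so the windows fit exactly

# Forward transform T (w*b x m): row block i applies the DCT to window i.
T = np.zeros((w * b, m))
C = dct_matrix(w)
for i in range(b):
    T[i * w : (i + 1) * w, i * hop : i * hop + w] = C

Psi = T.T                               # synthesis operator, m x (w*b)
print(Psi.shape)                                          # (8, 12)
print(np.allclose(np.linalg.norm(Psi, axis=0), 1.0))      # True
```

There are more columns (12) than rows (8), which is what makes the
representation overcomplete.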

CR-Sparse includes an operator called ``windowed_op`` which can
transform any linear operator :math:`T` into a windowed operator
as described above. Starting from the ``cosine_basis``
operator, we construct our sparsifying basis :math:`\Psi`
using ``windowed_op``. In particular, we will be using
windows of length 512 with overlaps of 256 samples.
The details of the construction of the sensing operator
and the sparsifying basis can be seen in the ``prob401.py``
file in the source code.

The Sparse Recovery Problem
'''''''''''''''''''''''''''''

We now combine the operators :math:`\Phi` and :math:`\Psi`
to construct the operator :math:`\bA = \Phi \Psi` and form
the linear equation:

.. math::

    \bb = \bA (\bx).
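
The composition can be sketched with dense stand-ins. Here ``Psi`` is a
random matrix used in place of the windowed cosine basis and ``M`` is a
hypothetical mixing matrix; ``A_times`` and ``A_trans`` are illustrative
names, not CR-Sparse functions.

```python
import numpy as np

rng = np.random.default_rng(3)
m, n = 8, 12
Psi = rng.standard_normal((m, n))       # stand-in for the windowed cosine basis
M = np.array([[0.6, 0.4],
              [0.3, 0.7]])              # hypothetical mixing matrix

def A_times(x):
    # x: (n, 2) -> y = Psi x (column by column) -> b = y M^T (row by row)
    return (Psi @ x) @ M.T

def A_trans(b):
    # The adjoint applies the adjoints in reverse order: Phi^T, then Psi^T.
    return Psi.T @ (b @ M)

x = rng.standard_normal((n, 2))
b = rng.standard_normal((m, 2))

print(A_times(x).shape, A_trans(b).shape)       # (8, 2) (12, 2)
# Dot-product test for the composed operator.
print(np.isclose(np.sum(A_times(x) * b), np.sum(x * A_trans(b))))   # True
```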

We can now use a suitable sparse recovery algorithm to
recover :math:`\bx` from :math:`\bb`. We shall do
that using SPGL1 (Spectral Projected Gradient for L1)
in this example.
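
For reference, SPGL1 is commonly described as a solver for the basis pursuit
denoising problem, stated below in the notation of this example. Treating
:math:`\bx` and :math:`\bb` as flattened vectors when taking the norms is an
assumption made here for illustration.

.. math::

    \min_{\bx} \| \bx \|_1 \quad \text{subject to} \quad \| \bA (\bx) - \bb \|_2 \leq \sigma.

With :math:`\sigma = 0` the constraint reduces to the equality
:math:`\bb = \bA (\bx)`, which is the basis pursuit problem.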

See also:

* :ref:`api:problems`
* :ref:`api:lop`

.. GENERATED FROM PYTHON SOURCE LINES 155-165

.. code-block:: default


    # Configure JAX to work with 64-bit floating point precision. 
    import jax
    jax.config.update("jax_enable_x64", True)

    import jax.numpy as jnp
    import cr.nimble as crn
    import cr.sparse as crs
    import cr.sparse.plots as crplot


.. GENERATED FROM PYTHON SOURCE LINES 166-170

Setup
------------------------------
We shall construct our test signal and dictionary
using our test problems module.

.. GENERATED FROM PYTHON SOURCE LINES 170-175

.. code-block:: default


    from cr.sparse import problems
    prob = problems.generate('src-sep-1')
    fig, ax = problems.plot(prob)


.. image-sg:: /gallery/images/sphx_glr_0401_001.png
   :alt: Audio signal 1 (Guitar), Audio signal 2 (Piano), Mixed audio signal 1, Mixed audio signal 2
   :srcset: /gallery/images/sphx_glr_0401_001.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Downloading sparco_prob401_Guitar.wav
    Download complete for sparco_prob401_Guitar.wav
    Downloading sparco_prob401_Piano.wav
    Download complete for sparco_prob401_Piano.wav


.. GENERATED FROM PYTHON SOURCE LINES 176-177

Let us access the relevant parts of our test problem

.. GENERATED FROM PYTHON SOURCE LINES 177-189

.. code-block:: default


    # The sparsifying basis linear operator
    Psi = prob.Psi
    # The combined operator A for the linear equation b = A(x)
    A = prob.A
    # Mixture signals
    b0 = prob.b
    # Original signals
    y0 = prob.y
    # The sparse representation of the signal in the dictionary
    x0 = prob.x


.. GENERATED FROM PYTHON SOURCE LINES 190-192

Sparse Reconstruction using SPGL1
-------------------------------------

.. GENERATED FROM PYTHON SOURCE LINES 195-196

The shape of the mixed signal

.. GENERATED FROM PYTHON SOURCE LINES 196-203

.. code-block:: default

    bm, nc = b0.shape
    # The shape of sparsifying basis
    tm, tn = prob.Psi.shape
    print(bm, nc, tm, tn)
    # Prepare an initial (zero) estimate of the model
    x_init = jnp.zeros((tn, nc))


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    14583 2 14583 28672


.. GENERATED FROM PYTHON SOURCE LINES 204-205

Run SPGL1 algorithm

.. GENERATED FROM PYTHON SOURCE LINES 205-211

.. code-block:: default

    import cr.sparse.cvx.spgl1 as crspgl1
    sigma=0.
    options = crspgl1.SPGL1Options(max_iters=300)
    tracker = crs.ProgressTracker(every=10)
    sol = crspgl1.solve_bpic_from_jit(A, b0, sigma, 
        x_init, options=options, tracker=tracker)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    [10] x_norm: 2.97e+05, r_norm: 4.45e+05
    [20] x_norm: 4.58e+05, r_norm: 1.69e+05
    [30] x_norm: 4.97e+05, r_norm: 1.56e+05
    [40] x_norm: 5.54e+05, r_norm: 4.39e+04
    [50] x_norm: 5.55e+05, r_norm: 3.79e+04
    [60] x_norm: 5.56e+05, r_norm: 3.75e+04
    [70] x_norm: 5.57e+05, r_norm: 3.73e+04
    [80] x_norm: 5.66e+05, r_norm: 2.18e+04
    [90] x_norm: 5.69e+05, r_norm: 9.22e+03
    [100] x_norm: 5.70e+05, r_norm: 8.65e+03
    [110] x_norm: 5.70e+05, r_norm: 8.37e+03
    [120] x_norm: 5.70e+05, r_norm: 8.31e+03
    [130] x_norm: 5.70e+05, r_norm: 8.26e+03
    [140] x_norm: 5.70e+05, r_norm: 8.22e+03
    [150] x_norm: 5.70e+05, r_norm: 8.18e+03
    [160] x_norm: 5.72e+05, r_norm: 4.90e+03
    [170] x_norm: 5.72e+05, r_norm: 4.70e+03
    [180] x_norm: 5.72e+05, r_norm: 4.66e+03
    [190] x_norm: 5.72e+05, r_norm: 4.64e+03
    [200] x_norm: 5.72e+05, r_norm: 4.63e+03
    [210] x_norm: 5.72e+05, r_norm: 4.59e+03
    [220] x_norm: 5.73e+05, r_norm: 1.54e+03
    [230] x_norm: 5.74e+05, r_norm: 1.09e+03
    [240] x_norm: 5.74e+05, r_norm: 9.82e+02
    [250] x_norm: 5.74e+05, r_norm: 9.42e+02
    [260] x_norm: 5.74e+05, r_norm: 9.19e+02
    [270] x_norm: 5.74e+05, r_norm: 9.07e+02
    [280] x_norm: 5.74e+05, r_norm: 8.97e+02
    [290] x_norm: 5.74e+05, r_norm: 8.79e+02
    [300] x_norm: 5.74e+05, r_norm: 8.71e+02
    Algorithm converged in 300 iterations.


.. GENERATED FROM PYTHON SOURCE LINES 212-213

Let us check the quality of reconstruction

.. GENERATED FROM PYTHON SOURCE LINES 213-215

.. code-block:: default

    problems.analyze_solution(prob, sol)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    m: 2, n: 28672
    b_norm: original: 677817.092 reconstruction: 677610.672 SNR: 57.82 dB
    y_norm: original: 677851.111 reconstruction: 677371.277 SNR: 51.85 dB
    Iterations: 300 n_times: 461, n_trans: 301


.. GENERATED FROM PYTHON SOURCE LINES 216-217

Let's plot the progress of SPGL1 over different iterations

.. GENERATED FROM PYTHON SOURCE LINES 217-220

.. code-block:: default

    ax = crplot.one_plot(height=6)
    tracker.plot_progress(ax)


.. image-sg:: /gallery/images/sphx_glr_0401_002.png
   :alt: 0401
   :srcset: /gallery/images/sphx_glr_0401_002.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 221-222

The estimated sparse representation

.. GENERATED FROM PYTHON SOURCE LINES 222-223

.. code-block:: default

    x = sol.x


.. GENERATED FROM PYTHON SOURCE LINES 224-225

Let us reconstruct the signal from this sparse representation

.. GENERATED FROM PYTHON SOURCE LINES 225-231

.. code-block:: default

    y = prob.reconstruct(x)
    # Compare with the original signals
    snr = crn.signal_noise_ratio(y0, y)
    print(f'SNR: {snr:.2f} dB')


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    SNR: 51.85 dB
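
The signal-to-noise ratio reported above can be understood through the
conventional energy-ratio definition. The sketch below assumes that
definition; ``snr_db`` is a hypothetical helper and may not match
``crn.signal_noise_ratio`` in every detail.

```python
import numpy as np

def snr_db(reference, estimate):
    """SNR in dB: 10 log10(signal energy / error energy)."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    signal_energy = np.sum(reference ** 2)
    error_energy = np.sum((reference - estimate) ** 2)
    return 10.0 * np.log10(signal_energy / error_energy)

# Toy data: a reference and an estimate with a small alternating error.
ref = np.array([1.0, 2.0, 3.0, 4.0])
est = ref + 0.01 * np.array([1.0, -1.0, 1.0, -1.0])
print(f"SNR: {snr_db(ref, est):.2f} dB")
```

Every 20 dB of SNR corresponds to a tenfold reduction of the error norm
relative to the signal norm.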


.. GENERATED FROM PYTHON SOURCE LINES 232-233

Let us visually compare the original signals with the reconstructed ones

.. GENERATED FROM PYTHON SOURCE LINES 233-242

.. code-block:: default

    ax = crplot.h_plots(4)
    ax[0].plot(y0[:, 0])
    ax[0].set_title("Original audio 1 (Guitar)")
    ax[1].plot(y[:, 0])
    ax[1].set_title("Reconstructed audio 1 (Guitar)")
    ax[2].plot(y0[:, 1])
    ax[2].set_title("Original audio 2 (Piano)")
    ax[3].plot(y[:, 1])
    ax[3].set_title("Reconstructed audio 2 (Piano)")


.. image-sg:: /gallery/images/sphx_glr_0401_003.png
   :alt: Original audio 1 (Guitar), Reconstructed audio 1 (Guitar), Original audio 2 (Piano), Reconstructed audio 2 (Piano)
   :srcset: /gallery/images/sphx_glr_0401_003.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Text(0.5, 1.0, 'Reconstructed audio 2 (Piano)')


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 22.147 seconds)


.. _sphx_glr_download_gallery_0401.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example


    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: 0401.py <0401.py>`

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: 0401.ipynb <0401.ipynb>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_