TSNE vs COSNE : Euclidean vs Hyperbolic#

We compare in this example two dimensionalty reduction methods: T-SNE and CO-SNE on a synthetic hierarchical toy dataset and on singlecell data. The first method computes an embedding in a 2D Euclidean space while the second one operates in the Hyperbolic Poincaré Ball model.

import numpy as np
from torchdr.utils.visu import plot_disk
from torchdr import TSNE, COSNE
import urllib.request
import matplotlib.pylab as plt

Load the SNARE-seq dataset (gene expression) with cell type labels#

def load_numpy_from_url(url, delimiter="\t"):
    """
    Load a numpy array from a URL.

    Parameters
    ----------
    url : str
        URL to load data from.
    delimiter : str, default="\t"
        Delimiter used in the data file.

    Returns
    -------
    numpy.ndarray
        Loaded data as a numpy array.
    """
    response = urllib.request.urlopen(url)
    data = response.read().decode("utf-8")
    data = data.split("\n")
    data = [row.split(delimiter) for row in data if row]
    numpy_array = np.array(data, dtype=float)
    return numpy_array


url_x = "https://rsinghlab.github.io/SCOT/data/snare_rna.txt"
snare_data = load_numpy_from_url(url_x) / 100

url_y = "https://rsinghlab.github.io/SCOT/data/SNAREseq_types.txt"
snare_labels = load_numpy_from_url(url_y)

Computing TSNE and COSNE on SNARE-seq data#

We can now proceed to computing the two DR methods and visualizing the results on the SNARE-seq dataset.

tsne_model = TSNE(verbose=True, max_iter=500)
out_tsne = tsne_model.fit_transform(snare_data)

cosne_model = COSNE(lr=1e-1, verbose=True, gamma=0.5, lambda1=0.01, max_iter=500)
out_cosne = cosne_model.fit_transform(snare_data)


fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(16, 8))
axes[0].scatter(*out_tsne.T, c=snare_labels.squeeze(1), cmap=plt.get_cmap("rainbow"))
axes[0].set_xticks([])
axes[0].set_yticks([])
axes[0].set_title("T-SNE", fontsize=24)
plot_disk(axes[1])
axes[1].scatter(*out_cosne.T, c=snare_labels.squeeze(1), cmap=plt.get_cmap("rainbow"))
axes[1].axis("off")
axes[1].set_title("CO-SNE", fontsize=24)
plt.show()
T-SNE, CO-SNE
  0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   0%|          | 1/500 [00:00<01:41,  4.93it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   0%|          | 1/500 [00:00<01:41,  4.93it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   0%|          | 2/500 [00:00<01:57,  4.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   0%|          | 2/500 [00:00<01:57,  4.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 3/500 [00:00<01:49,  4.55it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 3/500 [00:00<01:49,  4.55it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 4/500 [00:00<01:33,  5.33it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 4/500 [00:00<01:33,  5.33it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 5/500 [00:00<01:34,  5.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 5/500 [00:01<01:34,  5.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 6/500 [00:01<01:36,  5.14it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|          | 6/500 [00:01<01:36,  5.14it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|▏         | 7/500 [00:01<01:36,  5.12it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   1%|▏         | 7/500 [00:01<01:36,  5.12it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 8/500 [00:01<01:21,  6.00it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 8/500 [00:01<01:21,  6.00it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 9/500 [00:01<01:27,  5.64it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 9/500 [00:01<01:27,  5.64it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 10/500 [00:01<01:30,  5.43it/s]
[TorchDR] TSNE: Loss: 1.41e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 10/500 [00:02<01:30,  5.43it/s]
[TorchDR] TSNE: Loss: 1.41e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 11/500 [00:02<01:32,  5.31it/s]
[TorchDR] TSNE: Loss: 1.43e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 11/500 [00:02<01:32,  5.31it/s]
[TorchDR] TSNE: Loss: 1.43e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 12/500 [00:02<01:33,  5.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   2%|▏         | 12/500 [00:02<01:33,  5.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 13/500 [00:02<01:35,  5.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 13/500 [00:02<01:35,  5.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 14/500 [00:02<01:35,  5.10it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 14/500 [00:03<01:35,  5.10it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 15/500 [00:03<01:50,  4.40it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 15/500 [00:03<01:50,  4.40it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 16/500 [00:03<01:46,  4.55it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 16/500 [00:03<01:46,  4.55it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 17/500 [00:03<01:57,  4.12it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   3%|▎         | 17/500 [00:03<01:57,  4.12it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▎         | 18/500 [00:03<02:05,  3.84it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▎         | 18/500 [00:04<02:05,  3.84it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 19/500 [00:04<01:56,  4.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 19/500 [00:04<01:56,  4.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 20/500 [00:04<01:50,  4.36it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 20/500 [00:04<01:50,  4.36it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 21/500 [00:04<01:45,  4.54it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 21/500 [00:04<01:45,  4.54it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 22/500 [00:04<01:37,  4.92it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   4%|▍         | 22/500 [00:04<01:37,  4.92it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▍         | 23/500 [00:04<01:27,  5.45it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▍         | 23/500 [00:04<01:27,  5.45it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▍         | 24/500 [00:04<01:29,  5.31it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▍         | 24/500 [00:05<01:29,  5.31it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▌         | 25/500 [00:05<01:31,  5.22it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▌         | 25/500 [00:05<01:31,  5.22it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▌         | 26/500 [00:05<01:18,  6.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▌         | 26/500 [00:05<01:18,  6.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▌         | 27/500 [00:05<01:23,  5.70it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   5%|▌         | 27/500 [00:05<01:23,  5.70it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 28/500 [00:05<01:39,  4.73it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 28/500 [00:05<01:39,  4.73it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 29/500 [00:05<01:37,  4.81it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 29/500 [00:06<01:37,  4.81it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 30/500 [00:06<01:36,  4.85it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 30/500 [00:06<01:36,  4.85it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 31/500 [00:06<01:49,  4.28it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▌         | 31/500 [00:06<01:49,  4.28it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▋         | 32/500 [00:06<01:31,  5.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   6%|▋         | 32/500 [00:06<01:31,  5.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 33/500 [00:06<01:45,  4.43it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 33/500 [00:07<01:45,  4.43it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 34/500 [00:07<01:55,  4.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 34/500 [00:07<01:55,  4.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 35/500 [00:07<01:48,  4.28it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 35/500 [00:07<01:48,  4.28it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 36/500 [00:07<01:57,  3.94it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 36/500 [00:07<01:57,  3.94it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 37/500 [00:07<01:50,  4.21it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   7%|▋         | 37/500 [00:08<01:50,  4.21it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 38/500 [00:08<01:53,  4.08it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 38/500 [00:08<01:53,  4.08it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 39/500 [00:08<01:38,  4.69it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 39/500 [00:08<01:38,  4.69it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 40/500 [00:08<01:49,  4.21it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 40/500 [00:08<01:49,  4.21it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 41/500 [00:08<01:31,  5.04it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 41/500 [00:08<01:31,  5.04it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 42/500 [00:08<01:44,  4.39it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   8%|▊         | 42/500 [00:09<01:44,  4.39it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▊         | 43/500 [00:09<01:40,  4.56it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▊         | 43/500 [00:09<01:40,  4.56it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 44/500 [00:09<01:37,  4.66it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 44/500 [00:09<01:37,  4.66it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 45/500 [00:09<01:48,  4.18it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 45/500 [00:09<01:48,  4.18it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 46/500 [00:09<01:43,  4.38it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 46/500 [00:10<01:43,  4.38it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 47/500 [00:10<01:39,  4.54it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:   9%|▉         | 47/500 [00:10<01:39,  4.54it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:  10%|▉         | 48/500 [00:10<01:49,  4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:  10%|▉         | 48/500 [00:10<01:49,  4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:  10%|▉         | 49/500 [00:10<01:57,  3.85it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:  10%|▉         | 49/500 [00:10<01:57,  3.85it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01:  10%|█         | 50/500 [00:10<01:49,  4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  10%|█         | 50/500 [00:11<01:49,  4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  10%|█         | 51/500 [00:11<01:44,  4.31it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  10%|█         | 51/500 [00:11<01:44,  4.31it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  10%|█         | 52/500 [00:11<01:38,  4.53it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  10%|█         | 52/500 [00:11<01:38,  4.53it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 53/500 [00:11<01:48,  4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 53/500 [00:11<01:48,  4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 54/500 [00:11<01:29,  4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 54/500 [00:11<01:29,  4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 55/500 [00:11<01:29,  4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 55/500 [00:11<01:29,  4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 56/500 [00:11<01:29,  4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█         | 56/500 [00:12<01:29,  4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█▏        | 57/500 [00:12<01:29,  4.97it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  11%|█▏        | 57/500 [00:12<01:29,  4.97it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 58/500 [00:12<01:41,  4.35it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 58/500 [00:12<01:41,  4.35it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 59/500 [00:12<01:37,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 59/500 [00:12<01:37,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 60/500 [00:12<01:42,  4.28it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 60/500 [00:13<01:42,  4.28it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 61/500 [00:13<01:43,  4.26it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 61/500 [00:13<01:43,  4.26it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 62/500 [00:13<01:46,  4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  12%|█▏        | 62/500 [00:13<01:46,  4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 63/500 [00:13<01:40,  4.34it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 63/500 [00:13<01:40,  4.34it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 64/500 [00:13<01:28,  4.92it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 64/500 [00:14<01:28,  4.92it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 65/500 [00:14<01:53,  3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 65/500 [00:14<01:53,  3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 66/500 [00:14<01:45,  4.12it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 66/500 [00:14<01:45,  4.12it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 67/500 [00:14<01:47,  4.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  13%|█▎        | 67/500 [00:14<01:47,  4.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▎        | 68/500 [00:14<01:33,  4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▎        | 68/500 [00:14<01:33,  4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 69/500 [00:14<01:30,  4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 69/500 [00:15<01:30,  4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 70/500 [00:15<01:42,  4.21it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 70/500 [00:15<01:42,  4.21it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 71/500 [00:15<01:37,  4.41it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 71/500 [00:15<01:37,  4.41it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 72/500 [00:15<01:46,  4.03it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  14%|█▍        | 72/500 [00:16<01:46,  4.03it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▍        | 73/500 [00:16<01:52,  3.79it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▍        | 73/500 [00:16<01:52,  3.79it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▍        | 74/500 [00:16<01:44,  4.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▍        | 74/500 [00:16<01:44,  4.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▌        | 75/500 [00:16<01:50,  3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▌        | 75/500 [00:16<01:50,  3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▌        | 76/500 [00:16<01:42,  4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▌        | 76/500 [00:16<01:42,  4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▌        | 77/500 [00:16<01:37,  4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  15%|█▌        | 77/500 [00:17<01:37,  4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 78/500 [00:17<01:33,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 78/500 [00:17<01:33,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 79/500 [00:17<01:30,  4.63it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 79/500 [00:17<01:30,  4.63it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 80/500 [00:17<01:40,  4.18it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 80/500 [00:17<01:40,  4.18it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 81/500 [00:17<01:23,  5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▌        | 81/500 [00:18<01:23,  5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▋        | 82/500 [00:18<01:23,  5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  16%|█▋        | 82/500 [00:18<01:23,  5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 83/500 [00:18<01:23,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 83/500 [00:18<01:23,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 84/500 [00:18<01:22,  5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 84/500 [00:18<01:22,  5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 85/500 [00:18<01:22,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 85/500 [00:18<01:22,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 86/500 [00:18<01:22,  5.00it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 86/500 [00:18<01:22,  5.00it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 87/500 [00:18<01:22,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  17%|█▋        | 87/500 [00:19<01:22,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 88/500 [00:19<01:22,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 88/500 [00:19<01:22,  5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 89/500 [00:19<01:34,  4.37it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 89/500 [00:19<01:34,  4.37it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 90/500 [00:19<01:30,  4.51it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 90/500 [00:19<01:30,  4.51it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 91/500 [00:19<01:27,  4.66it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 91/500 [00:20<01:27,  4.66it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 92/500 [00:20<01:26,  4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  18%|█▊        | 92/500 [00:20<01:26,  4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▊        | 93/500 [00:20<01:24,  4.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▊        | 93/500 [00:20<01:24,  4.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 94/500 [00:20<01:23,  4.89it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 94/500 [00:20<01:23,  4.89it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 95/500 [00:20<01:22,  4.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 95/500 [00:20<01:22,  4.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 96/500 [00:20<01:22,  4.93it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 96/500 [00:21<01:22,  4.93it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 97/500 [00:21<01:21,  4.97it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  19%|█▉        | 97/500 [00:21<01:21,  4.97it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  20%|█▉        | 98/500 [00:21<01:20,  4.99it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  20%|█▉        | 98/500 [00:21<01:20,  4.99it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  20%|█▉        | 99/500 [00:21<01:08,  5.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  20%|█▉        | 99/500 [00:21<01:08,  5.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01:  20%|██        | 100/500 [00:21<01:12,  5.54it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  20%|██        | 100/500 [00:21<01:12,  5.54it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  20%|██        | 101/500 [00:21<01:13,  5.40it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  20%|██        | 101/500 [00:22<01:13,  5.40it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  20%|██        | 102/500 [00:22<01:27,  4.55it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  20%|██        | 102/500 [00:22<01:27,  4.55it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 103/500 [00:22<01:36,  4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 103/500 [00:22<01:36,  4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 104/500 [00:22<01:31,  4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 104/500 [00:22<01:31,  4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 105/500 [00:22<01:27,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 105/500 [00:23<01:27,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 106/500 [00:23<01:36,  4.10it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██        | 106/500 [00:23<01:36,  4.10it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██▏       | 107/500 [00:23<01:19,  4.94it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  21%|██▏       | 107/500 [00:23<01:19,  4.94it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 108/500 [00:23<01:19,  4.95it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 108/500 [00:23<01:19,  4.95it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 109/500 [00:23<01:30,  4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 109/500 [00:23<01:30,  4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 110/500 [00:23<01:26,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 110/500 [00:24<01:26,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 111/500 [00:24<01:35,  4.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 111/500 [00:24<01:35,  4.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 112/500 [00:24<01:29,  4.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  22%|██▏       | 112/500 [00:24<01:29,  4.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 113/500 [00:24<01:44,  3.69it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 113/500 [00:24<01:44,  3.69it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 114/500 [00:24<01:40,  3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 114/500 [00:25<01:40,  3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 115/500 [00:25<01:33,  4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 115/500 [00:25<01:33,  4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 116/500 [00:25<01:39,  3.85it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 116/500 [00:25<01:39,  3.85it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 117/500 [00:25<01:32,  4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  23%|██▎       | 117/500 [00:25<01:32,  4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▎       | 118/500 [00:25<01:27,  4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▎       | 118/500 [00:26<01:27,  4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 119/500 [00:26<01:24,  4.53it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 119/500 [00:26<01:24,  4.53it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 120/500 [00:26<01:21,  4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 120/500 [00:26<01:21,  4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 121/500 [00:26<01:30,  4.19it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 121/500 [00:26<01:30,  4.19it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 122/500 [00:26<01:14,  5.04it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  24%|██▍       | 122/500 [00:26<01:14,  5.04it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▍       | 123/500 [00:26<01:14,  5.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▍       | 123/500 [00:27<01:14,  5.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▍       | 124/500 [00:27<01:14,  5.05it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▍       | 124/500 [00:27<01:14,  5.05it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▌       | 125/500 [00:27<01:03,  5.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▌       | 125/500 [00:27<01:03,  5.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▌       | 126/500 [00:27<00:56,  6.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▌       | 126/500 [00:27<00:56,  6.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▌       | 127/500 [00:27<01:01,  6.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  25%|██▌       | 127/500 [00:27<01:01,  6.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 128/500 [00:27<01:05,  5.72it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 128/500 [00:27<01:05,  5.72it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 129/500 [00:27<01:07,  5.47it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 129/500 [00:28<01:07,  5.47it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 130/500 [00:28<01:09,  5.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 130/500 [00:28<01:09,  5.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 131/500 [00:28<01:21,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▌       | 131/500 [00:28<01:21,  4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▋       | 132/500 [00:28<01:29,  4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  26%|██▋       | 132/500 [00:28<01:29,  4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 133/500 [00:28<01:25,  4.31it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 133/500 [00:29<01:25,  4.31it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 134/500 [00:29<01:21,  4.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 134/500 [00:29<01:21,  4.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 135/500 [00:29<01:29,  4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 135/500 [00:29<01:29,  4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 136/500 [00:29<01:24,  4.31it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 136/500 [00:29<01:24,  4.31it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 137/500 [00:29<01:21,  4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  27%|██▋       | 137/500 [00:30<01:21,  4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 138/500 [00:30<01:17,  4.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 138/500 [00:30<01:17,  4.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 139/500 [00:30<01:26,  4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 139/500 [00:30<01:26,  4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 140/500 [00:30<01:11,  5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 140/500 [00:30<01:11,  5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 141/500 [00:30<01:22,  4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 141/500 [00:30<01:22,  4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 142/500 [00:30<01:08,  5.24it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  28%|██▊       | 142/500 [00:31<01:08,  5.24it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▊       | 143/500 [00:31<01:09,  5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▊       | 143/500 [00:31<01:09,  5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 144/500 [00:31<01:19,  4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 144/500 [00:31<01:19,  4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 145/500 [00:31<01:07,  5.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 145/500 [00:31<01:07,  5.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 146/500 [00:31<01:18,  4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 146/500 [00:31<01:18,  4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 147/500 [00:31<01:15,  4.65it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  29%|██▉       | 147/500 [00:32<01:15,  4.65it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  30%|██▉       | 148/500 [00:32<01:14,  4.74it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  30%|██▉       | 148/500 [00:32<01:14,  4.74it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  30%|██▉       | 149/500 [00:32<01:13,  4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  30%|██▉       | 149/500 [00:32<01:13,  4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01:  30%|███       | 150/500 [00:32<01:32,  3.78it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  30%|███       | 150/500 [00:32<01:32,  3.78it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  30%|███       | 151/500 [00:32<01:35,  3.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  30%|███       | 151/500 [00:33<01:35,  3.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  30%|███       | 152/500 [00:33<01:38,  3.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  30%|███       | 152/500 [00:33<01:38,  3.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 153/500 [00:33<01:29,  3.87it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 153/500 [00:33<01:29,  3.87it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 154/500 [00:33<01:23,  4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 154/500 [00:33<01:23,  4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 155/500 [00:33<01:18,  4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 155/500 [00:34<01:18,  4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 156/500 [00:34<01:15,  4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███       | 156/500 [00:34<01:15,  4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███▏      | 157/500 [00:34<01:23,  4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  31%|███▏      | 157/500 [00:34<01:23,  4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 158/500 [00:34<01:18,  4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 158/500 [00:34<01:18,  4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 159/500 [00:34<01:25,  3.99it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 159/500 [00:35<01:25,  3.99it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 160/500 [00:35<01:10,  4.83it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 160/500 [00:35<01:10,  4.83it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 161/500 [00:35<01:19,  4.27it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 161/500 [00:35<01:19,  4.27it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 162/500 [00:35<01:25,  3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  32%|███▏      | 162/500 [00:35<01:25,  3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 163/500 [00:35<01:10,  4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 163/500 [00:35<01:10,  4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 164/500 [00:35<01:09,  4.84it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 164/500 [00:36<01:09,  4.84it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 165/500 [00:36<01:17,  4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 165/500 [00:36<01:17,  4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 166/500 [00:36<01:04,  5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 166/500 [00:36<01:04,  5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 167/500 [00:36<01:05,  5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  33%|███▎      | 167/500 [00:36<01:05,  5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▎      | 168/500 [00:36<01:14,  4.43it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▎      | 168/500 [00:37<01:14,  4.43it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 169/500 [00:37<01:12,  4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 169/500 [00:37<01:12,  4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 170/500 [00:37<01:10,  4.67it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 170/500 [00:37<01:10,  4.67it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 171/500 [00:37<01:18,  4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 171/500 [00:37<01:18,  4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 172/500 [00:37<01:14,  4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  34%|███▍      | 172/500 [00:37<01:14,  4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▍      | 173/500 [00:37<01:11,  4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▍      | 173/500 [00:38<01:11,  4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▍      | 174/500 [00:38<01:19,  4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▍      | 174/500 [00:38<01:19,  4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▌      | 175/500 [00:38<01:14,  4.34it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▌      | 175/500 [00:38<01:14,  4.34it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▌      | 176/500 [00:38<01:02,  5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▌      | 176/500 [00:38<01:02,  5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▌      | 177/500 [00:38<01:02,  5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  35%|███▌      | 177/500 [00:38<01:02,  5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 178/500 [00:38<01:02,  5.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 178/500 [00:39<01:02,  5.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 179/500 [00:39<01:03,  5.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 179/500 [00:39<01:03,  5.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 180/500 [00:39<01:09,  4.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 180/500 [00:39<01:09,  4.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 181/500 [00:39<01:01,  5.22it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▌      | 181/500 [00:39<01:01,  5.22it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▋      | 182/500 [00:39<01:11,  4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  36%|███▋      | 182/500 [00:40<01:11,  4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 183/500 [00:40<01:09,  4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 183/500 [00:40<01:09,  4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 184/500 [00:40<01:16,  4.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 184/500 [00:40<01:16,  4.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 185/500 [00:40<01:12,  4.36it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 185/500 [00:40<01:12,  4.36it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 186/500 [00:40<01:09,  4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 186/500 [00:41<01:09,  4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 187/500 [00:41<01:16,  4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  37%|███▋      | 187/500 [00:41<01:16,  4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 188/500 [00:41<01:12,  4.32it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 188/500 [00:41<01:12,  4.32it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 189/500 [00:41<01:18,  3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 189/500 [00:41<01:18,  3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 190/500 [00:41<01:13,  4.23it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 190/500 [00:41<01:13,  4.23it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 191/500 [00:41<01:09,  4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 191/500 [00:42<01:09,  4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 192/500 [00:42<01:07,  4.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  38%|███▊      | 192/500 [00:42<01:07,  4.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▊      | 193/500 [00:42<01:05,  4.71it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▊      | 193/500 [00:42<01:05,  4.71it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 194/500 [00:42<01:13,  4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 194/500 [00:42<01:13,  4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 195/500 [00:42<01:18,  3.91it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 195/500 [00:43<01:18,  3.91it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 196/500 [00:43<01:13,  4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 196/500 [00:43<01:13,  4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 197/500 [00:43<01:17,  3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  39%|███▉      | 197/500 [00:43<01:17,  3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|███▉      | 198/500 [00:43<01:21,  3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|███▉      | 198/500 [00:43<01:21,  3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|███▉      | 199/500 [00:43<01:06,  4.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|███▉      | 199/500 [00:43<01:06,  4.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|████      | 200/500 [00:43<01:04,  4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|████      | 200/500 [00:44<01:04,  4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|████      | 201/500 [00:44<01:11,  4.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|████      | 201/500 [00:44<01:11,  4.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|████      | 202/500 [00:44<01:16,  3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  40%|████      | 202/500 [00:44<01:16,  3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 203/500 [00:44<01:11,  4.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 203/500 [00:44<01:11,  4.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 204/500 [00:44<00:59,  5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 204/500 [00:45<00:59,  5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 205/500 [00:45<00:59,  5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 205/500 [00:45<00:59,  5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 206/500 [00:45<01:07,  4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████      | 206/500 [00:45<01:07,  4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████▏     | 207/500 [00:45<01:04,  4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  41%|████▏     | 207/500 [00:45<01:04,  4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 208/500 [00:45<01:11,  4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 208/500 [00:46<01:11,  4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 209/500 [00:46<01:15,  3.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 209/500 [00:46<01:15,  3.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 210/500 [00:46<01:01,  4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 210/500 [00:46<01:01,  4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 211/500 [00:46<01:08,  4.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 211/500 [00:46<01:08,  4.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 212/500 [00:46<01:14,  3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  42%|████▏     | 212/500 [00:47<01:14,  3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 213/500 [00:47<01:17,  3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 213/500 [00:47<01:17,  3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 214/500 [00:47<01:19,  3.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 214/500 [00:47<01:19,  3.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 215/500 [00:47<01:04,  4.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 215/500 [00:47<01:04,  4.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 216/500 [00:47<01:02,  4.57it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 216/500 [00:47<01:02,  4.57it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 217/500 [00:47<00:59,  4.72it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  43%|████▎     | 217/500 [00:48<00:59,  4.72it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▎     | 218/500 [00:48<01:06,  4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▎     | 218/500 [00:48<01:06,  4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 219/500 [00:48<00:55,  5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 219/500 [00:48<00:55,  5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 220/500 [00:48<00:55,  5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 220/500 [00:48<00:55,  5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 221/500 [00:48<00:55,  5.03it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 221/500 [00:48<00:55,  5.03it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 222/500 [00:48<00:55,  5.02it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  44%|████▍     | 222/500 [00:49<00:55,  5.02it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▍     | 223/500 [00:49<00:55,  5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▍     | 223/500 [00:49<00:55,  5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▍     | 224/500 [00:49<00:55,  5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▍     | 224/500 [00:49<00:55,  5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▌     | 225/500 [00:49<01:00,  4.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▌     | 225/500 [00:49<01:00,  4.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▌     | 226/500 [00:49<01:09,  3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▌     | 226/500 [00:50<01:09,  3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▌     | 227/500 [00:50<00:57,  4.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  45%|████▌     | 227/500 [00:50<00:57,  4.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 228/500 [00:50<01:11,  3.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 228/500 [00:50<01:11,  3.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 229/500 [00:50<01:06,  4.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 229/500 [00:50<01:06,  4.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 230/500 [00:50<01:02,  4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 230/500 [00:51<01:02,  4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 231/500 [00:51<01:07,  3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▌     | 231/500 [00:51<01:07,  3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▋     | 232/500 [00:51<00:55,  4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  46%|████▋     | 232/500 [00:51<00:55,  4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 233/500 [00:51<01:10,  3.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 233/500 [00:51<01:10,  3.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 234/500 [00:51<01:04,  4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 234/500 [00:52<01:04,  4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 235/500 [00:52<01:01,  4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 235/500 [00:52<01:01,  4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 236/500 [00:52<00:58,  4.49it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 236/500 [00:52<00:58,  4.49it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 237/500 [00:52<00:56,  4.62it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  47%|████▋     | 237/500 [00:52<00:56,  4.62it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 238/500 [00:52<01:02,  4.17it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 238/500 [00:52<01:02,  4.17it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 239/500 [00:52<00:52,  5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 239/500 [00:53<00:52,  5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 240/500 [00:53<00:51,  5.05it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 240/500 [00:53<00:51,  5.05it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 241/500 [00:53<00:44,  5.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 241/500 [00:53<00:44,  5.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 242/500 [00:53<00:46,  5.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  48%|████▊     | 242/500 [00:53<00:46,  5.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▊     | 243/500 [00:53<00:44,  5.75it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▊     | 243/500 [00:53<00:44,  5.75it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 244/500 [00:53<00:46,  5.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 244/500 [00:53<00:46,  5.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 245/500 [00:53<00:42,  5.98it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 245/500 [00:54<00:42,  5.98it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 246/500 [00:54<00:45,  5.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 246/500 [00:54<00:45,  5.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 247/500 [00:54<00:46,  5.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  49%|████▉     | 247/500 [00:54<00:46,  5.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|████▉     | 248/500 [00:54<00:47,  5.29it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|████▉     | 248/500 [00:54<00:47,  5.29it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|████▉     | 249/500 [00:54<00:48,  5.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|████▉     | 249/500 [00:54<00:48,  5.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|█████     | 250/500 [00:54<00:45,  5.44it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|█████     | 250/500 [00:55<00:45,  5.44it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|█████     | 251/500 [00:55<00:49,  4.99it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|█████     | 251/500 [00:55<00:49,  4.99it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|█████     | 252/500 [00:55<00:49,  5.00it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  50%|█████     | 252/500 [00:55<00:49,  5.00it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 253/500 [00:55<00:49,  4.98it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 253/500 [00:55<00:49,  4.98it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 254/500 [00:55<00:49,  5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 254/500 [00:55<00:49,  5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 255/500 [00:55<00:49,  5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 255/500 [00:56<00:49,  5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 256/500 [00:56<00:48,  5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████     | 256/500 [00:56<00:48,  5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████▏    | 257/500 [00:56<00:55,  4.37it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  51%|█████▏    | 257/500 [00:56<00:55,  4.37it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 258/500 [00:56<00:46,  5.23it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 258/500 [00:56<00:46,  5.23it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 259/500 [00:56<00:46,  5.17it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 259/500 [00:56<00:46,  5.17it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 260/500 [00:56<00:53,  4.46it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 260/500 [00:57<00:53,  4.46it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 261/500 [00:57<00:44,  5.32it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 261/500 [00:57<00:44,  5.32it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 262/500 [00:57<00:45,  5.21it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  52%|█████▏    | 262/500 [00:57<00:45,  5.21it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 263/500 [00:57<00:45,  5.15it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 263/500 [00:57<00:45,  5.15it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 264/500 [00:57<00:39,  5.98it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 264/500 [00:57<00:39,  5.98it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 265/500 [00:57<00:41,  5.68it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 265/500 [00:57<00:41,  5.68it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 266/500 [00:57<00:36,  6.47it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 266/500 [00:58<00:36,  6.47it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 267/500 [00:58<00:32,  7.21it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  53%|█████▎    | 267/500 [00:58<00:32,  7.21it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▎    | 268/500 [00:58<00:35,  6.45it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▎    | 268/500 [00:58<00:35,  6.45it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 269/500 [00:58<00:32,  7.15it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 269/500 [00:58<00:32,  7.15it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 270/500 [00:58<00:35,  6.40it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 270/500 [00:58<00:35,  6.40it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 271/500 [00:58<00:38,  5.89it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 271/500 [00:58<00:38,  5.89it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 272/500 [00:58<00:47,  4.82it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  54%|█████▍    | 272/500 [00:59<00:47,  4.82it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▍    | 273/500 [00:59<00:46,  4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▍    | 273/500 [00:59<00:46,  4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▍    | 274/500 [00:59<00:39,  5.73it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▍    | 274/500 [00:59<00:39,  5.73it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▌    | 275/500 [00:59<00:34,  6.53it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▌    | 275/500 [00:59<00:34,  6.53it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▌    | 276/500 [00:59<00:37,  5.96it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▌    | 276/500 [00:59<00:37,  5.96it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▌    | 277/500 [00:59<00:45,  4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  55%|█████▌    | 277/500 [01:00<00:45,  4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 278/500 [01:00<00:38,  5.71it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 278/500 [01:00<00:38,  5.71it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 279/500 [01:00<00:40,  5.51it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 279/500 [01:00<00:40,  5.51it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 280/500 [01:00<00:41,  5.33it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 280/500 [01:00<00:41,  5.33it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 281/500 [01:00<00:41,  5.23it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▌    | 281/500 [01:00<00:41,  5.23it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▋    | 282/500 [01:00<00:42,  5.16it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  56%|█████▋    | 282/500 [01:00<00:42,  5.16it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 283/500 [01:00<00:42,  5.11it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 283/500 [01:01<00:42,  5.11it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 284/500 [01:01<00:36,  5.92it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 284/500 [01:01<00:36,  5.92it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 285/500 [01:01<00:38,  5.66it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 285/500 [01:01<00:38,  5.66it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 286/500 [01:01<00:39,  5.46it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 286/500 [01:01<00:39,  5.46it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 287/500 [01:01<00:40,  5.32it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  57%|█████▋    | 287/500 [01:01<00:40,  5.32it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 288/500 [01:01<00:34,  6.17it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 288/500 [01:01<00:34,  6.17it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 289/500 [01:01<00:36,  5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 289/500 [01:02<00:36,  5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 290/500 [01:02<00:32,  6.53it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 290/500 [01:02<00:32,  6.53it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 291/500 [01:02<00:34,  6.03it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 291/500 [01:02<00:34,  6.03it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 292/500 [01:02<00:30,  6.77it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  58%|█████▊    | 292/500 [01:02<00:30,  6.77it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▊    | 293/500 [01:02<00:33,  6.18it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▊    | 293/500 [01:02<00:33,  6.18it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 294/500 [01:02<00:35,  5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 294/500 [01:03<00:35,  5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 295/500 [01:03<00:37,  5.50it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 295/500 [01:03<00:37,  5.50it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 296/500 [01:03<00:38,  5.35it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 296/500 [01:03<00:38,  5.35it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 297/500 [01:03<00:38,  5.24it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  59%|█████▉    | 297/500 [01:03<00:38,  5.24it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  60%|█████▉    | 298/500 [01:03<00:44,  4.50it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  60%|█████▉    | 298/500 [01:03<00:44,  4.50it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  60%|█████▉    | 299/500 [01:03<00:37,  5.36it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  60%|█████▉    | 299/500 [01:03<00:37,  5.36it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01:  60%|██████    | 300/500 [01:04<00:38,  5.24it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  60%|██████    | 300/500 [01:04<00:38,  5.24it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  60%|██████    | 301/500 [01:04<00:38,  5.19it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  60%|██████    | 301/500 [01:04<00:38,  5.19it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  60%|██████    | 302/500 [01:04<00:32,  6.04it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  60%|██████    | 302/500 [01:04<00:32,  6.04it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 303/500 [01:04<00:34,  5.66it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 303/500 [01:04<00:34,  5.66it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 304/500 [01:04<00:35,  5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 304/500 [01:04<00:35,  5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 305/500 [01:04<00:36,  5.28it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 305/500 [01:05<00:36,  5.28it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 306/500 [01:05<00:37,  5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████    | 306/500 [01:05<00:37,  5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████▏   | 307/500 [01:05<00:37,  5.18it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  61%|██████▏   | 307/500 [01:05<00:37,  5.18it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 308/500 [01:05<00:37,  5.08it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 308/500 [01:05<00:37,  5.08it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 309/500 [01:05<00:32,  5.95it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 309/500 [01:05<00:32,  5.95it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 310/500 [01:05<00:33,  5.68it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 310/500 [01:05<00:33,  5.68it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 311/500 [01:05<00:34,  5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 311/500 [01:06<00:34,  5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 312/500 [01:06<00:29,  6.29it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  62%|██████▏   | 312/500 [01:06<00:29,  6.29it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 313/500 [01:06<00:26,  7.01it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 313/500 [01:06<00:26,  7.01it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 314/500 [01:06<00:34,  5.34it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 314/500 [01:06<00:34,  5.34it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 315/500 [01:06<00:35,  5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 315/500 [01:06<00:35,  5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 316/500 [01:06<00:35,  5.12it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 316/500 [01:07<00:35,  5.12it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 317/500 [01:07<00:35,  5.09it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  63%|██████▎   | 317/500 [01:07<00:35,  5.09it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▎   | 318/500 [01:07<00:33,  5.38it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▎   | 318/500 [01:07<00:33,  5.38it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 319/500 [01:07<00:30,  5.84it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 319/500 [01:07<00:30,  5.84it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 320/500 [01:07<00:32,  5.57it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 320/500 [01:07<00:32,  5.57it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 321/500 [01:07<00:38,  4.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 321/500 [01:08<00:38,  4.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 322/500 [01:08<00:37,  4.76it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  64%|██████▍   | 322/500 [01:08<00:37,  4.76it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▍   | 323/500 [01:08<00:31,  5.59it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▍   | 323/500 [01:08<00:31,  5.59it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▍   | 324/500 [01:08<00:32,  5.39it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▍   | 324/500 [01:08<00:32,  5.39it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▌   | 325/500 [01:08<00:38,  4.54it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▌   | 325/500 [01:08<00:38,  4.54it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▌   | 326/500 [01:08<00:37,  4.69it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▌   | 326/500 [01:09<00:37,  4.69it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▌   | 327/500 [01:09<00:36,  4.79it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  65%|██████▌   | 327/500 [01:09<00:36,  4.79it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 328/500 [01:09<00:35,  4.86it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 328/500 [01:09<00:35,  4.86it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 329/500 [01:09<00:29,  5.72it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 329/500 [01:09<00:29,  5.72it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 330/500 [01:09<00:31,  5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 330/500 [01:09<00:31,  5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 331/500 [01:09<00:31,  5.33it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▌   | 331/500 [01:09<00:31,  5.33it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▋   | 332/500 [01:09<00:32,  5.24it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  66%|██████▋   | 332/500 [01:10<00:32,  5.24it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 333/500 [01:10<00:27,  6.05it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 333/500 [01:10<00:27,  6.05it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 334/500 [01:10<00:29,  5.71it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 334/500 [01:10<00:29,  5.71it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 335/500 [01:10<00:30,  5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 335/500 [01:10<00:30,  5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 336/500 [01:10<00:30,  5.34it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 336/500 [01:10<00:30,  5.34it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 337/500 [01:10<00:29,  5.56it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  67%|██████▋   | 337/500 [01:11<00:29,  5.56it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 338/500 [01:11<00:27,  5.99it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 338/500 [01:11<00:27,  5.99it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 339/500 [01:11<00:28,  5.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 339/500 [01:11<00:28,  5.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 340/500 [01:11<00:29,  5.45it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 340/500 [01:11<00:29,  5.45it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 341/500 [01:11<00:29,  5.31it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 341/500 [01:11<00:29,  5.31it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 342/500 [01:11<00:25,  6.10it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  68%|██████▊   | 342/500 [01:12<00:25,  6.10it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▊   | 343/500 [01:12<00:31,  4.92it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▊   | 343/500 [01:12<00:31,  4.92it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 344/500 [01:12<00:31,  4.95it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 344/500 [01:12<00:31,  4.95it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 345/500 [01:12<00:31,  4.96it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 345/500 [01:12<00:31,  4.96it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 346/500 [01:12<00:30,  4.97it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 346/500 [01:12<00:30,  4.97it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 347/500 [01:12<00:28,  5.28it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  69%|██████▉   | 347/500 [01:12<00:28,  5.28it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  70%|██████▉   | 348/500 [01:12<00:26,  5.74it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  70%|██████▉   | 348/500 [01:13<00:26,  5.74it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  70%|██████▉   | 349/500 [01:13<00:27,  5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  70%|██████▉   | 349/500 [01:13<00:27,  5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01:  70%|███████   | 350/500 [01:13<00:28,  5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  70%|███████   | 350/500 [01:13<00:28,  5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  70%|███████   | 351/500 [01:13<00:24,  6.16it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  70%|███████   | 351/500 [01:13<00:24,  6.16it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  70%|███████   | 352/500 [01:13<00:25,  5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  70%|███████   | 352/500 [01:13<00:25,  5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 353/500 [01:13<00:24,  5.91it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 353/500 [01:13<00:24,  5.91it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 354/500 [01:13<00:23,  6.28it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 354/500 [01:14<00:23,  6.28it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 355/500 [01:14<00:20,  6.99it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 355/500 [01:14<00:20,  6.99it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 356/500 [01:14<00:22,  6.30it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████   | 356/500 [01:14<00:22,  6.30it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████▏  | 357/500 [01:14<00:20,  7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  71%|███████▏  | 357/500 [01:14<00:20,  7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 358/500 [01:14<00:22,  6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 358/500 [01:14<00:22,  6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 359/500 [01:14<00:20,  7.00it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 359/500 [01:14<00:20,  7.00it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 360/500 [01:14<00:22,  6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 360/500 [01:14<00:22,  6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 361/500 [01:14<00:19,  7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 361/500 [01:15<00:19,  7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 362/500 [01:15<00:21,  6.33it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  72%|███████▏  | 362/500 [01:15<00:21,  6.33it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 363/500 [01:15<00:19,  7.04it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 363/500 [01:15<00:19,  7.04it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 364/500 [01:15<00:21,  6.34it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 364/500 [01:15<00:21,  6.34it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 365/500 [01:15<00:23,  5.86it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 365/500 [01:15<00:23,  5.86it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 366/500 [01:15<00:20,  6.61it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 366/500 [01:15<00:20,  6.61it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 367/500 [01:15<00:21,  6.10it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  73%|███████▎  | 367/500 [01:16<00:21,  6.10it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▎  | 368/500 [01:16<00:23,  5.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▎  | 368/500 [01:16<00:23,  5.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 369/500 [01:16<00:20,  6.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 369/500 [01:16<00:20,  6.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 370/500 [01:16<00:17,  7.23it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 370/500 [01:16<00:17,  7.23it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 371/500 [01:16<00:20,  6.45it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 371/500 [01:16<00:20,  6.45it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 372/500 [01:16<00:21,  5.95it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  74%|███████▍  | 372/500 [01:16<00:21,  5.95it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▍  | 373/500 [01:16<00:18,  6.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▍  | 373/500 [01:16<00:18,  6.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▍  | 374/500 [01:16<00:17,  7.41it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▍  | 374/500 [01:17<00:17,  7.41it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▌  | 375/500 [01:17<00:19,  6.56it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▌  | 375/500 [01:17<00:19,  6.56it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▌  | 376/500 [01:17<00:17,  7.24it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▌  | 376/500 [01:17<00:17,  7.24it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▌  | 377/500 [01:17<00:22,  5.40it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  75%|███████▌  | 377/500 [01:17<00:22,  5.40it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 378/500 [01:17<00:19,  6.22it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 378/500 [01:17<00:19,  6.22it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 379/500 [01:17<00:17,  6.94it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 379/500 [01:17<00:17,  6.94it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 380/500 [01:17<00:19,  6.27it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 380/500 [01:18<00:19,  6.27it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 381/500 [01:18<00:20,  5.85it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▌  | 381/500 [01:18<00:20,  5.85it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▋  | 382/500 [01:18<00:21,  5.58it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  76%|███████▋  | 382/500 [01:18<00:21,  5.58it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 383/500 [01:18<00:21,  5.38it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 383/500 [01:18<00:21,  5.38it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 384/500 [01:18<00:18,  6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 384/500 [01:18<00:18,  6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 385/500 [01:18<00:19,  5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 385/500 [01:18<00:19,  5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 386/500 [01:18<00:20,  5.55it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 386/500 [01:19<00:20,  5.55it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 387/500 [01:19<00:20,  5.39it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  77%|███████▋  | 387/500 [01:19<00:20,  5.39it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 388/500 [01:19<00:21,  5.26it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 388/500 [01:19<00:21,  5.26it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 389/500 [01:19<00:21,  5.18it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 389/500 [01:19<00:21,  5.18it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 390/500 [01:19<00:21,  5.13it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 390/500 [01:19<00:21,  5.13it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 391/500 [01:19<00:21,  5.09it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 391/500 [01:20<00:21,  5.09it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 392/500 [01:20<00:18,  5.92it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  78%|███████▊  | 392/500 [01:20<00:18,  5.92it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▊  | 393/500 [01:20<00:22,  4.82it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▊  | 393/500 [01:20<00:22,  4.82it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 394/500 [01:20<00:21,  4.88it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 394/500 [01:20<00:21,  4.88it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 395/500 [01:20<00:18,  5.71it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 395/500 [01:20<00:18,  5.71it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 396/500 [01:20<00:18,  5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 396/500 [01:21<00:18,  5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 397/500 [01:21<00:19,  5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  79%|███████▉  | 397/500 [01:21<00:19,  5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  80%|███████▉  | 398/500 [01:21<00:16,  6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  80%|███████▉  | 398/500 [01:21<00:16,  6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  80%|███████▉  | 399/500 [01:21<00:17,  5.78it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  80%|███████▉  | 399/500 [01:21<00:17,  5.78it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01:  80%|████████  | 400/500 [01:21<00:18,  5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  80%|████████  | 400/500 [01:21<00:18,  5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  80%|████████  | 401/500 [01:21<00:21,  4.64it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  80%|████████  | 401/500 [01:22<00:21,  4.64it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  80%|████████  | 402/500 [01:22<00:17,  5.48it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  80%|████████  | 402/500 [01:22<00:17,  5.48it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 403/500 [01:22<00:18,  5.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 403/500 [01:22<00:18,  5.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 404/500 [01:22<00:18,  5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 404/500 [01:22<00:18,  5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 405/500 [01:22<00:15,  6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 405/500 [01:22<00:15,  6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 406/500 [01:22<00:16,  5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████  | 406/500 [01:22<00:16,  5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████▏ | 407/500 [01:22<00:14,  6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  81%|████████▏ | 407/500 [01:22<00:14,  6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 408/500 [01:22<00:15,  6.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 408/500 [01:23<00:15,  6.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 409/500 [01:23<00:13,  6.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 409/500 [01:23<00:13,  6.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 410/500 [01:23<00:14,  6.14it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 410/500 [01:23<00:14,  6.14it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 411/500 [01:23<00:15,  5.79it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 411/500 [01:23<00:15,  5.79it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 412/500 [01:23<00:13,  6.58it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  82%|████████▏ | 412/500 [01:23<00:13,  6.58it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 413/500 [01:23<00:14,  6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 413/500 [01:23<00:14,  6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 414/500 [01:23<00:12,  6.83it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 414/500 [01:24<00:12,  6.83it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 415/500 [01:24<00:16,  5.22it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 415/500 [01:24<00:16,  5.22it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 416/500 [01:24<00:13,  6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 416/500 [01:24<00:13,  6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 417/500 [01:24<00:14,  5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  83%|████████▎ | 417/500 [01:24<00:14,  5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▎ | 418/500 [01:24<00:14,  5.50it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▎ | 418/500 [01:24<00:14,  5.50it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 419/500 [01:24<00:12,  6.28it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 419/500 [01:24<00:12,  6.28it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 420/500 [01:25<00:13,  5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 420/500 [01:25<00:13,  5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 421/500 [01:25<00:14,  5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 421/500 [01:25<00:14,  5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 422/500 [01:25<00:14,  5.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  84%|████████▍ | 422/500 [01:25<00:14,  5.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▍ | 423/500 [01:25<00:12,  6.29it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▍ | 423/500 [01:25<00:12,  6.29it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▍ | 424/500 [01:25<00:10,  6.97it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▍ | 424/500 [01:25<00:10,  6.97it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▌ | 425/500 [01:25<00:11,  6.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▌ | 425/500 [01:25<00:11,  6.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▌ | 426/500 [01:25<00:10,  6.98it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▌ | 426/500 [01:26<00:10,  6.98it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▌ | 427/500 [01:26<00:11,  6.31it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  85%|████████▌ | 427/500 [01:26<00:11,  6.31it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 428/500 [01:26<00:10,  7.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 428/500 [01:26<00:10,  7.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 429/500 [01:26<00:11,  6.34it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 429/500 [01:26<00:11,  6.34it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 430/500 [01:26<00:11,  5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 430/500 [01:26<00:11,  5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 431/500 [01:26<00:10,  6.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▌ | 431/500 [01:26<00:10,  6.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▋ | 432/500 [01:26<00:11,  6.10it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  86%|████████▋ | 432/500 [01:27<00:11,  6.10it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 433/500 [01:27<00:11,  5.73it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 433/500 [01:27<00:11,  5.73it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 434/500 [01:27<00:10,  6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 434/500 [01:27<00:10,  6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 435/500 [01:27<00:10,  6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 435/500 [01:27<00:10,  6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 436/500 [01:27<00:09,  6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 436/500 [01:27<00:09,  6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 437/500 [01:27<00:08,  7.41it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  87%|████████▋ | 437/500 [01:27<00:08,  7.41it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 438/500 [01:27<00:09,  6.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 438/500 [01:27<00:09,  6.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 439/500 [01:27<00:10,  6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 439/500 [01:28<00:10,  6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 440/500 [01:28<00:10,  5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 440/500 [01:28<00:10,  5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 441/500 [01:28<00:09,  6.45it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 441/500 [01:28<00:09,  6.45it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 442/500 [01:28<00:08,  7.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  88%|████████▊ | 442/500 [01:28<00:08,  7.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▊ | 443/500 [01:28<00:08,  6.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▊ | 443/500 [01:28<00:08,  6.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 444/500 [01:28<00:11,  5.07it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 444/500 [01:29<00:11,  5.07it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 445/500 [01:29<00:10,  5.05it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 445/500 [01:29<00:10,  5.05it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 446/500 [01:29<00:09,  5.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 446/500 [01:29<00:09,  5.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 447/500 [01:29<00:09,  5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  89%|████████▉ | 447/500 [01:29<00:09,  5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  90%|████████▉ | 448/500 [01:29<00:09,  5.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  90%|████████▉ | 448/500 [01:29<00:09,  5.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  90%|████████▉ | 449/500 [01:29<00:08,  6.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  90%|████████▉ | 449/500 [01:29<00:08,  6.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01:  90%|█████████ | 450/500 [01:29<00:08,  5.81it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  90%|█████████ | 450/500 [01:30<00:08,  5.81it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  90%|█████████ | 451/500 [01:30<00:08,  5.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  90%|█████████ | 451/500 [01:30<00:08,  5.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  90%|█████████ | 452/500 [01:30<00:08,  5.37it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  90%|█████████ | 452/500 [01:30<00:08,  5.37it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 453/500 [01:30<00:08,  5.27it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 453/500 [01:30<00:08,  5.27it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 454/500 [01:30<00:08,  5.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 454/500 [01:30<00:08,  5.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 455/500 [01:30<00:07,  6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 455/500 [01:30<00:07,  6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 456/500 [01:30<00:06,  6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████ | 456/500 [01:31<00:06,  6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████▏| 457/500 [01:31<00:06,  6.18it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  91%|█████████▏| 457/500 [01:31<00:06,  6.18it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 458/500 [01:31<00:07,  5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 458/500 [01:31<00:07,  5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 459/500 [01:31<00:07,  5.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 459/500 [01:31<00:07,  5.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 460/500 [01:31<00:07,  5.33it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 460/500 [01:32<00:07,  5.33it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 461/500 [01:32<00:08,  4.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 461/500 [01:32<00:08,  4.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 462/500 [01:32<00:08,  4.66it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  92%|█████████▏| 462/500 [01:32<00:08,  4.66it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 463/500 [01:32<00:07,  4.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 463/500 [01:32<00:07,  4.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 464/500 [01:32<00:07,  4.84it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 464/500 [01:32<00:07,  4.84it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 465/500 [01:32<00:07,  4.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 465/500 [01:32<00:07,  4.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 466/500 [01:32<00:06,  4.93it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 466/500 [01:33<00:06,  4.93it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 467/500 [01:33<00:05,  5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  93%|█████████▎| 467/500 [01:33<00:05,  5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▎| 468/500 [01:33<00:05,  5.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▎| 468/500 [01:33<00:05,  5.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 469/500 [01:33<00:04,  6.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 469/500 [01:33<00:04,  6.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 470/500 [01:33<00:05,  5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 470/500 [01:33<00:05,  5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 471/500 [01:33<00:05,  5.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 471/500 [01:33<00:05,  5.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 472/500 [01:33<00:04,  6.49it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  94%|█████████▍| 472/500 [01:34<00:04,  6.49it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▍| 473/500 [01:34<00:04,  5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▍| 473/500 [01:34<00:04,  5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▍| 474/500 [01:34<00:04,  5.63it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▍| 474/500 [01:34<00:04,  5.63it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▌| 475/500 [01:34<00:03,  6.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▌| 475/500 [01:34<00:03,  6.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▌| 476/500 [01:34<00:03,  7.12it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▌| 476/500 [01:34<00:03,  7.12it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▌| 477/500 [01:34<00:03,  6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  95%|█████████▌| 477/500 [01:34<00:03,  6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 478/500 [01:34<00:03,  7.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 478/500 [01:34<00:03,  7.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 479/500 [01:34<00:03,  6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 479/500 [01:35<00:03,  6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 480/500 [01:35<00:03,  5.91it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 480/500 [01:35<00:03,  5.91it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 481/500 [01:35<00:03,  5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▌| 481/500 [01:35<00:03,  5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▋| 482/500 [01:35<00:03,  5.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  96%|█████████▋| 482/500 [01:35<00:03,  5.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 483/500 [01:35<00:03,  5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 483/500 [01:35<00:03,  5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 484/500 [01:35<00:02,  6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 484/500 [01:36<00:02,  6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 485/500 [01:36<00:02,  5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 485/500 [01:36<00:02,  5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 486/500 [01:36<00:02,  5.52it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 486/500 [01:36<00:02,  5.52it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 487/500 [01:36<00:02,  6.32it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  97%|█████████▋| 487/500 [01:36<00:02,  6.32it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 488/500 [01:36<00:02,  5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 488/500 [01:36<00:02,  5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 489/500 [01:36<00:01,  5.60it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 489/500 [01:36<00:01,  5.60it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 490/500 [01:36<00:01,  6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 490/500 [01:37<00:01,  6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 491/500 [01:37<00:01,  5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 491/500 [01:37<00:01,  5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 492/500 [01:37<00:01,  5.62it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  98%|█████████▊| 492/500 [01:37<00:01,  5.62it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▊| 493/500 [01:37<00:01,  5.43it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▊| 493/500 [01:37<00:01,  5.43it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 494/500 [01:37<00:00,  6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 494/500 [01:37<00:00,  6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 495/500 [01:37<00:00,  5.85it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 495/500 [01:37<00:00,  5.85it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 496/500 [01:37<00:00,  6.62it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 496/500 [01:38<00:00,  6.62it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 497/500 [01:38<00:00,  6.09it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01:  99%|█████████▉| 497/500 [01:38<00:00,  6.09it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 498/500 [01:38<00:00,  6.84it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 498/500 [01:38<00:00,  6.84it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 499/500 [01:38<00:00,  6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 499/500 [01:38<00:00,  6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|██████████| 500/500 [01:38<00:00,  5.75it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|██████████| 500/500 [01:38<00:00,  5.07it/s]
[TorchDR] COSNE: Starting on device 'None' with backend 'None'.
[TorchDR] COSNE: [Step 1/2] --- Computing the input affinity matrix with EntropicAffinity ---
[TorchDR] COSNE: [Step 2/2] --- Optimizing the embedding ---

  0%|          | 0/500 [00:00<?, ?it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 3.46e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] COSNE: Loss: 3.46e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   0%|          | 1/500 [00:00<04:08,  2.01it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 3.30e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   0%|          | 1/500 [00:00<04:08,  2.01it/s]
[TorchDR] COSNE: Loss: 3.30e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   0%|          | 2/500 [00:00<04:09,  2.00it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 3.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   0%|          | 2/500 [00:01<04:09,  2.00it/s]
[TorchDR] COSNE: Loss: 3.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 3/500 [00:01<03:45,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 3.01e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 3/500 [00:01<03:45,  2.20it/s]
[TorchDR] COSNE: Loss: 3.01e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 4/500 [00:01<03:15,  2.54it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 2.85e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 4/500 [00:02<03:15,  2.54it/s]
[TorchDR] COSNE: Loss: 2.85e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 5/500 [00:02<03:15,  2.53it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 2.70e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 5/500 [00:02<03:15,  2.53it/s]
[TorchDR] COSNE: Loss: 2.70e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 6/500 [00:02<03:32,  2.32it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 2.54e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|          | 6/500 [00:03<03:32,  2.32it/s]
[TorchDR] COSNE: Loss: 2.54e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|▏         | 7/500 [00:03<03:43,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 2.37e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   1%|▏         | 7/500 [00:03<03:43,  2.20it/s]
[TorchDR] COSNE: Loss: 2.37e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 8/500 [00:03<03:50,  2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 2.20e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 8/500 [00:04<03:50,  2.14it/s]
[TorchDR] COSNE: Loss: 2.20e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 9/500 [00:04<03:39,  2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 2.04e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 9/500 [00:04<03:39,  2.23it/s]
[TorchDR] COSNE: Loss: 2.04e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 10/500 [00:04<03:32,  2.30it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.87e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 10/500 [00:04<03:32,  2.30it/s]
[TorchDR] COSNE: Loss: 1.87e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 11/500 [00:04<03:41,  2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.72e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 11/500 [00:05<03:41,  2.21it/s]
[TorchDR] COSNE: Loss: 1.72e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 12/500 [00:05<03:34,  2.27it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.58e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   2%|▏         | 12/500 [00:05<03:34,  2.27it/s]
[TorchDR] COSNE: Loss: 1.58e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 13/500 [00:05<03:56,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.45e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 13/500 [00:06<03:56,  2.06it/s]
[TorchDR] COSNE: Loss: 1.45e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 14/500 [00:06<03:57,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.34e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 14/500 [00:07<03:57,  2.04it/s]
[TorchDR] COSNE: Loss: 1.34e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 15/500 [00:07<04:14,  1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 15/500 [00:07<04:14,  1.91it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 16/500 [00:07<04:09,  1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 16/500 [00:08<04:09,  1.94it/s]
[TorchDR] COSNE: Loss: 1.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 17/500 [00:08<04:06,  1.96it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.09e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   3%|▎         | 17/500 [00:08<04:06,  1.96it/s]
[TorchDR] COSNE: Loss: 1.09e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▎         | 18/500 [00:08<03:50,  2.09it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.03e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▎         | 18/500 [00:08<03:50,  2.09it/s]
[TorchDR] COSNE: Loss: 1.03e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 19/500 [00:08<03:38,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 9.84e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 19/500 [00:09<03:38,  2.20it/s]
[TorchDR] COSNE: Loss: 9.84e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 20/500 [00:09<03:30,  2.28it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 9.46e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 20/500 [00:09<03:30,  2.28it/s]
[TorchDR] COSNE: Loss: 9.46e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 21/500 [00:09<03:24,  2.35it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 9.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 21/500 [00:09<03:24,  2.35it/s]
[TorchDR] COSNE: Loss: 9.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 22/500 [00:09<03:19,  2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 8.89e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   4%|▍         | 22/500 [00:10<03:19,  2.39it/s]
[TorchDR] COSNE: Loss: 8.89e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▍         | 23/500 [00:10<03:02,  2.61it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 8.67e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▍         | 23/500 [00:10<03:02,  2.61it/s]
[TorchDR] COSNE: Loss: 8.67e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▍         | 24/500 [00:10<03:18,  2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 8.50e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▍         | 24/500 [00:11<03:18,  2.39it/s]
[TorchDR] COSNE: Loss: 8.50e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▌         | 25/500 [00:11<03:30,  2.25it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 8.35e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▌         | 25/500 [00:11<03:30,  2.25it/s]
[TorchDR] COSNE: Loss: 8.35e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▌         | 26/500 [00:11<03:23,  2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 8.22e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▌         | 26/500 [00:12<03:23,  2.33it/s]
[TorchDR] COSNE: Loss: 8.22e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▌         | 27/500 [00:12<03:05,  2.55it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 8.07e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   5%|▌         | 27/500 [00:12<03:05,  2.55it/s]
[TorchDR] COSNE: Loss: 8.07e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 28/500 [00:12<03:05,  2.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 7.92e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 28/500 [00:12<03:05,  2.54it/s]
[TorchDR] COSNE: Loss: 7.92e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 29/500 [00:12<03:06,  2.53it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 7.76e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 29/500 [00:13<03:06,  2.53it/s]
[TorchDR] COSNE: Loss: 7.76e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 30/500 [00:13<03:06,  2.51it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 7.59e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 30/500 [00:13<03:06,  2.51it/s]
[TorchDR] COSNE: Loss: 7.59e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 31/500 [00:13<03:20,  2.34it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 7.42e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▌         | 31/500 [00:14<03:20,  2.34it/s]
[TorchDR] COSNE: Loss: 7.42e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▋         | 32/500 [00:14<03:16,  2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 7.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   6%|▋         | 32/500 [00:14<03:16,  2.39it/s]
[TorchDR] COSNE: Loss: 7.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 33/500 [00:14<03:12,  2.42it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 7.13e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 33/500 [00:14<03:12,  2.42it/s]
[TorchDR] COSNE: Loss: 7.13e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 34/500 [00:14<02:43,  2.85it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 7.01e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 34/500 [00:15<02:43,  2.85it/s]
[TorchDR] COSNE: Loss: 7.01e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 35/500 [00:15<02:49,  2.75it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.91e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 35/500 [00:15<02:49,  2.75it/s]
[TorchDR] COSNE: Loss: 6.91e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 36/500 [00:15<02:40,  2.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.82e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 36/500 [00:15<02:40,  2.89it/s]
[TorchDR] COSNE: Loss: 6.82e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 37/500 [00:15<03:01,  2.56it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 6.75e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   7%|▋         | 37/500 [00:16<03:01,  2.56it/s]
[TorchDR] COSNE: Loss: 6.75e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 38/500 [00:16<03:02,  2.53it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.69e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 38/500 [00:16<03:02,  2.53it/s]
[TorchDR] COSNE: Loss: 6.69e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 39/500 [00:16<03:16,  2.35it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 6.63e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 39/500 [00:17<03:16,  2.35it/s]
[TorchDR] COSNE: Loss: 6.63e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 40/500 [00:17<02:59,  2.56it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.57e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 40/500 [00:17<02:59,  2.56it/s]
[TorchDR] COSNE: Loss: 6.57e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 41/500 [00:17<03:27,  2.21it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.52e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 41/500 [00:18<03:27,  2.21it/s]
[TorchDR] COSNE: Loss: 6.52e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 42/500 [00:18<03:19,  2.29it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.47e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   8%|▊         | 42/500 [00:18<03:19,  2.29it/s]
[TorchDR] COSNE: Loss: 6.47e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▊         | 43/500 [00:18<03:27,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 6.41e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▊         | 43/500 [00:18<03:27,  2.20it/s]
[TorchDR] COSNE: Loss: 6.41e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 44/500 [00:18<03:19,  2.29it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.36e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 44/500 [00:19<03:19,  2.29it/s]
[TorchDR] COSNE: Loss: 6.36e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 45/500 [00:19<03:00,  2.52it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 6.32e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 45/500 [00:19<03:00,  2.52it/s]
[TorchDR] COSNE: Loss: 6.32e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 46/500 [00:19<03:00,  2.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 6.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 46/500 [00:20<03:00,  2.51it/s]
[TorchDR] COSNE: Loss: 6.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 47/500 [00:20<03:00,  2.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.23e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:   9%|▉         | 47/500 [00:20<03:00,  2.51it/s]
[TorchDR] COSNE: Loss: 6.23e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:  10%|▉         | 48/500 [00:20<02:47,  2.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 6.18e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:  10%|▉         | 48/500 [00:20<02:47,  2.70it/s]
[TorchDR] COSNE: Loss: 6.18e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:  10%|▉         | 49/500 [00:20<03:03,  2.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 6.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:  10%|▉         | 49/500 [00:21<03:03,  2.46it/s]
[TorchDR] COSNE: Loss: 6.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01:  10%|█         | 50/500 [00:21<02:48,  2.66it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.11e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  10%|█         | 50/500 [00:21<02:48,  2.66it/s]
[TorchDR] COSNE: Loss: 6.11e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  10%|█         | 51/500 [00:21<02:38,  2.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.08e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  10%|█         | 51/500 [00:21<02:38,  2.83it/s]
[TorchDR] COSNE: Loss: 6.08e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  10%|█         | 52/500 [00:21<02:44,  2.72it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.05e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  10%|█         | 52/500 [00:22<02:44,  2.72it/s]
[TorchDR] COSNE: Loss: 6.05e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 53/500 [00:22<02:48,  2.65it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.03e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 53/500 [00:22<02:48,  2.65it/s]
[TorchDR] COSNE: Loss: 6.03e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 54/500 [00:22<02:38,  2.82it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 6.01e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 54/500 [00:22<02:38,  2.82it/s]
[TorchDR] COSNE: Loss: 6.01e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 55/500 [00:22<02:43,  2.72it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.98e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 55/500 [00:23<02:43,  2.72it/s]
[TorchDR] COSNE: Loss: 5.98e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 56/500 [00:23<02:47,  2.65it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.96e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█         | 56/500 [00:23<02:47,  2.65it/s]
[TorchDR] COSNE: Loss: 5.96e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█▏        | 57/500 [00:23<02:49,  2.61it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.93e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  11%|█▏        | 57/500 [00:24<02:49,  2.61it/s]
[TorchDR] COSNE: Loss: 5.93e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 58/500 [00:24<02:51,  2.57it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.91e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 58/500 [00:24<02:51,  2.57it/s]
[TorchDR] COSNE: Loss: 5.91e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 59/500 [00:24<02:53,  2.54it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.89e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 59/500 [00:25<02:53,  2.54it/s]
[TorchDR] COSNE: Loss: 5.89e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 60/500 [00:25<03:06,  2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.87e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 60/500 [00:25<03:06,  2.36it/s]
[TorchDR] COSNE: Loss: 5.87e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 61/500 [00:25<03:15,  2.24it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.85e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 61/500 [00:25<03:15,  2.24it/s]
[TorchDR] COSNE: Loss: 5.85e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 62/500 [00:25<02:57,  2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.84e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  12%|█▏        | 62/500 [00:26<02:57,  2.47it/s]
[TorchDR] COSNE: Loss: 5.84e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 63/500 [00:26<02:55,  2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.83e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 63/500 [00:26<02:55,  2.49it/s]
[TorchDR] COSNE: Loss: 5.83e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 64/500 [00:26<02:54,  2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.82e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 64/500 [00:27<02:54,  2.49it/s]
[TorchDR] COSNE: Loss: 5.82e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 65/500 [00:27<02:54,  2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.80e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 65/500 [00:27<02:54,  2.49it/s]
[TorchDR] COSNE: Loss: 5.80e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 66/500 [00:27<03:06,  2.32it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.79e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 66/500 [00:28<03:06,  2.32it/s]
[TorchDR] COSNE: Loss: 5.79e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 67/500 [00:28<03:02,  2.37it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.78e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  13%|█▎        | 67/500 [00:28<03:02,  2.37it/s]
[TorchDR] COSNE: Loss: 5.78e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▎        | 68/500 [00:28<03:12,  2.25it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.77e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▎        | 68/500 [00:28<03:12,  2.25it/s]
[TorchDR] COSNE: Loss: 5.77e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 69/500 [00:28<02:53,  2.48it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.76e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 69/500 [00:29<02:53,  2.48it/s]
[TorchDR] COSNE: Loss: 5.76e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 70/500 [00:29<02:52,  2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.75e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 70/500 [00:29<02:52,  2.49it/s]
[TorchDR] COSNE: Loss: 5.75e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 71/500 [00:29<03:04,  2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.74e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 71/500 [00:30<03:04,  2.33it/s]
[TorchDR] COSNE: Loss: 5.74e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 72/500 [00:30<02:47,  2.55it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  14%|█▍        | 72/500 [00:30<02:47,  2.55it/s]
[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▍        | 73/500 [00:30<03:13,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▍        | 73/500 [00:31<03:13,  2.20it/s]
[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▍        | 74/500 [00:31<03:31,  2.01it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.72e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▍        | 74/500 [00:31<03:31,  2.01it/s]
[TorchDR] COSNE: Loss: 5.72e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▌        | 75/500 [00:31<03:44,  1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▌        | 75/500 [00:32<03:44,  1.89it/s]
[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▌        | 76/500 [00:32<03:15,  2.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▌        | 76/500 [00:32<03:15,  2.17it/s]
[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▌        | 77/500 [00:32<03:06,  2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  15%|█▌        | 77/500 [00:32<03:06,  2.27it/s]
[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 78/500 [00:32<03:01,  2.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 78/500 [00:33<03:01,  2.32it/s]
[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 79/500 [00:33<03:09,  2.22it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 79/500 [00:33<03:09,  2.22it/s]
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 80/500 [00:33<03:15,  2.15it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 80/500 [00:34<03:15,  2.15it/s]
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 81/500 [00:34<03:07,  2.24it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▌        | 81/500 [00:34<03:07,  2.24it/s]
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▋        | 82/500 [00:34<03:25,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  16%|█▋        | 82/500 [00:35<03:25,  2.04it/s]
[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 83/500 [00:35<03:26,  2.02it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 83/500 [00:35<03:26,  2.02it/s]
[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 84/500 [00:35<03:14,  2.14it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 84/500 [00:36<03:14,  2.14it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 85/500 [00:36<03:17,  2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 85/500 [00:36<03:17,  2.10it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 86/500 [00:36<03:07,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 86/500 [00:37<03:07,  2.20it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 87/500 [00:37<03:24,  2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  17%|█▋        | 87/500 [00:37<03:24,  2.02it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 88/500 [00:37<03:13,  2.13it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 88/500 [00:38<03:13,  2.13it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 89/500 [00:38<03:16,  2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 89/500 [00:38<03:16,  2.10it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 90/500 [00:38<02:54,  2.35it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 90/500 [00:39<02:54,  2.35it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 91/500 [00:39<03:03,  2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 91/500 [00:39<03:03,  2.23it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 92/500 [00:39<03:20,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  18%|█▊        | 92/500 [00:40<03:20,  2.03it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▊        | 93/500 [00:40<03:21,  2.02it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▊        | 93/500 [00:40<03:21,  2.02it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 94/500 [00:40<03:21,  2.01it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 94/500 [00:41<03:21,  2.01it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 95/500 [00:41<03:20,  2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 95/500 [00:41<03:20,  2.02it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 96/500 [00:41<03:08,  2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 96/500 [00:41<03:08,  2.14it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 97/500 [00:41<02:48,  2.39it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  19%|█▉        | 97/500 [00:42<02:48,  2.39it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  20%|█▉        | 98/500 [00:42<02:46,  2.42it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  20%|█▉        | 98/500 [00:42<02:46,  2.42it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  20%|█▉        | 99/500 [00:42<02:43,  2.45it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  20%|█▉        | 99/500 [00:42<02:43,  2.45it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01:  20%|██        | 100/500 [00:42<02:30,  2.66it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  20%|██        | 100/500 [00:43<02:30,  2.66it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  20%|██        | 101/500 [00:43<02:32,  2.61it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  20%|██        | 101/500 [00:43<02:32,  2.61it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  20%|██        | 102/500 [00:43<02:34,  2.58it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  20%|██        | 102/500 [00:44<02:34,  2.58it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 103/500 [00:44<02:35,  2.55it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 103/500 [00:44<02:35,  2.55it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 104/500 [00:44<02:47,  2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 104/500 [00:45<02:47,  2.36it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 105/500 [00:45<03:08,  2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 105/500 [00:45<03:08,  2.10it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 106/500 [00:45<02:46,  2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██        | 106/500 [00:45<02:46,  2.36it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██▏       | 107/500 [00:45<02:43,  2.40it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  21%|██▏       | 107/500 [00:46<02:43,  2.40it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 108/500 [00:46<02:41,  2.43it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 108/500 [00:46<02:41,  2.43it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 109/500 [00:46<02:39,  2.45it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 109/500 [00:47<02:39,  2.45it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 110/500 [00:47<02:38,  2.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 110/500 [00:47<02:38,  2.46it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 111/500 [00:47<02:37,  2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 111/500 [00:47<02:37,  2.47it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 112/500 [00:47<02:47,  2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  22%|██▏       | 112/500 [00:48<02:47,  2.31it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 113/500 [00:48<02:44,  2.35it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 113/500 [00:48<02:44,  2.35it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 114/500 [00:48<02:40,  2.40it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 114/500 [00:49<02:40,  2.40it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 115/500 [00:49<02:49,  2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 115/500 [00:49<02:49,  2.27it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 116/500 [00:49<02:33,  2.51it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 116/500 [00:49<02:33,  2.51it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 117/500 [00:49<02:32,  2.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  23%|██▎       | 117/500 [00:50<02:32,  2.51it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▎       | 118/500 [00:50<02:43,  2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▎       | 118/500 [00:50<02:43,  2.33it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 119/500 [00:50<02:18,  2.75it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 119/500 [00:51<02:18,  2.75it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 120/500 [00:51<02:21,  2.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 120/500 [00:51<02:21,  2.68it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 121/500 [00:51<02:35,  2.44it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 121/500 [00:52<02:35,  2.44it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 122/500 [00:52<02:34,  2.45it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  24%|██▍       | 122/500 [00:52<02:34,  2.45it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▍       | 123/500 [00:52<02:54,  2.15it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▍       | 123/500 [00:53<02:54,  2.15it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▍       | 124/500 [00:53<02:58,  2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▍       | 124/500 [00:53<02:58,  2.10it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▌       | 125/500 [00:53<02:50,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▌       | 125/500 [00:53<02:50,  2.20it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▌       | 126/500 [00:53<02:43,  2.29it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▌       | 126/500 [00:54<02:43,  2.29it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▌       | 127/500 [00:54<02:38,  2.35it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  25%|██▌       | 127/500 [00:54<02:38,  2.35it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 128/500 [00:54<02:24,  2.57it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 128/500 [00:54<02:24,  2.57it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 129/500 [00:54<02:14,  2.77it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 129/500 [00:55<02:14,  2.77it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 130/500 [00:55<02:07,  2.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 130/500 [00:55<02:07,  2.91it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 131/500 [00:55<02:12,  2.78it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▌       | 131/500 [00:56<02:12,  2.78it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▋       | 132/500 [00:56<02:17,  2.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  26%|██▋       | 132/500 [00:56<02:17,  2.68it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 133/500 [00:56<02:19,  2.63it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 133/500 [00:56<02:19,  2.63it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 134/500 [00:56<02:10,  2.81it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 134/500 [00:57<02:10,  2.81it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 135/500 [00:57<02:26,  2.50it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 135/500 [00:57<02:26,  2.50it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 136/500 [00:57<02:36,  2.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 136/500 [00:58<02:36,  2.32it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 137/500 [00:58<02:43,  2.22it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  27%|██▋       | 137/500 [00:58<02:43,  2.22it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 138/500 [00:58<02:48,  2.15it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 138/500 [00:59<02:48,  2.15it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 139/500 [00:59<02:40,  2.25it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 139/500 [00:59<02:40,  2.25it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 140/500 [00:59<02:25,  2.48it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 140/500 [00:59<02:25,  2.48it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 141/500 [00:59<02:24,  2.49it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 141/500 [01:00<02:24,  2.49it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 142/500 [01:00<02:33,  2.32it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  28%|██▊       | 142/500 [01:00<02:33,  2.32it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▊       | 143/500 [01:00<02:30,  2.38it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▊       | 143/500 [01:01<02:30,  2.38it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 144/500 [01:01<02:27,  2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 144/500 [01:01<02:27,  2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 145/500 [01:01<02:05,  2.83it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 145/500 [01:01<02:05,  2.83it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 146/500 [01:01<02:19,  2.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 146/500 [01:02<02:19,  2.54it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 147/500 [01:02<02:09,  2.72it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  29%|██▉       | 147/500 [01:02<02:09,  2.72it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  30%|██▉       | 148/500 [01:02<02:12,  2.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  30%|██▉       | 148/500 [01:02<02:12,  2.65it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  30%|██▉       | 149/500 [01:02<02:14,  2.60it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  30%|██▉       | 149/500 [01:03<02:14,  2.60it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01:  30%|███       | 150/500 [01:03<02:26,  2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  30%|███       | 150/500 [01:03<02:26,  2.39it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  30%|███       | 151/500 [01:03<02:23,  2.43it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  30%|███       | 151/500 [01:04<02:23,  2.43it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  30%|███       | 152/500 [01:04<02:22,  2.45it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  30%|███       | 152/500 [01:04<02:22,  2.45it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 153/500 [01:04<02:21,  2.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 153/500 [01:05<02:21,  2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 154/500 [01:05<02:20,  2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 154/500 [01:05<02:20,  2.47it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 155/500 [01:05<02:29,  2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 155/500 [01:05<02:29,  2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 156/500 [01:05<02:25,  2.36it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███       | 156/500 [01:06<02:25,  2.36it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███▏      | 157/500 [01:06<02:22,  2.40it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  31%|███▏      | 157/500 [01:06<02:22,  2.40it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 158/500 [01:06<02:20,  2.43it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 158/500 [01:07<02:20,  2.43it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 159/500 [01:07<02:19,  2.45it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 159/500 [01:07<02:19,  2.45it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 160/500 [01:07<02:17,  2.47it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 160/500 [01:07<02:17,  2.47it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 161/500 [01:07<02:26,  2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 161/500 [01:08<02:26,  2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 162/500 [01:08<02:23,  2.35it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  32%|███▏      | 162/500 [01:08<02:23,  2.35it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 163/500 [01:08<02:30,  2.24it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 163/500 [01:09<02:30,  2.24it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 164/500 [01:09<02:35,  2.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 164/500 [01:09<02:35,  2.17it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 165/500 [01:09<02:19,  2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 165/500 [01:10<02:19,  2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 166/500 [01:10<02:36,  2.13it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 166/500 [01:10<02:36,  2.13it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 167/500 [01:10<02:39,  2.09it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  33%|███▎      | 167/500 [01:11<02:39,  2.09it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▎      | 168/500 [01:11<02:40,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▎      | 168/500 [01:11<02:40,  2.06it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 169/500 [01:11<02:42,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 169/500 [01:12<02:42,  2.04it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 170/500 [01:12<02:52,  1.91it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 170/500 [01:12<02:52,  1.91it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 171/500 [01:12<02:50,  1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 171/500 [01:13<02:50,  1.94it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 172/500 [01:13<02:37,  2.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  34%|███▍      | 172/500 [01:13<02:37,  2.08it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▍      | 173/500 [01:13<02:29,  2.19it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▍      | 173/500 [01:14<02:29,  2.19it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▍      | 174/500 [01:14<02:23,  2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▍      | 174/500 [01:14<02:23,  2.27it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▌      | 175/500 [01:14<02:18,  2.34it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▌      | 175/500 [01:14<02:18,  2.34it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▌      | 176/500 [01:14<02:15,  2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▌      | 176/500 [01:15<02:15,  2.39it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▌      | 177/500 [01:15<02:13,  2.42it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  35%|███▌      | 177/500 [01:15<02:13,  2.42it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 178/500 [01:15<02:12,  2.44it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 178/500 [01:16<02:12,  2.44it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 179/500 [01:16<02:10,  2.46it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 179/500 [01:16<02:10,  2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 180/500 [01:16<02:09,  2.47it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 180/500 [01:16<02:09,  2.47it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 181/500 [01:16<01:59,  2.67it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▌      | 181/500 [01:17<01:59,  2.67it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▋      | 182/500 [01:17<02:01,  2.63it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  36%|███▋      | 182/500 [01:17<02:01,  2.63it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 183/500 [01:17<01:53,  2.79it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 183/500 [01:17<01:53,  2.79it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 184/500 [01:17<01:56,  2.71it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 184/500 [01:18<01:56,  2.71it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 185/500 [01:18<01:59,  2.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 185/500 [01:18<01:59,  2.65it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 186/500 [01:18<02:00,  2.60it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 186/500 [01:19<02:00,  2.60it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 187/500 [01:19<02:02,  2.56it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  37%|███▋      | 187/500 [01:19<02:02,  2.56it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 188/500 [01:19<01:53,  2.76it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 188/500 [01:19<01:53,  2.76it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 189/500 [01:19<01:47,  2.89it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 189/500 [01:20<01:47,  2.89it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 190/500 [01:20<01:51,  2.78it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 190/500 [01:20<01:51,  2.78it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 191/500 [01:20<01:55,  2.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 191/500 [01:20<01:55,  2.68it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 192/500 [01:20<01:57,  2.62it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  38%|███▊      | 192/500 [01:21<01:57,  2.62it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▊      | 193/500 [01:21<01:58,  2.59it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▊      | 193/500 [01:21<01:58,  2.59it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 194/500 [01:21<01:59,  2.56it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 194/500 [01:22<01:59,  2.56it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 195/500 [01:22<02:00,  2.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 195/500 [01:22<02:00,  2.54it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 196/500 [01:22<02:00,  2.53it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 196/500 [01:22<02:00,  2.53it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 197/500 [01:22<01:51,  2.72it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  39%|███▉      | 197/500 [01:23<01:51,  2.72it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  40%|███▉      | 198/500 [01:23<01:53,  2.66it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  40%|███▉      | 198/500 [01:23<01:53,  2.66it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  40%|███▉      | 199/500 [01:23<02:04,  2.42it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  40%|███▉      | 199/500 [01:23<02:04,  2.42it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01:  40%|████      | 200/500 [01:23<01:53,  2.64it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  40%|████      | 200/500 [01:24<01:53,  2.64it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  40%|████      | 201/500 [01:24<02:04,  2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  40%|████      | 201/500 [01:24<02:04,  2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  40%|████      | 202/500 [01:24<02:02,  2.43it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  40%|████      | 202/500 [01:25<02:02,  2.43it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 203/500 [01:25<02:01,  2.45it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 203/500 [01:25<02:01,  2.45it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 204/500 [01:25<02:00,  2.46it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 204/500 [01:26<02:00,  2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 205/500 [01:26<02:25,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 205/500 [01:26<02:25,  2.03it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 206/500 [01:26<02:17,  2.15it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████      | 206/500 [01:27<02:17,  2.15it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████▏     | 207/500 [01:27<02:10,  2.24it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  41%|████▏     | 207/500 [01:27<02:10,  2.24it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 208/500 [01:27<02:06,  2.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 208/500 [01:28<02:06,  2.32it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 209/500 [01:28<02:11,  2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 209/500 [01:28<02:11,  2.21it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 210/500 [01:28<01:58,  2.46it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 210/500 [01:28<01:58,  2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 211/500 [01:28<01:56,  2.48it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 211/500 [01:29<01:56,  2.48it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 212/500 [01:29<02:04,  2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  42%|████▏     | 212/500 [01:29<02:04,  2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 213/500 [01:29<02:01,  2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 213/500 [01:30<02:01,  2.36it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 214/500 [01:30<02:07,  2.24it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 214/500 [01:30<02:07,  2.24it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 215/500 [01:30<01:54,  2.48it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 215/500 [01:31<01:54,  2.48it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 216/500 [01:31<02:02,  2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 216/500 [01:31<02:02,  2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 217/500 [01:31<01:59,  2.37it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  43%|████▎     | 217/500 [01:31<01:59,  2.37it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▎     | 218/500 [01:31<01:57,  2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▎     | 218/500 [01:32<01:57,  2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 219/500 [01:32<02:04,  2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 219/500 [01:32<02:04,  2.26it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 220/500 [01:32<02:16,  2.05it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 220/500 [01:33<02:16,  2.05it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 221/500 [01:33<02:09,  2.16it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 221/500 [01:33<02:09,  2.16it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 222/500 [01:33<02:03,  2.25it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  44%|████▍     | 222/500 [01:34<02:03,  2.25it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▍     | 223/500 [01:34<02:07,  2.17it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▍     | 223/500 [01:34<02:07,  2.17it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▍     | 224/500 [01:34<02:02,  2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▍     | 224/500 [01:35<02:02,  2.26it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▌     | 225/500 [01:35<02:06,  2.18it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▌     | 225/500 [01:35<02:06,  2.18it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▌     | 226/500 [01:35<02:00,  2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▌     | 226/500 [01:35<02:00,  2.27it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▌     | 227/500 [01:35<02:05,  2.18it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  45%|████▌     | 227/500 [01:36<02:05,  2.18it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 228/500 [01:36<02:00,  2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 228/500 [01:36<02:00,  2.26it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 229/500 [01:36<01:56,  2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 229/500 [01:37<01:56,  2.33it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 230/500 [01:37<01:45,  2.56it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 230/500 [01:37<01:45,  2.56it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 231/500 [01:37<01:53,  2.37it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▌     | 231/500 [01:37<01:53,  2.37it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▋     | 232/500 [01:37<01:43,  2.58it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  46%|████▋     | 232/500 [01:38<01:43,  2.58it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 233/500 [01:38<01:52,  2.38it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 233/500 [01:39<01:52,  2.38it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 234/500 [01:39<02:06,  2.11it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 234/500 [01:39<02:06,  2.11it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 235/500 [01:39<02:07,  2.07it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 235/500 [01:40<02:07,  2.07it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 236/500 [01:40<02:08,  2.05it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 236/500 [01:40<02:08,  2.05it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 237/500 [01:40<02:17,  1.92it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  47%|████▋     | 237/500 [01:41<02:17,  1.92it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 238/500 [01:41<02:22,  1.84it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 238/500 [01:41<02:22,  1.84it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 239/500 [01:41<02:18,  1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 239/500 [01:42<02:18,  1.88it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 240/500 [01:42<02:39,  1.63it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 240/500 [01:43<02:39,  1.63it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 241/500 [01:43<02:29,  1.73it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 241/500 [01:43<02:29,  1.73it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 242/500 [01:43<02:15,  1.90it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  48%|████▊     | 242/500 [01:43<02:15,  1.90it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▊     | 243/500 [01:43<02:05,  2.05it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▊     | 243/500 [01:44<02:05,  2.05it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 244/500 [01:44<02:20,  1.82it/s][TorchDR] CauchyAffinity: Computed in 0.11s.
[TorchDR] CauchyAffinity: Computed in 0.89s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 244/500 [01:46<02:20,  1.82it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 245/500 [01:46<03:33,  1.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 245/500 [01:46<03:33,  1.20it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 246/500 [01:46<03:14,  1.31it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 246/500 [01:47<03:14,  1.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 247/500 [01:47<02:53,  1.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  49%|████▉     | 247/500 [01:47<02:53,  1.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  50%|████▉     | 248/500 [01:47<02:31,  1.67it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  50%|████▉     | 248/500 [01:47<02:31,  1.67it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  50%|████▉     | 249/500 [01:47<02:15,  1.85it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  50%|████▉     | 249/500 [01:48<02:15,  1.85it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01:  50%|█████     | 250/500 [01:48<02:12,  1.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.30s.

[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 1.52e-03 | LR: 1.00e-01:  50%|█████     | 250/500 [01:49<02:12,  1.89it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 1.52e-03 | LR: 1.00e-01:  50%|█████     | 251/500 [01:49<02:46,  1.50it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.29e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  50%|█████     | 251/500 [01:49<02:46,  1.50it/s]
[TorchDR] COSNE: Loss: 1.29e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  50%|█████     | 252/500 [01:49<02:33,  1.62it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.28e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  50%|█████     | 252/500 [01:50<02:33,  1.62it/s]
[TorchDR] COSNE: Loss: 1.28e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 253/500 [01:50<02:31,  1.63it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.27e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 253/500 [01:50<02:31,  1.63it/s]
[TorchDR] COSNE: Loss: 1.27e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 254/500 [01:50<02:22,  1.73it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.25e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 254/500 [01:51<02:22,  1.73it/s]
[TorchDR] COSNE: Loss: 1.25e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 255/500 [01:51<02:09,  1.90it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 255/500 [01:51<02:09,  1.90it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 256/500 [01:51<02:06,  1.93it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████     | 256/500 [01:52<02:06,  1.93it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████▏    | 257/500 [01:52<02:04,  1.95it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  51%|█████▏    | 257/500 [01:52<02:04,  1.95it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 258/500 [01:52<02:02,  1.97it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 258/500 [01:53<02:02,  1.97it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 259/500 [01:53<02:01,  1.98it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.11s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 259/500 [01:53<02:01,  1.98it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 260/500 [01:53<02:01,  1.98it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 260/500 [01:54<02:01,  1.98it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 261/500 [01:54<02:00,  1.99it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 261/500 [01:54<02:00,  1.99it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 262/500 [01:54<01:59,  2.00it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  52%|█████▏    | 262/500 [01:55<01:59,  2.00it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 263/500 [01:55<01:51,  2.12it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 263/500 [01:55<01:51,  2.12it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 264/500 [01:55<02:00,  1.96it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 264/500 [01:56<02:00,  1.96it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 265/500 [01:56<01:52,  2.09it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 265/500 [01:56<01:52,  2.09it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 266/500 [01:56<02:00,  1.95it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 266/500 [01:57<02:00,  1.95it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 267/500 [01:57<01:58,  1.96it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  53%|█████▎    | 267/500 [01:57<01:58,  1.96it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▎    | 268/500 [01:57<01:57,  1.98it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▎    | 268/500 [01:58<01:57,  1.98it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 269/500 [01:58<01:49,  2.11it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 269/500 [01:58<01:49,  2.11it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 270/500 [01:58<01:44,  2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 270/500 [01:59<01:44,  2.21it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 271/500 [01:59<01:47,  2.14it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 271/500 [01:59<01:47,  2.14it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 272/500 [01:59<01:48,  2.09it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  54%|█████▍    | 272/500 [02:00<01:48,  2.09it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▍    | 273/500 [02:00<01:56,  1.95it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▍    | 273/500 [02:00<01:56,  1.95it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▍    | 274/500 [02:00<01:48,  2.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▍    | 274/500 [02:01<01:48,  2.08it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▌    | 275/500 [02:01<01:49,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▌    | 275/500 [02:01<01:49,  2.06it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▌    | 276/500 [02:01<01:49,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▌    | 276/500 [02:02<01:49,  2.04it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▌    | 277/500 [02:02<01:43,  2.16it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  55%|█████▌    | 277/500 [02:02<01:43,  2.16it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 278/500 [02:02<01:38,  2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 278/500 [02:02<01:38,  2.26it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 279/500 [02:02<01:41,  2.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 279/500 [02:03<01:41,  2.17it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 280/500 [02:03<01:43,  2.12it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 280/500 [02:04<01:43,  2.12it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 281/500 [02:04<01:45,  2.07it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▌    | 281/500 [02:04<01:45,  2.07it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▋    | 282/500 [02:04<01:45,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  56%|█████▋    | 282/500 [02:04<01:45,  2.06it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 283/500 [02:04<01:39,  2.17it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 283/500 [02:05<01:39,  2.17it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 284/500 [02:05<01:35,  2.26it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 284/500 [02:05<01:35,  2.26it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 285/500 [02:05<01:32,  2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 285/500 [02:06<01:32,  2.33it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 286/500 [02:06<01:36,  2.22it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 286/500 [02:06<01:36,  2.22it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 287/500 [02:06<01:26,  2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  57%|█████▋    | 287/500 [02:07<01:26,  2.47it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 288/500 [02:07<01:32,  2.29it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.11s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 288/500 [02:07<01:32,  2.29it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 289/500 [02:07<01:54,  1.84it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 289/500 [02:08<01:54,  1.84it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 290/500 [02:08<01:51,  1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 290/500 [02:08<01:51,  1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 291/500 [02:08<01:42,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 291/500 [02:09<01:42,  2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 292/500 [02:09<01:36,  2.15it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  58%|█████▊    | 292/500 [02:09<01:36,  2.15it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▊    | 293/500 [02:09<01:32,  2.25it/s][TorchDR] CauchyAffinity: Computed in 0.29s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▊    | 293/500 [02:10<01:32,  2.25it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 294/500 [02:10<01:41,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 294/500 [02:10<01:41,  2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 295/500 [02:10<01:41,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 295/500 [02:11<01:41,  2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 296/500 [02:11<01:40,  2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 296/500 [02:11<01:40,  2.02it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 297/500 [02:11<01:40,  2.02it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  59%|█████▉    | 297/500 [02:12<01:40,  2.02it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  60%|█████▉    | 298/500 [02:12<01:34,  2.13it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  60%|█████▉    | 298/500 [02:12<01:34,  2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  60%|█████▉    | 299/500 [02:12<01:41,  1.97it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  60%|█████▉    | 299/500 [02:13<01:41,  1.97it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01:  60%|██████    | 300/500 [02:13<01:35,  2.10it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  60%|██████    | 300/500 [02:13<01:35,  2.10it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  60%|██████    | 301/500 [02:13<01:29,  2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  60%|██████    | 301/500 [02:13<01:29,  2.21it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  60%|██████    | 302/500 [02:13<01:38,  2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  60%|██████    | 302/500 [02:14<01:38,  2.02it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 303/500 [02:14<01:32,  2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 303/500 [02:14<01:32,  2.14it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 304/500 [02:14<01:33,  2.09it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 304/500 [02:15<01:33,  2.09it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 305/500 [02:15<01:34,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 305/500 [02:15<01:34,  2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 306/500 [02:15<01:40,  1.93it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████    | 306/500 [02:16<01:40,  1.93it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████▏   | 307/500 [02:16<01:39,  1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  61%|██████▏   | 307/500 [02:17<01:39,  1.94it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 308/500 [02:17<01:43,  1.86it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 308/500 [02:17<01:43,  1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 309/500 [02:17<01:35,  2.01it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 309/500 [02:17<01:35,  2.01it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 310/500 [02:17<01:29,  2.13it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 310/500 [02:18<01:29,  2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 311/500 [02:18<01:30,  2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 311/500 [02:18<01:30,  2.10it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 312/500 [02:18<01:19,  2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  62%|██████▏   | 312/500 [02:19<01:19,  2.36it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 313/500 [02:19<01:18,  2.39it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 313/500 [02:19<01:18,  2.39it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 314/500 [02:19<01:22,  2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 314/500 [02:20<01:22,  2.26it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 315/500 [02:20<01:24,  2.18it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 315/500 [02:20<01:24,  2.18it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 316/500 [02:20<01:37,  1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 316/500 [02:21<01:37,  1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 317/500 [02:21<01:35,  1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  63%|██████▎   | 317/500 [02:21<01:35,  1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▎   | 318/500 [02:21<01:39,  1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▎   | 318/500 [02:22<01:39,  1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 319/500 [02:22<01:47,  1.69it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 319/500 [02:23<01:47,  1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 320/500 [02:23<01:36,  1.87it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 320/500 [02:23<01:36,  1.87it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 321/500 [02:23<01:33,  1.90it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 321/500 [02:24<01:33,  1.90it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 322/500 [02:24<01:31,  1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  64%|██████▍   | 322/500 [02:24<01:31,  1.94it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▍   | 323/500 [02:24<01:25,  2.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▍   | 323/500 [02:24<01:25,  2.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▍   | 324/500 [02:24<01:20,  2.19it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▍   | 324/500 [02:25<01:20,  2.19it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▌   | 325/500 [02:25<01:22,  2.13it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▌   | 325/500 [02:25<01:22,  2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▌   | 326/500 [02:25<01:18,  2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▌   | 326/500 [02:26<01:18,  2.23it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▌   | 327/500 [02:26<01:15,  2.30it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  65%|██████▌   | 327/500 [02:26<01:15,  2.30it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 328/500 [02:26<01:18,  2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 328/500 [02:27<01:18,  2.20it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 329/500 [02:27<01:19,  2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 329/500 [02:27<01:19,  2.14it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 330/500 [02:27<01:16,  2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 330/500 [02:27<01:16,  2.23it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 331/500 [02:27<01:18,  2.16it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▌   | 331/500 [02:28<01:18,  2.16it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▋   | 332/500 [02:28<01:19,  2.11it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  66%|██████▋   | 332/500 [02:28<01:19,  2.11it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 333/500 [02:28<01:20,  2.08it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 333/500 [02:29<01:20,  2.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 334/500 [02:29<01:15,  2.19it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 334/500 [02:29<01:15,  2.19it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 335/500 [02:29<01:17,  2.13it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 335/500 [02:30<01:17,  2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 336/500 [02:30<01:18,  2.09it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 336/500 [02:30<01:18,  2.09it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 337/500 [02:30<01:19,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  67%|██████▋   | 337/500 [02:31<01:19,  2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 338/500 [02:31<01:19,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 338/500 [02:31<01:19,  2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 339/500 [02:31<01:19,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 339/500 [02:32<01:19,  2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 340/500 [02:32<01:28,  1.80it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 340/500 [02:33<01:28,  1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 341/500 [02:33<01:35,  1.67it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 341/500 [02:33<01:35,  1.67it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 342/500 [02:33<01:29,  1.76it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  68%|██████▊   | 342/500 [02:34<01:29,  1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▊   | 343/500 [02:34<01:26,  1.82it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▊   | 343/500 [02:34<01:26,  1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 344/500 [02:34<01:23,  1.87it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 344/500 [02:35<01:23,  1.87it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 345/500 [02:35<01:21,  1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 345/500 [02:35<01:21,  1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 346/500 [02:35<01:24,  1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 346/500 [02:36<01:24,  1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 347/500 [02:36<01:26,  1.78it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  69%|██████▉   | 347/500 [02:37<01:26,  1.78it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  70%|██████▉   | 348/500 [02:37<01:36,  1.58it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.29s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  70%|██████▉   | 348/500 [02:38<01:36,  1.58it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  70%|██████▉   | 349/500 [02:38<01:52,  1.34it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.30s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  70%|██████▉   | 349/500 [02:39<01:52,  1.34it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01:  70%|███████   | 350/500 [02:39<01:58,  1.27it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  70%|███████   | 350/500 [02:39<01:58,  1.27it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  70%|███████   | 351/500 [02:39<01:49,  1.37it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  70%|███████   | 351/500 [02:40<01:49,  1.37it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  70%|███████   | 352/500 [02:40<01:38,  1.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  70%|███████   | 352/500 [02:40<01:38,  1.51it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 353/500 [02:40<01:26,  1.71it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 353/500 [02:41<01:26,  1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 354/500 [02:41<01:26,  1.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 354/500 [02:41<01:26,  1.70it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 355/500 [02:41<01:25,  1.69it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 355/500 [02:42<01:25,  1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 356/500 [02:42<01:25,  1.68it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████   | 356/500 [02:42<01:25,  1.68it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████▏  | 357/500 [02:42<01:20,  1.77it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  71%|███████▏  | 357/500 [02:43<01:20,  1.77it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 358/500 [02:43<01:17,  1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 358/500 [02:43<01:17,  1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 359/500 [02:43<01:15,  1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 359/500 [02:44<01:15,  1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 360/500 [02:44<01:09,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 360/500 [02:45<01:09,  2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 361/500 [02:45<01:17,  1.80it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 361/500 [02:45<01:17,  1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 362/500 [02:45<01:14,  1.86it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  72%|███████▏  | 362/500 [02:46<01:14,  1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 363/500 [02:46<01:16,  1.80it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 363/500 [02:46<01:16,  1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 364/500 [02:46<01:13,  1.85it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 364/500 [02:47<01:13,  1.85it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 365/500 [02:47<01:11,  1.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 365/500 [02:47<01:11,  1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 366/500 [02:47<01:13,  1.82it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 366/500 [02:48<01:13,  1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 367/500 [02:48<01:15,  1.77it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  73%|███████▎  | 367/500 [02:49<01:15,  1.77it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▎  | 368/500 [02:49<01:15,  1.74it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▎  | 368/500 [02:49<01:15,  1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 369/500 [02:49<01:16,  1.72it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 369/500 [02:50<01:16,  1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 370/500 [02:50<01:12,  1.79it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 370/500 [02:50<01:12,  1.79it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 371/500 [02:50<01:13,  1.76it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 371/500 [02:51<01:13,  1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 372/500 [02:51<01:14,  1.73it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  74%|███████▍  | 372/500 [02:51<01:14,  1.73it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▍  | 373/500 [02:51<01:14,  1.71it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▍  | 373/500 [02:52<01:14,  1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▍  | 374/500 [02:52<01:10,  1.78it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▍  | 374/500 [02:52<01:10,  1.78it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▌  | 375/500 [02:52<01:07,  1.84it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▌  | 375/500 [02:53<01:07,  1.84it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▌  | 376/500 [02:53<01:05,  1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▌  | 376/500 [02:53<01:05,  1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▌  | 377/500 [02:53<01:07,  1.82it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  75%|███████▌  | 377/500 [02:54<01:07,  1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 378/500 [02:54<01:01,  1.97it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 378/500 [02:54<01:01,  1.97it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 379/500 [02:54<01:00,  1.99it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 379/500 [02:55<01:00,  1.99it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 380/500 [02:55<01:00,  1.99it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 380/500 [02:55<01:00,  1.99it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 381/500 [02:55<01:03,  1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▌  | 381/500 [02:56<01:03,  1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▋  | 382/500 [02:56<01:01,  1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  76%|███████▋  | 382/500 [02:57<01:01,  1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 383/500 [02:57<01:03,  1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 383/500 [02:57<01:03,  1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 384/500 [02:57<01:01,  1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 384/500 [02:58<01:01,  1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 385/500 [02:58<00:56,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 385/500 [02:58<00:56,  2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 386/500 [02:58<00:53,  2.15it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 386/500 [02:58<00:53,  2.15it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 387/500 [02:58<00:53,  2.10it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  77%|███████▋  | 387/500 [02:59<00:53,  2.10it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 388/500 [02:59<01:10,  1.58it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.71s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 388/500 [03:04<01:10,  1.58it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 389/500 [03:04<03:09,  1.70s/it][TorchDR] CauchyAffinity: Computed in 0.50s.
[TorchDR] CauchyAffinity: Computed in 0.99s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 389/500 [03:07<03:09,  1.70s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 390/500 [03:07<04:00,  2.18s/it][TorchDR] CauchyAffinity: Computed in 0.60s.
[TorchDR] CauchyAffinity: Computed in 0.70s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 390/500 [03:10<04:00,  2.18s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 391/500 [03:10<04:31,  2.49s/it][TorchDR] CauchyAffinity: Computed in 0.50s.
[TorchDR] CauchyAffinity: Computed in 0.90s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 391/500 [03:13<04:31,  2.49s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 392/500 [03:13<04:55,  2.73s/it][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.50s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  78%|███████▊  | 392/500 [03:15<04:55,  2.73s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▊  | 393/500 [03:15<04:12,  2.36s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.50s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▊  | 393/500 [03:17<04:12,  2.36s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 394/500 [03:17<03:49,  2.16s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.39s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 394/500 [03:18<03:49,  2.16s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 395/500 [03:18<03:22,  1.93s/it][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.60s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 395/500 [03:20<03:22,  1.93s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 396/500 [03:20<03:10,  1.84s/it][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.40s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 396/500 [03:21<03:10,  1.84s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 397/500 [03:21<03:01,  1.76s/it][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.30s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  79%|███████▉  | 397/500 [03:23<03:01,  1.76s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  80%|███████▉  | 398/500 [03:23<02:54,  1.71s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.59s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  80%|███████▉  | 398/500 [03:25<02:54,  1.71s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  80%|███████▉  | 399/500 [03:25<03:11,  1.89s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.51s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  80%|███████▉  | 399/500 [03:28<03:11,  1.89s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01:  80%|████████  | 400/500 [03:28<03:36,  2.16s/it][TorchDR] CauchyAffinity: Computed in 0.60s.
[TorchDR] CauchyAffinity: Computed in 0.59s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  80%|████████  | 400/500 [03:32<03:36,  2.16s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  80%|████████  | 401/500 [03:32<04:28,  2.71s/it][TorchDR] CauchyAffinity: Computed in 0.31s.
[TorchDR] CauchyAffinity: Computed in 0.81s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  80%|████████  | 401/500 [03:37<04:28,  2.71s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  80%|████████  | 402/500 [03:37<05:24,  3.31s/it][TorchDR] CauchyAffinity: Computed in 1.99s.
[TorchDR] CauchyAffinity: Computed in 1.30s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  80%|████████  | 402/500 [03:43<05:24,  3.31s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 403/500 [03:43<06:59,  4.33s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.60s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 403/500 [03:46<06:59,  4.33s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 404/500 [03:46<06:02,  3.78s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.49s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 404/500 [03:47<06:02,  3.78s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 405/500 [03:47<04:45,  3.01s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.40s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 405/500 [03:48<04:45,  3.01s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 406/500 [03:48<03:51,  2.46s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████  | 406/500 [03:49<03:51,  2.46s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████▏ | 407/500 [03:49<02:54,  1.87s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  81%|████████▏ | 407/500 [03:49<02:54,  1.87s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 408/500 [03:49<02:14,  1.46s/it][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 408/500 [03:50<02:14,  1.46s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 409/500 [03:50<01:49,  1.21s/it][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 409/500 [03:50<01:49,  1.21s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 410/500 [03:50<01:32,  1.02s/it][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 410/500 [03:51<01:32,  1.02s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 411/500 [03:51<01:27,  1.01it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 411/500 [03:52<01:27,  1.01it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 412/500 [03:52<01:21,  1.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  82%|████████▏ | 412/500 [03:53<01:21,  1.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 413/500 [03:53<01:20,  1.09it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 413/500 [03:54<01:20,  1.09it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 414/500 [03:54<01:13,  1.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 414/500 [03:54<01:13,  1.17it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 415/500 [03:54<01:03,  1.34it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 415/500 [03:55<01:03,  1.34it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 416/500 [03:55<00:56,  1.48it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 416/500 [03:55<00:56,  1.48it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 417/500 [03:55<00:54,  1.54it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.11s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  83%|████████▎ | 417/500 [03:56<00:54,  1.54it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▎ | 418/500 [03:56<00:54,  1.50it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▎ | 418/500 [03:57<00:54,  1.50it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 419/500 [03:57<00:50,  1.62it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 419/500 [03:57<00:50,  1.62it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 420/500 [03:57<00:48,  1.63it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 420/500 [03:58<00:48,  1.63it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 421/500 [03:58<00:50,  1.57it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 421/500 [03:58<00:50,  1.57it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 422/500 [03:58<00:48,  1.60it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  84%|████████▍ | 422/500 [03:59<00:48,  1.60it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▍ | 423/500 [03:59<00:49,  1.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▍ | 423/500 [04:00<00:49,  1.54it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▍ | 424/500 [04:00<00:45,  1.66it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▍ | 424/500 [04:00<00:45,  1.66it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▌ | 425/500 [04:00<00:47,  1.58it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▌ | 425/500 [04:01<00:47,  1.58it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▌ | 426/500 [04:01<00:43,  1.69it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▌ | 426/500 [04:02<00:43,  1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▌ | 427/500 [04:02<00:45,  1.60it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  85%|████████▌ | 427/500 [04:02<00:45,  1.60it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 428/500 [04:02<00:48,  1.48it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 428/500 [04:03<00:48,  1.48it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 429/500 [04:03<00:46,  1.53it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 429/500 [04:04<00:46,  1.53it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 430/500 [04:04<00:48,  1.44it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 430/500 [04:04<00:48,  1.44it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 431/500 [04:04<00:44,  1.56it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▌ | 431/500 [04:05<00:44,  1.56it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▋ | 432/500 [04:05<00:40,  1.68it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  86%|████████▋ | 432/500 [04:05<00:40,  1.68it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 433/500 [04:05<00:40,  1.67it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 433/500 [04:06<00:40,  1.67it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 434/500 [04:06<00:37,  1.76it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 434/500 [04:06<00:37,  1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 435/500 [04:06<00:37,  1.73it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 435/500 [04:07<00:37,  1.73it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 436/500 [04:07<00:39,  1.62it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 436/500 [04:08<00:39,  1.62it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 437/500 [04:08<00:38,  1.64it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.30s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  87%|████████▋ | 437/500 [04:08<00:38,  1.64it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 438/500 [04:08<00:37,  1.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 438/500 [04:09<00:37,  1.65it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 439/500 [04:09<00:36,  1.65it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 439/500 [04:09<00:36,  1.65it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 440/500 [04:09<00:34,  1.74it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 440/500 [04:10<00:34,  1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 441/500 [04:10<00:34,  1.72it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 441/500 [04:11<00:34,  1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 442/500 [04:11<00:35,  1.62it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  88%|████████▊ | 442/500 [04:11<00:35,  1.62it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▊ | 443/500 [04:11<00:34,  1.64it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▊ | 443/500 [04:12<00:34,  1.64it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 444/500 [04:12<00:32,  1.73it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 444/500 [04:13<00:32,  1.73it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 445/500 [04:13<00:33,  1.63it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 445/500 [04:13<00:33,  1.63it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 446/500 [04:13<00:31,  1.72it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 446/500 [04:13<00:31,  1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 447/500 [04:13<00:29,  1.80it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  89%|████████▉ | 447/500 [04:14<00:29,  1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  90%|████████▉ | 448/500 [04:14<00:28,  1.85it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  90%|████████▉ | 448/500 [04:14<00:28,  1.85it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  90%|████████▉ | 449/500 [04:14<00:26,  1.90it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  90%|████████▉ | 449/500 [04:15<00:26,  1.90it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01:  90%|█████████ | 450/500 [04:15<00:24,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  90%|█████████ | 450/500 [04:16<00:24,  2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  90%|█████████ | 451/500 [04:16<00:25,  1.91it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  90%|█████████ | 451/500 [04:16<00:25,  1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  90%|█████████ | 452/500 [04:16<00:26,  1.84it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  90%|█████████ | 452/500 [04:17<00:26,  1.84it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 453/500 [04:17<00:25,  1.87it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 453/500 [04:17<00:25,  1.87it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 454/500 [04:17<00:23,  1.92it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 454/500 [04:18<00:23,  1.92it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 455/500 [04:18<00:21,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 455/500 [04:18<00:21,  2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 456/500 [04:18<00:26,  1.64it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.39s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████ | 456/500 [04:20<00:26,  1.64it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████▏| 457/500 [04:20<00:32,  1.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  91%|█████████▏| 457/500 [04:20<00:32,  1.32it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 458/500 [04:20<00:29,  1.41it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 458/500 [04:21<00:29,  1.41it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 459/500 [04:21<00:26,  1.55it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 459/500 [04:21<00:26,  1.55it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 460/500 [04:21<00:24,  1.66it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 460/500 [04:21<00:24,  1.66it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 461/500 [04:22<00:21,  1.85it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 461/500 [04:22<00:21,  1.85it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 462/500 [04:22<00:20,  1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  92%|█████████▏| 462/500 [04:22<00:20,  1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 463/500 [04:22<00:18,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 463/500 [04:23<00:18,  2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 464/500 [04:23<00:20,  1.72it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 464/500 [04:24<00:20,  1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 465/500 [04:24<00:19,  1.79it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 465/500 [04:24<00:19,  1.79it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 466/500 [04:24<00:17,  1.96it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.29s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 466/500 [04:25<00:17,  1.96it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 467/500 [04:25<00:17,  1.86it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  93%|█████████▎| 467/500 [04:25<00:17,  1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▎| 468/500 [04:25<00:15,  2.01it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▎| 468/500 [04:26<00:15,  2.01it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 469/500 [04:26<00:16,  1.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 469/500 [04:26<00:16,  1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 470/500 [04:26<00:16,  1.82it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 470/500 [04:27<00:16,  1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 471/500 [04:27<00:17,  1.69it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 471/500 [04:28<00:17,  1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 472/500 [04:28<00:16,  1.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  94%|█████████▍| 472/500 [04:28<00:16,  1.68it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▍| 473/500 [04:28<00:16,  1.60it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▍| 473/500 [04:29<00:16,  1.60it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▍| 474/500 [04:29<00:15,  1.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▍| 474/500 [04:30<00:15,  1.70it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▌| 475/500 [04:30<00:15,  1.61it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▌| 475/500 [04:30<00:15,  1.61it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▌| 476/500 [04:30<00:14,  1.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▌| 476/500 [04:30<00:14,  1.70it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▌| 477/500 [04:30<00:12,  1.79it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  95%|█████████▌| 477/500 [04:31<00:12,  1.79it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 478/500 [04:31<00:11,  1.95it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 478/500 [04:31<00:11,  1.95it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 479/500 [04:31<00:10,  2.08it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 479/500 [04:32<00:10,  2.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 480/500 [04:32<00:09,  2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 480/500 [04:32<00:09,  2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 481/500 [04:32<00:09,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▌| 481/500 [04:33<00:09,  2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▋| 482/500 [04:33<00:08,  2.16it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  96%|█████████▋| 482/500 [04:33<00:08,  2.16it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 483/500 [04:33<00:07,  2.26it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 483/500 [04:34<00:07,  2.26it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 484/500 [04:34<00:07,  2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 484/500 [04:34<00:07,  2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 485/500 [04:34<00:07,  2.03it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 485/500 [04:35<00:07,  2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 486/500 [04:35<00:06,  2.14it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 486/500 [04:35<00:06,  2.14it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 487/500 [04:35<00:07,  1.76it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  97%|█████████▋| 487/500 [04:36<00:07,  1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 488/500 [04:36<00:07,  1.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 488/500 [04:37<00:07,  1.65it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 489/500 [04:37<00:06,  1.74it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 489/500 [04:37<00:06,  1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 490/500 [04:37<00:05,  1.81it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 490/500 [04:38<00:05,  1.81it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 491/500 [04:38<00:04,  1.86it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 491/500 [04:38<00:04,  1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 492/500 [04:38<00:04,  1.80it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  98%|█████████▊| 492/500 [04:39<00:04,  1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▊| 493/500 [04:39<00:03,  1.96it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▊| 493/500 [04:39<00:03,  1.96it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 494/500 [04:39<00:03,  1.98it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 494/500 [04:40<00:03,  1.98it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 495/500 [04:40<00:02,  1.77it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 495/500 [04:40<00:02,  1.77it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 496/500 [04:40<00:02,  1.74it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 496/500 [04:41<00:02,  1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 497/500 [04:41<00:01,  1.71it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01:  99%|█████████▉| 497/500 [04:42<00:01,  1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 498/500 [04:42<00:01,  1.71it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 498/500 [04:42<00:01,  1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 499/500 [04:42<00:00,  1.78it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.

[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 499/500 [04:43<00:00,  1.78it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|██████████| 500/500 [04:43<00:00,  1.84it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|██████████| 500/500 [04:43<00:00,  1.77it/s]
[TorchDR] COSNE: Computed in 283.60s.
/home/circleci/project/torchdr/utils/visu.py:29: RuntimeWarning: invalid value encountered in arccosh
  hypDistance = np.arccosh(1 + 2 * (distance) / (1 - distance + 1e-10))

Total running time of the script: (6 minutes 24.716 seconds)

Gallery generated by Sphinx-Gallery