Note
Go to the end to download the full example code.
TSNE vs COSNE : Euclidean vs Hyperbolic#
We compare in this example two dimensionalty reduction methods: T-SNE and CO-SNE on a synthetic hierarchical toy dataset and on singlecell data. The first method computes an embedding in a 2D Euclidean space while the second one operates in the Hyperbolic Poincaré Ball model.
Load the SNARE-seq dataset (gene expression) with cell type labels#
def load_numpy_from_url(url, delimiter="\t"):
"""
Load a numpy array from a URL.
Parameters
----------
url : str
URL to load data from.
delimiter : str, default="\t"
Delimiter used in the data file.
Returns
-------
numpy.ndarray
Loaded data as a numpy array.
"""
response = urllib.request.urlopen(url)
data = response.read().decode("utf-8")
data = data.split("\n")
data = [row.split(delimiter) for row in data if row]
numpy_array = np.array(data, dtype=float)
return numpy_array
url_x = "https://rsinghlab.github.io/SCOT/data/snare_rna.txt"
snare_data = load_numpy_from_url(url_x) / 100
url_y = "https://rsinghlab.github.io/SCOT/data/SNAREseq_types.txt"
snare_labels = load_numpy_from_url(url_y)
Computing TSNE and COSNE on SNARE-seq data#
We can now proceed to computing the two DR methods and visualizing the results on the SNARE-seq dataset.
tsne_model = TSNE(verbose=True, max_iter=500)
out_tsne = tsne_model.fit_transform(snare_data)
cosne_model = COSNE(lr=1e-1, verbose=True, gamma=0.5, lambda1=0.01, max_iter=500)
out_cosne = cosne_model.fit_transform(snare_data)
fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(16, 8))
axes[0].scatter(*out_tsne.T, c=snare_labels.squeeze(1), cmap=plt.get_cmap("rainbow"))
axes[0].set_xticks([])
axes[0].set_yticks([])
axes[0].set_title("T-SNE", fontsize=24)
plot_disk(axes[1])
axes[1].scatter(*out_cosne.T, c=snare_labels.squeeze(1), cmap=plt.get_cmap("rainbow"))
axes[1].axis("off")
axes[1].set_title("CO-SNE", fontsize=24)
plt.show()

0%| | 0/500 [00:00<?, ?it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 0%| | 0/500 [00:00<?, ?it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 0%| | 1/500 [00:00<01:41, 4.93it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 0%| | 1/500 [00:00<01:41, 4.93it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 0%| | 2/500 [00:00<01:57, 4.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 0%| | 2/500 [00:00<01:57, 4.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 3/500 [00:00<01:49, 4.55it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 3/500 [00:00<01:49, 4.55it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 4/500 [00:00<01:33, 5.33it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 4/500 [00:00<01:33, 5.33it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 5/500 [00:00<01:34, 5.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 5/500 [00:01<01:34, 5.23it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 6/500 [00:01<01:36, 5.14it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%| | 6/500 [00:01<01:36, 5.14it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%|▏ | 7/500 [00:01<01:36, 5.12it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 1%|▏ | 7/500 [00:01<01:36, 5.12it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 8/500 [00:01<01:21, 6.00it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 8/500 [00:01<01:21, 6.00it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 9/500 [00:01<01:27, 5.64it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 9/500 [00:01<01:27, 5.64it/s]
[TorchDR] TSNE: Loss: 1.39e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 10/500 [00:01<01:30, 5.43it/s]
[TorchDR] TSNE: Loss: 1.41e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 10/500 [00:02<01:30, 5.43it/s]
[TorchDR] TSNE: Loss: 1.41e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 11/500 [00:02<01:32, 5.31it/s]
[TorchDR] TSNE: Loss: 1.43e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 11/500 [00:02<01:32, 5.31it/s]
[TorchDR] TSNE: Loss: 1.43e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 12/500 [00:02<01:33, 5.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 2%|▏ | 12/500 [00:02<01:33, 5.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 13/500 [00:02<01:35, 5.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 13/500 [00:02<01:35, 5.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 14/500 [00:02<01:35, 5.10it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 14/500 [00:03<01:35, 5.10it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 15/500 [00:03<01:50, 4.40it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 15/500 [00:03<01:50, 4.40it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 16/500 [00:03<01:46, 4.55it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 16/500 [00:03<01:46, 4.55it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 17/500 [00:03<01:57, 4.12it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 3%|▎ | 17/500 [00:03<01:57, 4.12it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▎ | 18/500 [00:03<02:05, 3.84it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▎ | 18/500 [00:04<02:05, 3.84it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 19/500 [00:04<01:56, 4.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 19/500 [00:04<01:56, 4.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 20/500 [00:04<01:50, 4.36it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 20/500 [00:04<01:50, 4.36it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 21/500 [00:04<01:45, 4.54it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 21/500 [00:04<01:45, 4.54it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 22/500 [00:04<01:37, 4.92it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 4%|▍ | 22/500 [00:04<01:37, 4.92it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▍ | 23/500 [00:04<01:27, 5.45it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▍ | 23/500 [00:04<01:27, 5.45it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▍ | 24/500 [00:04<01:29, 5.31it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▍ | 24/500 [00:05<01:29, 5.31it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▌ | 25/500 [00:05<01:31, 5.22it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▌ | 25/500 [00:05<01:31, 5.22it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▌ | 26/500 [00:05<01:18, 6.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▌ | 26/500 [00:05<01:18, 6.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▌ | 27/500 [00:05<01:23, 5.70it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 5%|▌ | 27/500 [00:05<01:23, 5.70it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 28/500 [00:05<01:39, 4.73it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 28/500 [00:05<01:39, 4.73it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 29/500 [00:05<01:37, 4.81it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 29/500 [00:06<01:37, 4.81it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 30/500 [00:06<01:36, 4.85it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 30/500 [00:06<01:36, 4.85it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 31/500 [00:06<01:49, 4.28it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▌ | 31/500 [00:06<01:49, 4.28it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▋ | 32/500 [00:06<01:31, 5.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 6%|▋ | 32/500 [00:06<01:31, 5.12it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 33/500 [00:06<01:45, 4.43it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 33/500 [00:07<01:45, 4.43it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 34/500 [00:07<01:55, 4.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 34/500 [00:07<01:55, 4.04it/s]
[TorchDR] TSNE: Loss: 1.50e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 35/500 [00:07<01:48, 4.28it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 35/500 [00:07<01:48, 4.28it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 36/500 [00:07<01:57, 3.94it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 36/500 [00:07<01:57, 3.94it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 37/500 [00:07<01:50, 4.21it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 7%|▋ | 37/500 [00:08<01:50, 4.21it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 38/500 [00:08<01:53, 4.08it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 38/500 [00:08<01:53, 4.08it/s]
[TorchDR] TSNE: Loss: 1.49e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 39/500 [00:08<01:38, 4.69it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 39/500 [00:08<01:38, 4.69it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 40/500 [00:08<01:49, 4.21it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 40/500 [00:08<01:49, 4.21it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 41/500 [00:08<01:31, 5.04it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 41/500 [00:08<01:31, 5.04it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 42/500 [00:08<01:44, 4.39it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 8%|▊ | 42/500 [00:09<01:44, 4.39it/s]
[TorchDR] TSNE: Loss: 1.48e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▊ | 43/500 [00:09<01:40, 4.56it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▊ | 43/500 [00:09<01:40, 4.56it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 44/500 [00:09<01:37, 4.66it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 44/500 [00:09<01:37, 4.66it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 45/500 [00:09<01:48, 4.18it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 45/500 [00:09<01:48, 4.18it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 46/500 [00:09<01:43, 4.38it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 46/500 [00:10<01:43, 4.38it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 47/500 [00:10<01:39, 4.54it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 9%|▉ | 47/500 [00:10<01:39, 4.54it/s]
[TorchDR] TSNE: Loss: 1.47e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 10%|▉ | 48/500 [00:10<01:49, 4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 10%|▉ | 48/500 [00:10<01:49, 4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 10%|▉ | 49/500 [00:10<01:57, 3.85it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 10%|▉ | 49/500 [00:10<01:57, 3.85it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 2.30e-05 | LR: 5.00e+01: 10%|█ | 50/500 [00:10<01:49, 4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 10%|█ | 50/500 [00:11<01:49, 4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 10%|█ | 51/500 [00:11<01:44, 4.31it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 10%|█ | 51/500 [00:11<01:44, 4.31it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 10%|█ | 52/500 [00:11<01:38, 4.53it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 10%|█ | 52/500 [00:11<01:38, 4.53it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 53/500 [00:11<01:48, 4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 53/500 [00:11<01:48, 4.12it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 54/500 [00:11<01:29, 4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 54/500 [00:11<01:29, 4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 55/500 [00:11<01:29, 4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 55/500 [00:11<01:29, 4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 56/500 [00:11<01:29, 4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█ | 56/500 [00:12<01:29, 4.98it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█▏ | 57/500 [00:12<01:29, 4.97it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 11%|█▏ | 57/500 [00:12<01:29, 4.97it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 58/500 [00:12<01:41, 4.35it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 58/500 [00:12<01:41, 4.35it/s]
[TorchDR] TSNE: Loss: 1.46e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 59/500 [00:12<01:37, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 59/500 [00:12<01:37, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 60/500 [00:12<01:42, 4.28it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 60/500 [00:13<01:42, 4.28it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 61/500 [00:13<01:43, 4.26it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 61/500 [00:13<01:43, 4.26it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 62/500 [00:13<01:46, 4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 12%|█▏ | 62/500 [00:13<01:46, 4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 63/500 [00:13<01:40, 4.34it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 63/500 [00:13<01:40, 4.34it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 64/500 [00:13<01:28, 4.92it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 64/500 [00:14<01:28, 4.92it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 65/500 [00:14<01:53, 3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 65/500 [00:14<01:53, 3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 66/500 [00:14<01:45, 4.12it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 66/500 [00:14<01:45, 4.12it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 67/500 [00:14<01:47, 4.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 13%|█▎ | 67/500 [00:14<01:47, 4.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▎ | 68/500 [00:14<01:33, 4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▎ | 68/500 [00:14<01:33, 4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 69/500 [00:14<01:30, 4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 69/500 [00:15<01:30, 4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 70/500 [00:15<01:42, 4.21it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 70/500 [00:15<01:42, 4.21it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 71/500 [00:15<01:37, 4.41it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 71/500 [00:15<01:37, 4.41it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 72/500 [00:15<01:46, 4.03it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 14%|█▍ | 72/500 [00:16<01:46, 4.03it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▍ | 73/500 [00:16<01:52, 3.79it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▍ | 73/500 [00:16<01:52, 3.79it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▍ | 74/500 [00:16<01:44, 4.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▍ | 74/500 [00:16<01:44, 4.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▌ | 75/500 [00:16<01:50, 3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▌ | 75/500 [00:16<01:50, 3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▌ | 76/500 [00:16<01:42, 4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▌ | 76/500 [00:16<01:42, 4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▌ | 77/500 [00:16<01:37, 4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 15%|█▌ | 77/500 [00:17<01:37, 4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 78/500 [00:17<01:33, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 78/500 [00:17<01:33, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 79/500 [00:17<01:30, 4.63it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 79/500 [00:17<01:30, 4.63it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 80/500 [00:17<01:40, 4.18it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 80/500 [00:17<01:40, 4.18it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 81/500 [00:17<01:23, 5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▌ | 81/500 [00:18<01:23, 5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▋ | 82/500 [00:18<01:23, 5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 16%|█▋ | 82/500 [00:18<01:23, 5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 83/500 [00:18<01:23, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 83/500 [00:18<01:23, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 84/500 [00:18<01:22, 5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 84/500 [00:18<01:22, 5.02it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 85/500 [00:18<01:22, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 85/500 [00:18<01:22, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 86/500 [00:18<01:22, 5.00it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 86/500 [00:18<01:22, 5.00it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 87/500 [00:18<01:22, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 17%|█▋ | 87/500 [00:19<01:22, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 88/500 [00:19<01:22, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 88/500 [00:19<01:22, 5.01it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 89/500 [00:19<01:34, 4.37it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 89/500 [00:19<01:34, 4.37it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 90/500 [00:19<01:30, 4.51it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 90/500 [00:19<01:30, 4.51it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 91/500 [00:19<01:27, 4.66it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 91/500 [00:20<01:27, 4.66it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 92/500 [00:20<01:26, 4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 18%|█▊ | 92/500 [00:20<01:26, 4.74it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▊ | 93/500 [00:20<01:24, 4.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▊ | 93/500 [00:20<01:24, 4.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 94/500 [00:20<01:23, 4.89it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 94/500 [00:20<01:23, 4.89it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 95/500 [00:20<01:22, 4.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 95/500 [00:20<01:22, 4.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 96/500 [00:20<01:22, 4.93it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 96/500 [00:21<01:22, 4.93it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 97/500 [00:21<01:21, 4.97it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 19%|█▉ | 97/500 [00:21<01:21, 4.97it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 20%|█▉ | 98/500 [00:21<01:20, 4.99it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 20%|█▉ | 98/500 [00:21<01:20, 4.99it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 20%|█▉ | 99/500 [00:21<01:08, 5.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 20%|█▉ | 99/500 [00:21<01:08, 5.83it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.57e-01 | LR: 5.00e+01: 20%|██ | 100/500 [00:21<01:12, 5.54it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 20%|██ | 100/500 [00:21<01:12, 5.54it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 20%|██ | 101/500 [00:21<01:13, 5.40it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 20%|██ | 101/500 [00:22<01:13, 5.40it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 20%|██ | 102/500 [00:22<01:27, 4.55it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 20%|██ | 102/500 [00:22<01:27, 4.55it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 103/500 [00:22<01:36, 4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 103/500 [00:22<01:36, 4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 104/500 [00:22<01:31, 4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 104/500 [00:22<01:31, 4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 105/500 [00:22<01:27, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 105/500 [00:23<01:27, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 106/500 [00:23<01:36, 4.10it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██ | 106/500 [00:23<01:36, 4.10it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██▏ | 107/500 [00:23<01:19, 4.94it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 21%|██▏ | 107/500 [00:23<01:19, 4.94it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 108/500 [00:23<01:19, 4.95it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 108/500 [00:23<01:19, 4.95it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 109/500 [00:23<01:30, 4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 109/500 [00:23<01:30, 4.33it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 110/500 [00:23<01:26, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 110/500 [00:24<01:26, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 111/500 [00:24<01:35, 4.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 111/500 [00:24<01:35, 4.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 112/500 [00:24<01:29, 4.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 22%|██▏ | 112/500 [00:24<01:29, 4.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 113/500 [00:24<01:44, 3.69it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 113/500 [00:24<01:44, 3.69it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 114/500 [00:24<01:40, 3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 114/500 [00:25<01:40, 3.84it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 115/500 [00:25<01:33, 4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 115/500 [00:25<01:33, 4.11it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 116/500 [00:25<01:39, 3.85it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 116/500 [00:25<01:39, 3.85it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 117/500 [00:25<01:32, 4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 23%|██▎ | 117/500 [00:25<01:32, 4.13it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▎ | 118/500 [00:25<01:27, 4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▎ | 118/500 [00:26<01:27, 4.36it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 119/500 [00:26<01:24, 4.53it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 119/500 [00:26<01:24, 4.53it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 120/500 [00:26<01:21, 4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 120/500 [00:26<01:21, 4.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 121/500 [00:26<01:30, 4.19it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 121/500 [00:26<01:30, 4.19it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 122/500 [00:26<01:14, 5.04it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 24%|██▍ | 122/500 [00:26<01:14, 5.04it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▍ | 123/500 [00:26<01:14, 5.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▍ | 123/500 [00:27<01:14, 5.06it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▍ | 124/500 [00:27<01:14, 5.05it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▍ | 124/500 [00:27<01:14, 5.05it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▌ | 125/500 [00:27<01:03, 5.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▌ | 125/500 [00:27<01:03, 5.91it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▌ | 126/500 [00:27<00:56, 6.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▌ | 126/500 [00:27<00:56, 6.64it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▌ | 127/500 [00:27<01:01, 6.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 25%|██▌ | 127/500 [00:27<01:01, 6.09it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 128/500 [00:27<01:05, 5.72it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 128/500 [00:27<01:05, 5.72it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 129/500 [00:27<01:07, 5.47it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 129/500 [00:28<01:07, 5.47it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 130/500 [00:28<01:09, 5.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 130/500 [00:28<01:09, 5.32it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 131/500 [00:28<01:21, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▌ | 131/500 [00:28<01:21, 4.52it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▋ | 132/500 [00:28<01:29, 4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 26%|██▋ | 132/500 [00:28<01:29, 4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 133/500 [00:28<01:25, 4.31it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 133/500 [00:29<01:25, 4.31it/s]
[TorchDR] TSNE: Loss: 1.45e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 134/500 [00:29<01:21, 4.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 134/500 [00:29<01:21, 4.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 135/500 [00:29<01:29, 4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 135/500 [00:29<01:29, 4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 136/500 [00:29<01:24, 4.31it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 136/500 [00:29<01:24, 4.31it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 137/500 [00:29<01:21, 4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 27%|██▋ | 137/500 [00:30<01:21, 4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 138/500 [00:30<01:17, 4.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 138/500 [00:30<01:17, 4.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 139/500 [00:30<01:26, 4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 139/500 [00:30<01:26, 4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 140/500 [00:30<01:11, 5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 140/500 [00:30<01:11, 5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 141/500 [00:30<01:22, 4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 141/500 [00:30<01:22, 4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 142/500 [00:30<01:08, 5.24it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 28%|██▊ | 142/500 [00:31<01:08, 5.24it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▊ | 143/500 [00:31<01:09, 5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▊ | 143/500 [00:31<01:09, 5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 144/500 [00:31<01:19, 4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 144/500 [00:31<01:19, 4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 145/500 [00:31<01:07, 5.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 145/500 [00:31<01:07, 5.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 146/500 [00:31<01:18, 4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 146/500 [00:31<01:18, 4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 147/500 [00:31<01:15, 4.65it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 29%|██▉ | 147/500 [00:32<01:15, 4.65it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 30%|██▉ | 148/500 [00:32<01:14, 4.74it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 30%|██▉ | 148/500 [00:32<01:14, 4.74it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 30%|██▉ | 149/500 [00:32<01:13, 4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 30%|██▉ | 149/500 [00:32<01:13, 4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.50e-01 | LR: 5.00e+01: 30%|███ | 150/500 [00:32<01:32, 3.78it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 30%|███ | 150/500 [00:32<01:32, 3.78it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 30%|███ | 151/500 [00:32<01:35, 3.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 30%|███ | 151/500 [00:33<01:35, 3.64it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 30%|███ | 152/500 [00:33<01:38, 3.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 30%|███ | 152/500 [00:33<01:38, 3.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 153/500 [00:33<01:29, 3.87it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 153/500 [00:33<01:29, 3.87it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 154/500 [00:33<01:23, 4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 154/500 [00:33<01:23, 4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 155/500 [00:33<01:18, 4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 155/500 [00:34<01:18, 4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 156/500 [00:34<01:15, 4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███ | 156/500 [00:34<01:15, 4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███▏ | 157/500 [00:34<01:23, 4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 31%|███▏ | 157/500 [00:34<01:23, 4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 158/500 [00:34<01:18, 4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 158/500 [00:34<01:18, 4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 159/500 [00:34<01:25, 3.99it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 159/500 [00:35<01:25, 3.99it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 160/500 [00:35<01:10, 4.83it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 160/500 [00:35<01:10, 4.83it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 161/500 [00:35<01:19, 4.27it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 161/500 [00:35<01:19, 4.27it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 162/500 [00:35<01:25, 3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 32%|███▏ | 162/500 [00:35<01:25, 3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 163/500 [00:35<01:10, 4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 163/500 [00:35<01:10, 4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 164/500 [00:35<01:09, 4.84it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 164/500 [00:36<01:09, 4.84it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 165/500 [00:36<01:17, 4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 165/500 [00:36<01:17, 4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 166/500 [00:36<01:04, 5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 166/500 [00:36<01:04, 5.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 167/500 [00:36<01:05, 5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 33%|███▎ | 167/500 [00:36<01:05, 5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▎ | 168/500 [00:36<01:14, 4.43it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▎ | 168/500 [00:37<01:14, 4.43it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 169/500 [00:37<01:12, 4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 169/500 [00:37<01:12, 4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 170/500 [00:37<01:10, 4.67it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 170/500 [00:37<01:10, 4.67it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 171/500 [00:37<01:18, 4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 171/500 [00:37<01:18, 4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 172/500 [00:37<01:14, 4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 34%|███▍ | 172/500 [00:37<01:14, 4.37it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▍ | 173/500 [00:37<01:11, 4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▍ | 173/500 [00:38<01:11, 4.54it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▍ | 174/500 [00:38<01:19, 4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▍ | 174/500 [00:38<01:19, 4.10it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▌ | 175/500 [00:38<01:14, 4.34it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▌ | 175/500 [00:38<01:14, 4.34it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▌ | 176/500 [00:38<01:02, 5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▌ | 176/500 [00:38<01:02, 5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▌ | 177/500 [00:38<01:02, 5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 35%|███▌ | 177/500 [00:38<01:02, 5.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 178/500 [00:38<01:02, 5.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 178/500 [00:39<01:02, 5.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 179/500 [00:39<01:03, 5.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 179/500 [00:39<01:03, 5.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 180/500 [00:39<01:09, 4.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 180/500 [00:39<01:09, 4.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 181/500 [00:39<01:01, 5.22it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▌ | 181/500 [00:39<01:01, 5.22it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▋ | 182/500 [00:39<01:11, 4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 36%|███▋ | 182/500 [00:40<01:11, 4.47it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 183/500 [00:40<01:09, 4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 183/500 [00:40<01:09, 4.56it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 184/500 [00:40<01:16, 4.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 184/500 [00:40<01:16, 4.13it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 185/500 [00:40<01:12, 4.36it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 185/500 [00:40<01:12, 4.36it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 186/500 [00:40<01:09, 4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 186/500 [00:41<01:09, 4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 187/500 [00:41<01:16, 4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 37%|███▋ | 187/500 [00:41<01:16, 4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 188/500 [00:41<01:12, 4.32it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 188/500 [00:41<01:12, 4.32it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 189/500 [00:41<01:18, 3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 189/500 [00:41<01:18, 3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 190/500 [00:41<01:13, 4.23it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 190/500 [00:41<01:13, 4.23it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 191/500 [00:41<01:09, 4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 191/500 [00:42<01:09, 4.45it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 192/500 [00:42<01:07, 4.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 38%|███▊ | 192/500 [00:42<01:07, 4.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▊ | 193/500 [00:42<01:05, 4.71it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▊ | 193/500 [00:42<01:05, 4.71it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 194/500 [00:42<01:13, 4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 194/500 [00:42<01:13, 4.18it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 195/500 [00:42<01:18, 3.91it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 195/500 [00:43<01:18, 3.91it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 196/500 [00:43<01:13, 4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 196/500 [00:43<01:13, 4.15it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 197/500 [00:43<01:17, 3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 39%|███▉ | 197/500 [00:43<01:17, 3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|███▉ | 198/500 [00:43<01:21, 3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|███▉ | 198/500 [00:43<01:21, 3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|███▉ | 199/500 [00:43<01:06, 4.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|███▉ | 199/500 [00:43<01:06, 4.55it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|████ | 200/500 [00:43<01:04, 4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|████ | 200/500 [00:44<01:04, 4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|████ | 201/500 [00:44<01:11, 4.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|████ | 201/500 [00:44<01:11, 4.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|████ | 202/500 [00:44<01:16, 3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 40%|████ | 202/500 [00:44<01:16, 3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 203/500 [00:44<01:11, 4.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 203/500 [00:44<01:11, 4.16it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 204/500 [00:44<00:59, 5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 204/500 [00:45<00:59, 5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 205/500 [00:45<00:59, 5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 205/500 [00:45<00:59, 5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 206/500 [00:45<01:07, 4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████ | 206/500 [00:45<01:07, 4.38it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████▏ | 207/500 [00:45<01:04, 4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 41%|████▏ | 207/500 [00:45<01:04, 4.53it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 208/500 [00:45<01:11, 4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 208/500 [00:46<01:11, 4.11it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 209/500 [00:46<01:15, 3.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 209/500 [00:46<01:15, 3.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 210/500 [00:46<01:01, 4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 210/500 [00:46<01:01, 4.68it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 211/500 [00:46<01:08, 4.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 211/500 [00:46<01:08, 4.20it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 212/500 [00:46<01:14, 3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 42%|████▏ | 212/500 [00:47<01:14, 3.89it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 213/500 [00:47<01:17, 3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 213/500 [00:47<01:17, 3.70it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 214/500 [00:47<01:19, 3.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 214/500 [00:47<01:19, 3.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 215/500 [00:47<01:04, 4.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 215/500 [00:47<01:04, 4.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 216/500 [00:47<01:02, 4.57it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 216/500 [00:47<01:02, 4.57it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 217/500 [00:47<00:59, 4.72it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 43%|████▎ | 217/500 [00:48<00:59, 4.72it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▎ | 218/500 [00:48<01:06, 4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▎ | 218/500 [00:48<01:06, 4.21it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 219/500 [00:48<00:55, 5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 219/500 [00:48<00:55, 5.07it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 220/500 [00:48<00:55, 5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 220/500 [00:48<00:55, 5.04it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 221/500 [00:48<00:55, 5.03it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 221/500 [00:48<00:55, 5.03it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 222/500 [00:48<00:55, 5.02it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 44%|████▍ | 222/500 [00:49<00:55, 5.02it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▍ | 223/500 [00:49<00:55, 5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▍ | 223/500 [00:49<00:55, 5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▍ | 224/500 [00:49<00:55, 5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▍ | 224/500 [00:49<00:55, 5.00it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▌ | 225/500 [00:49<01:00, 4.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▌ | 225/500 [00:49<01:00, 4.58it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▌ | 226/500 [00:49<01:09, 3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▌ | 226/500 [00:50<01:09, 3.95it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▌ | 227/500 [00:50<00:57, 4.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 45%|████▌ | 227/500 [00:50<00:57, 4.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 228/500 [00:50<01:11, 3.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 228/500 [00:50<01:11, 3.79it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 229/500 [00:50<01:06, 4.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 229/500 [00:50<01:06, 4.06it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 230/500 [00:50<01:02, 4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 230/500 [00:51<01:02, 4.30it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 231/500 [00:51<01:07, 3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▌ | 231/500 [00:51<01:07, 3.97it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▋ | 232/500 [00:51<00:55, 4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 46%|████▋ | 232/500 [00:51<00:55, 4.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 233/500 [00:51<01:10, 3.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 233/500 [00:51<01:10, 3.80it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 234/500 [00:51<01:04, 4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 234/500 [00:52<01:04, 4.09it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 235/500 [00:52<01:01, 4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 235/500 [00:52<01:01, 4.33it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 236/500 [00:52<00:58, 4.49it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 236/500 [00:52<00:58, 4.49it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 237/500 [00:52<00:56, 4.62it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 47%|████▋ | 237/500 [00:52<00:56, 4.62it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 238/500 [00:52<01:02, 4.17it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 238/500 [00:52<01:02, 4.17it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 239/500 [00:52<00:52, 5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 239/500 [00:53<00:52, 5.01it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 240/500 [00:53<00:51, 5.05it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 240/500 [00:53<00:51, 5.05it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 241/500 [00:53<00:44, 5.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 241/500 [00:53<00:44, 5.85it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 242/500 [00:53<00:46, 5.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 48%|████▊ | 242/500 [00:53<00:46, 5.59it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▊ | 243/500 [00:53<00:44, 5.75it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▊ | 243/500 [00:53<00:44, 5.75it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 244/500 [00:53<00:46, 5.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 244/500 [00:53<00:46, 5.51it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 245/500 [00:53<00:42, 5.98it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 245/500 [00:54<00:42, 5.98it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 246/500 [00:54<00:45, 5.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 246/500 [00:54<00:45, 5.63it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 247/500 [00:54<00:46, 5.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 49%|████▉ | 247/500 [00:54<00:46, 5.42it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|████▉ | 248/500 [00:54<00:47, 5.29it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|████▉ | 248/500 [00:54<00:47, 5.29it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|████▉ | 249/500 [00:54<00:48, 5.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|████▉ | 249/500 [00:54<00:48, 5.19it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|█████ | 250/500 [00:54<00:45, 5.44it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|█████ | 250/500 [00:55<00:45, 5.44it/s]
[TorchDR] TSNE: Loss: 1.44e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|█████ | 251/500 [00:55<00:49, 4.99it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|█████ | 251/500 [00:55<00:49, 4.99it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|█████ | 252/500 [00:55<00:49, 5.00it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 50%|█████ | 252/500 [00:55<00:49, 5.00it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 253/500 [00:55<00:49, 4.98it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 253/500 [00:55<00:49, 4.98it/s]
[TorchDR] TSNE: Loss: 1.29e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 254/500 [00:55<00:49, 5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 254/500 [00:55<00:49, 5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 255/500 [00:55<00:49, 5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 255/500 [00:56<00:49, 5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 256/500 [00:56<00:48, 5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████ | 256/500 [00:56<00:48, 5.00it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████▏ | 257/500 [00:56<00:55, 4.37it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 51%|█████▏ | 257/500 [00:56<00:55, 4.37it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 258/500 [00:56<00:46, 5.23it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 258/500 [00:56<00:46, 5.23it/s]
[TorchDR] TSNE: Loss: 1.28e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 259/500 [00:56<00:46, 5.17it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 259/500 [00:56<00:46, 5.17it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 260/500 [00:56<00:53, 4.46it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 260/500 [00:57<00:53, 4.46it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 261/500 [00:57<00:44, 5.32it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 261/500 [00:57<00:44, 5.32it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 262/500 [00:57<00:45, 5.21it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 52%|█████▏ | 262/500 [00:57<00:45, 5.21it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 263/500 [00:57<00:45, 5.15it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 263/500 [00:57<00:45, 5.15it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 264/500 [00:57<00:39, 5.98it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 264/500 [00:57<00:39, 5.98it/s]
[TorchDR] TSNE: Loss: 1.27e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 265/500 [00:57<00:41, 5.68it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 265/500 [00:57<00:41, 5.68it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 266/500 [00:57<00:36, 6.47it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 266/500 [00:58<00:36, 6.47it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 267/500 [00:58<00:32, 7.21it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 53%|█████▎ | 267/500 [00:58<00:32, 7.21it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▎ | 268/500 [00:58<00:35, 6.45it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▎ | 268/500 [00:58<00:35, 6.45it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 269/500 [00:58<00:32, 7.15it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 269/500 [00:58<00:32, 7.15it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 270/500 [00:58<00:35, 6.40it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 270/500 [00:58<00:35, 6.40it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 271/500 [00:58<00:38, 5.89it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 271/500 [00:58<00:38, 5.89it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 272/500 [00:58<00:47, 4.82it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 54%|█████▍ | 272/500 [00:59<00:47, 4.82it/s]
[TorchDR] TSNE: Loss: 1.26e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▍ | 273/500 [00:59<00:46, 4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▍ | 273/500 [00:59<00:46, 4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▍ | 274/500 [00:59<00:39, 5.73it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▍ | 274/500 [00:59<00:39, 5.73it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▌ | 275/500 [00:59<00:34, 6.53it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▌ | 275/500 [00:59<00:34, 6.53it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▌ | 276/500 [00:59<00:37, 5.96it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▌ | 276/500 [00:59<00:37, 5.96it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▌ | 277/500 [00:59<00:45, 4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 55%|█████▌ | 277/500 [01:00<00:45, 4.87it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 278/500 [01:00<00:38, 5.71it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 278/500 [01:00<00:38, 5.71it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 279/500 [01:00<00:40, 5.51it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 279/500 [01:00<00:40, 5.51it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 280/500 [01:00<00:41, 5.33it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 280/500 [01:00<00:41, 5.33it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 281/500 [01:00<00:41, 5.23it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▌ | 281/500 [01:00<00:41, 5.23it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▋ | 282/500 [01:00<00:42, 5.16it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 56%|█████▋ | 282/500 [01:00<00:42, 5.16it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 283/500 [01:00<00:42, 5.11it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 283/500 [01:01<00:42, 5.11it/s]
[TorchDR] TSNE: Loss: 1.25e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 284/500 [01:01<00:36, 5.92it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 284/500 [01:01<00:36, 5.92it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 285/500 [01:01<00:38, 5.66it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 285/500 [01:01<00:38, 5.66it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 286/500 [01:01<00:39, 5.46it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 286/500 [01:01<00:39, 5.46it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 287/500 [01:01<00:40, 5.32it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 57%|█████▋ | 287/500 [01:01<00:40, 5.32it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 288/500 [01:01<00:34, 6.17it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 288/500 [01:01<00:34, 6.17it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 289/500 [01:01<00:36, 5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 289/500 [01:02<00:36, 5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 290/500 [01:02<00:32, 6.53it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 290/500 [01:02<00:32, 6.53it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 291/500 [01:02<00:34, 6.03it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 291/500 [01:02<00:34, 6.03it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 292/500 [01:02<00:30, 6.77it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 58%|█████▊ | 292/500 [01:02<00:30, 6.77it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▊ | 293/500 [01:02<00:33, 6.18it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▊ | 293/500 [01:02<00:33, 6.18it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 294/500 [01:02<00:35, 5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 294/500 [01:03<00:35, 5.78it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 295/500 [01:03<00:37, 5.50it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 295/500 [01:03<00:37, 5.50it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 296/500 [01:03<00:38, 5.35it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 296/500 [01:03<00:38, 5.35it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 297/500 [01:03<00:38, 5.24it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 59%|█████▉ | 297/500 [01:03<00:38, 5.24it/s]
[TorchDR] TSNE: Loss: 1.24e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 60%|█████▉ | 298/500 [01:03<00:44, 4.50it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 60%|█████▉ | 298/500 [01:03<00:44, 4.50it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 60%|█████▉ | 299/500 [01:03<00:37, 5.36it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 60%|█████▉ | 299/500 [01:03<00:37, 5.36it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 3.48e-01 | LR: 5.00e+01: 60%|██████ | 300/500 [01:04<00:38, 5.24it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 60%|██████ | 300/500 [01:04<00:38, 5.24it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 60%|██████ | 301/500 [01:04<00:38, 5.19it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 60%|██████ | 301/500 [01:04<00:38, 5.19it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 60%|██████ | 302/500 [01:04<00:32, 6.04it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 60%|██████ | 302/500 [01:04<00:32, 6.04it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 303/500 [01:04<00:34, 5.66it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 303/500 [01:04<00:34, 5.66it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 304/500 [01:04<00:35, 5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 304/500 [01:04<00:35, 5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 305/500 [01:04<00:36, 5.28it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 305/500 [01:05<00:36, 5.28it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 306/500 [01:05<00:37, 5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████ | 306/500 [01:05<00:37, 5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████▏ | 307/500 [01:05<00:37, 5.18it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 61%|██████▏ | 307/500 [01:05<00:37, 5.18it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 308/500 [01:05<00:37, 5.08it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 308/500 [01:05<00:37, 5.08it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 309/500 [01:05<00:32, 5.95it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 309/500 [01:05<00:32, 5.95it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 310/500 [01:05<00:33, 5.68it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 310/500 [01:05<00:33, 5.68it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 311/500 [01:05<00:34, 5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 311/500 [01:06<00:34, 5.46it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 312/500 [01:06<00:29, 6.29it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 62%|██████▏ | 312/500 [01:06<00:29, 6.29it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 313/500 [01:06<00:26, 7.01it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 313/500 [01:06<00:26, 7.01it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 314/500 [01:06<00:34, 5.34it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 314/500 [01:06<00:34, 5.34it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 315/500 [01:06<00:35, 5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 315/500 [01:06<00:35, 5.22it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 316/500 [01:06<00:35, 5.12it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 316/500 [01:07<00:35, 5.12it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 317/500 [01:07<00:35, 5.09it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 63%|██████▎ | 317/500 [01:07<00:35, 5.09it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▎ | 318/500 [01:07<00:33, 5.38it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▎ | 318/500 [01:07<00:33, 5.38it/s]
[TorchDR] TSNE: Loss: 1.23e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 319/500 [01:07<00:30, 5.84it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 319/500 [01:07<00:30, 5.84it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 320/500 [01:07<00:32, 5.57it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 320/500 [01:07<00:32, 5.57it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 321/500 [01:07<00:38, 4.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 321/500 [01:08<00:38, 4.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 322/500 [01:08<00:37, 4.76it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 64%|██████▍ | 322/500 [01:08<00:37, 4.76it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▍ | 323/500 [01:08<00:31, 5.59it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▍ | 323/500 [01:08<00:31, 5.59it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▍ | 324/500 [01:08<00:32, 5.39it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▍ | 324/500 [01:08<00:32, 5.39it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▌ | 325/500 [01:08<00:38, 4.54it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▌ | 325/500 [01:08<00:38, 4.54it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▌ | 326/500 [01:08<00:37, 4.69it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▌ | 326/500 [01:09<00:37, 4.69it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▌ | 327/500 [01:09<00:36, 4.79it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 65%|██████▌ | 327/500 [01:09<00:36, 4.79it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 328/500 [01:09<00:35, 4.86it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 328/500 [01:09<00:35, 4.86it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 329/500 [01:09<00:29, 5.72it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 329/500 [01:09<00:29, 5.72it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 330/500 [01:09<00:31, 5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 330/500 [01:09<00:31, 5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 331/500 [01:09<00:31, 5.33it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▌ | 331/500 [01:09<00:31, 5.33it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▋ | 332/500 [01:09<00:32, 5.24it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 66%|██████▋ | 332/500 [01:10<00:32, 5.24it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 333/500 [01:10<00:27, 6.05it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 333/500 [01:10<00:27, 6.05it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 334/500 [01:10<00:29, 5.71it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 334/500 [01:10<00:29, 5.71it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 335/500 [01:10<00:30, 5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 335/500 [01:10<00:30, 5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 336/500 [01:10<00:30, 5.34it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 336/500 [01:10<00:30, 5.34it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 337/500 [01:10<00:29, 5.56it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 67%|██████▋ | 337/500 [01:11<00:29, 5.56it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 338/500 [01:11<00:27, 5.99it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 338/500 [01:11<00:27, 5.99it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 339/500 [01:11<00:28, 5.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 339/500 [01:11<00:28, 5.66it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 340/500 [01:11<00:29, 5.45it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 340/500 [01:11<00:29, 5.45it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 341/500 [01:11<00:29, 5.31it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 341/500 [01:11<00:29, 5.31it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 342/500 [01:11<00:25, 6.10it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 68%|██████▊ | 342/500 [01:12<00:25, 6.10it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▊ | 343/500 [01:12<00:31, 4.92it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▊ | 343/500 [01:12<00:31, 4.92it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 344/500 [01:12<00:31, 4.95it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 344/500 [01:12<00:31, 4.95it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 345/500 [01:12<00:31, 4.96it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 345/500 [01:12<00:31, 4.96it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 346/500 [01:12<00:30, 4.97it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 346/500 [01:12<00:30, 4.97it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 347/500 [01:12<00:28, 5.28it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 69%|██████▉ | 347/500 [01:12<00:28, 5.28it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 70%|██████▉ | 348/500 [01:12<00:26, 5.74it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 70%|██████▉ | 348/500 [01:13<00:26, 5.74it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 70%|██████▉ | 349/500 [01:13<00:27, 5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 70%|██████▉ | 349/500 [01:13<00:27, 5.48it/s]
[TorchDR] TSNE: Loss: 1.22e+01 | Grad norm: 7.45e-03 | LR: 5.00e+01: 70%|███████ | 350/500 [01:13<00:28, 5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 70%|███████ | 350/500 [01:13<00:28, 5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 70%|███████ | 351/500 [01:13<00:24, 6.16it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 70%|███████ | 351/500 [01:13<00:24, 6.16it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 70%|███████ | 352/500 [01:13<00:25, 5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 70%|███████ | 352/500 [01:13<00:25, 5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 353/500 [01:13<00:24, 5.91it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 353/500 [01:13<00:24, 5.91it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 354/500 [01:13<00:23, 6.28it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 354/500 [01:14<00:23, 6.28it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 355/500 [01:14<00:20, 6.99it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 355/500 [01:14<00:20, 6.99it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 356/500 [01:14<00:22, 6.30it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████ | 356/500 [01:14<00:22, 6.30it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████▏ | 357/500 [01:14<00:20, 7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 71%|███████▏ | 357/500 [01:14<00:20, 7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 358/500 [01:14<00:22, 6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 358/500 [01:14<00:22, 6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 359/500 [01:14<00:20, 7.00it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 359/500 [01:14<00:20, 7.00it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 360/500 [01:14<00:22, 6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 360/500 [01:14<00:22, 6.31it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 361/500 [01:14<00:19, 7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 361/500 [01:15<00:19, 7.02it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 362/500 [01:15<00:21, 6.33it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 72%|███████▏ | 362/500 [01:15<00:21, 6.33it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 363/500 [01:15<00:19, 7.04it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 363/500 [01:15<00:19, 7.04it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 364/500 [01:15<00:21, 6.34it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 364/500 [01:15<00:21, 6.34it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 365/500 [01:15<00:23, 5.86it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 365/500 [01:15<00:23, 5.86it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 366/500 [01:15<00:20, 6.61it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 366/500 [01:15<00:20, 6.61it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 367/500 [01:15<00:21, 6.10it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 73%|███████▎ | 367/500 [01:16<00:21, 6.10it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▎ | 368/500 [01:16<00:23, 5.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▎ | 368/500 [01:16<00:23, 5.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 369/500 [01:16<00:20, 6.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 369/500 [01:16<00:20, 6.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 370/500 [01:16<00:17, 7.23it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 370/500 [01:16<00:17, 7.23it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 371/500 [01:16<00:20, 6.45it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 371/500 [01:16<00:20, 6.45it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 372/500 [01:16<00:21, 5.95it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 74%|███████▍ | 372/500 [01:16<00:21, 5.95it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▍ | 373/500 [01:16<00:18, 6.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▍ | 373/500 [01:16<00:18, 6.74it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▍ | 374/500 [01:16<00:17, 7.41it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▍ | 374/500 [01:17<00:17, 7.41it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▌ | 375/500 [01:17<00:19, 6.56it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▌ | 375/500 [01:17<00:19, 6.56it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▌ | 376/500 [01:17<00:17, 7.24it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▌ | 376/500 [01:17<00:17, 7.24it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▌ | 377/500 [01:17<00:22, 5.40it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 75%|███████▌ | 377/500 [01:17<00:22, 5.40it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 378/500 [01:17<00:19, 6.22it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 378/500 [01:17<00:19, 6.22it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 379/500 [01:17<00:17, 6.94it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 379/500 [01:17<00:17, 6.94it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 380/500 [01:17<00:19, 6.27it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 380/500 [01:18<00:19, 6.27it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 381/500 [01:18<00:20, 5.85it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▌ | 381/500 [01:18<00:20, 5.85it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▋ | 382/500 [01:18<00:21, 5.58it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 76%|███████▋ | 382/500 [01:18<00:21, 5.58it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 383/500 [01:18<00:21, 5.38it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 383/500 [01:18<00:21, 5.38it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 384/500 [01:18<00:18, 6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 384/500 [01:18<00:18, 6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 385/500 [01:18<00:19, 5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 385/500 [01:18<00:19, 5.81it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 386/500 [01:18<00:20, 5.55it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 386/500 [01:19<00:20, 5.55it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 387/500 [01:19<00:20, 5.39it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 77%|███████▋ | 387/500 [01:19<00:20, 5.39it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 388/500 [01:19<00:21, 5.26it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 388/500 [01:19<00:21, 5.26it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 389/500 [01:19<00:21, 5.18it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 389/500 [01:19<00:21, 5.18it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 390/500 [01:19<00:21, 5.13it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 390/500 [01:19<00:21, 5.13it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 391/500 [01:19<00:21, 5.09it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 391/500 [01:20<00:21, 5.09it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 392/500 [01:20<00:18, 5.92it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 78%|███████▊ | 392/500 [01:20<00:18, 5.92it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▊ | 393/500 [01:20<00:22, 4.82it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▊ | 393/500 [01:20<00:22, 4.82it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 394/500 [01:20<00:21, 4.88it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 394/500 [01:20<00:21, 4.88it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 395/500 [01:20<00:18, 5.71it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 395/500 [01:20<00:18, 5.71it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 396/500 [01:20<00:18, 5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 396/500 [01:21<00:18, 5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 397/500 [01:21<00:19, 5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 79%|███████▉ | 397/500 [01:21<00:19, 5.35it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 80%|███████▉ | 398/500 [01:21<00:16, 6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 80%|███████▉ | 398/500 [01:21<00:16, 6.19it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 80%|███████▉ | 399/500 [01:21<00:17, 5.78it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 80%|███████▉ | 399/500 [01:21<00:17, 5.78it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 4.99e-03 | LR: 5.00e+01: 80%|████████ | 400/500 [01:21<00:18, 5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 80%|████████ | 400/500 [01:21<00:18, 5.53it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 80%|████████ | 401/500 [01:21<00:21, 4.64it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 80%|████████ | 401/500 [01:22<00:21, 4.64it/s]
[TorchDR] TSNE: Loss: 1.21e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 80%|████████ | 402/500 [01:22<00:17, 5.48it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 80%|████████ | 402/500 [01:22<00:17, 5.48it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 403/500 [01:22<00:18, 5.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 403/500 [01:22<00:18, 5.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 404/500 [01:22<00:18, 5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 404/500 [01:22<00:18, 5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 405/500 [01:22<00:15, 6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 405/500 [01:22<00:15, 6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 406/500 [01:22<00:16, 5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████ | 406/500 [01:22<00:16, 5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████▏ | 407/500 [01:22<00:14, 6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 81%|████████▏ | 407/500 [01:22<00:14, 6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 408/500 [01:22<00:15, 6.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 408/500 [01:23<00:15, 6.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 409/500 [01:23<00:13, 6.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 409/500 [01:23<00:13, 6.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 410/500 [01:23<00:14, 6.14it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 410/500 [01:23<00:14, 6.14it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 411/500 [01:23<00:15, 5.79it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 411/500 [01:23<00:15, 5.79it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 412/500 [01:23<00:13, 6.58it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 82%|████████▏ | 412/500 [01:23<00:13, 6.58it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 413/500 [01:23<00:14, 6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 413/500 [01:23<00:14, 6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 414/500 [01:23<00:12, 6.83it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 414/500 [01:24<00:12, 6.83it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 415/500 [01:24<00:16, 5.22it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 415/500 [01:24<00:16, 5.22it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 416/500 [01:24<00:13, 6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 416/500 [01:24<00:13, 6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 417/500 [01:24<00:14, 5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 83%|████████▎ | 417/500 [01:24<00:14, 5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▎ | 418/500 [01:24<00:14, 5.50it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▎ | 418/500 [01:24<00:14, 5.50it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 419/500 [01:24<00:12, 6.28it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 419/500 [01:24<00:12, 6.28it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 420/500 [01:25<00:13, 5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 420/500 [01:25<00:13, 5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 421/500 [01:25<00:14, 5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 421/500 [01:25<00:14, 5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 422/500 [01:25<00:14, 5.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 84%|████████▍ | 422/500 [01:25<00:14, 5.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▍ | 423/500 [01:25<00:12, 6.29it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▍ | 423/500 [01:25<00:12, 6.29it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▍ | 424/500 [01:25<00:10, 6.97it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▍ | 424/500 [01:25<00:10, 6.97it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▌ | 425/500 [01:25<00:11, 6.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▌ | 425/500 [01:25<00:11, 6.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▌ | 426/500 [01:25<00:10, 6.98it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▌ | 426/500 [01:26<00:10, 6.98it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▌ | 427/500 [01:26<00:11, 6.31it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 85%|████████▌ | 427/500 [01:26<00:11, 6.31it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 428/500 [01:26<00:10, 7.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 428/500 [01:26<00:10, 7.02it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 429/500 [01:26<00:11, 6.34it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 429/500 [01:26<00:11, 6.34it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 430/500 [01:26<00:11, 5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 430/500 [01:26<00:11, 5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 431/500 [01:26<00:10, 6.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▌ | 431/500 [01:26<00:10, 6.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▋ | 432/500 [01:26<00:11, 6.10it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 86%|████████▋ | 432/500 [01:27<00:11, 6.10it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 433/500 [01:27<00:11, 5.73it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 433/500 [01:27<00:11, 5.73it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 434/500 [01:27<00:10, 6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 434/500 [01:27<00:10, 6.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 435/500 [01:27<00:10, 6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 435/500 [01:27<00:10, 6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 436/500 [01:27<00:09, 6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 436/500 [01:27<00:09, 6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 437/500 [01:27<00:08, 7.41it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 87%|████████▋ | 437/500 [01:27<00:08, 7.41it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 438/500 [01:27<00:09, 6.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 438/500 [01:27<00:09, 6.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 439/500 [01:27<00:10, 6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 439/500 [01:28<00:10, 6.00it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 440/500 [01:28<00:10, 5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 440/500 [01:28<00:10, 5.67it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 441/500 [01:28<00:09, 6.45it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 441/500 [01:28<00:09, 6.45it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 442/500 [01:28<00:08, 7.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 88%|████████▊ | 442/500 [01:28<00:08, 7.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▊ | 443/500 [01:28<00:08, 6.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▊ | 443/500 [01:28<00:08, 6.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 444/500 [01:28<00:11, 5.07it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 444/500 [01:29<00:11, 5.07it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 445/500 [01:29<00:10, 5.05it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 445/500 [01:29<00:10, 5.05it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 446/500 [01:29<00:09, 5.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 446/500 [01:29<00:09, 5.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 447/500 [01:29<00:09, 5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 89%|████████▉ | 447/500 [01:29<00:09, 5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 90%|████████▉ | 448/500 [01:29<00:09, 5.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 90%|████████▉ | 448/500 [01:29<00:09, 5.40it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 90%|████████▉ | 449/500 [01:29<00:08, 6.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 90%|████████▉ | 449/500 [01:29<00:08, 6.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.89e-03 | LR: 5.00e+01: 90%|█████████ | 450/500 [01:29<00:08, 5.81it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 90%|█████████ | 450/500 [01:30<00:08, 5.81it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 90%|█████████ | 451/500 [01:30<00:08, 5.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 90%|█████████ | 451/500 [01:30<00:08, 5.54it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 90%|█████████ | 452/500 [01:30<00:08, 5.37it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 90%|█████████ | 452/500 [01:30<00:08, 5.37it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 453/500 [01:30<00:08, 5.27it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 453/500 [01:30<00:08, 5.27it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 454/500 [01:30<00:08, 5.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 454/500 [01:30<00:08, 5.20it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 455/500 [01:30<00:07, 6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 455/500 [01:30<00:07, 6.04it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 456/500 [01:30<00:06, 6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████ | 456/500 [01:31<00:06, 6.78it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████▏| 457/500 [01:31<00:06, 6.18it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 91%|█████████▏| 457/500 [01:31<00:06, 6.18it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 458/500 [01:31<00:07, 5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 458/500 [01:31<00:07, 5.75it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 459/500 [01:31<00:07, 5.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 459/500 [01:31<00:07, 5.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 460/500 [01:31<00:07, 5.33it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 460/500 [01:32<00:07, 5.33it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 461/500 [01:32<00:08, 4.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 461/500 [01:32<00:08, 4.53it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 462/500 [01:32<00:08, 4.66it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 92%|█████████▏| 462/500 [01:32<00:08, 4.66it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 463/500 [01:32<00:07, 4.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 463/500 [01:32<00:07, 4.77it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 464/500 [01:32<00:07, 4.84it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 464/500 [01:32<00:07, 4.84it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 465/500 [01:32<00:07, 4.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 465/500 [01:32<00:07, 4.89it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 466/500 [01:32<00:06, 4.93it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 466/500 [01:33<00:06, 4.93it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 467/500 [01:33<00:05, 5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 93%|█████████▎| 467/500 [01:33<00:05, 5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▎| 468/500 [01:33<00:05, 5.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▎| 468/500 [01:33<00:05, 5.55it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 469/500 [01:33<00:04, 6.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 469/500 [01:33<00:04, 6.35it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 470/500 [01:33<00:05, 5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 470/500 [01:33<00:05, 5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 471/500 [01:33<00:05, 5.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 471/500 [01:33<00:05, 5.64it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 472/500 [01:33<00:04, 6.49it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 94%|█████████▍| 472/500 [01:34<00:04, 6.49it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▍| 473/500 [01:34<00:04, 5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▍| 473/500 [01:34<00:04, 5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▍| 474/500 [01:34<00:04, 5.63it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▍| 474/500 [01:34<00:04, 5.63it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▌| 475/500 [01:34<00:03, 6.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▌| 475/500 [01:34<00:03, 6.43it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▌| 476/500 [01:34<00:03, 7.12it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▌| 476/500 [01:34<00:03, 7.12it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▌| 477/500 [01:34<00:03, 6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 95%|█████████▌| 477/500 [01:34<00:03, 6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 478/500 [01:34<00:03, 7.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 478/500 [01:34<00:03, 7.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 479/500 [01:34<00:03, 6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 479/500 [01:35<00:03, 6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 480/500 [01:35<00:03, 5.91it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 480/500 [01:35<00:03, 5.91it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 481/500 [01:35<00:03, 5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▌| 481/500 [01:35<00:03, 5.59it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▋| 482/500 [01:35<00:03, 5.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 96%|█████████▋| 482/500 [01:35<00:03, 5.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 483/500 [01:35<00:03, 5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 483/500 [01:35<00:03, 5.26it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 484/500 [01:35<00:02, 6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 484/500 [01:36<00:02, 6.08it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 485/500 [01:36<00:02, 5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 485/500 [01:36<00:02, 5.76it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 486/500 [01:36<00:02, 5.52it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 486/500 [01:36<00:02, 5.52it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 487/500 [01:36<00:02, 6.32it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 97%|█████████▋| 487/500 [01:36<00:02, 6.32it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 488/500 [01:36<00:02, 5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 488/500 [01:36<00:02, 5.87it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 489/500 [01:36<00:01, 5.60it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 489/500 [01:36<00:01, 5.60it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 490/500 [01:36<00:01, 6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 490/500 [01:37<00:01, 6.38it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 491/500 [01:37<00:01, 5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 491/500 [01:37<00:01, 5.95it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 492/500 [01:37<00:01, 5.62it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 98%|█████████▊| 492/500 [01:37<00:01, 5.62it/s]
[TorchDR] TSNE: Loss: 1.20e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▊| 493/500 [01:37<00:01, 5.43it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▊| 493/500 [01:37<00:01, 5.43it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 494/500 [01:37<00:00, 6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 494/500 [01:37<00:00, 6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 495/500 [01:37<00:00, 5.85it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 495/500 [01:37<00:00, 5.85it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 496/500 [01:37<00:00, 6.62it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 496/500 [01:38<00:00, 6.62it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 497/500 [01:38<00:00, 6.09it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 99%|█████████▉| 497/500 [01:38<00:00, 6.09it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 498/500 [01:38<00:00, 6.84it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 498/500 [01:38<00:00, 6.84it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 499/500 [01:38<00:00, 6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|█████████▉| 499/500 [01:38<00:00, 6.22it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|██████████| 500/500 [01:38<00:00, 5.75it/s]
[TorchDR] TSNE: Loss: 1.19e+01 | Grad norm: 3.23e-03 | LR: 5.00e+01: 100%|██████████| 500/500 [01:38<00:00, 5.07it/s]
[TorchDR] COSNE: Starting on device 'None' with backend 'None'.
[TorchDR] COSNE: [Step 1/2] --- Computing the input affinity matrix with EntropicAffinity ---
[TorchDR] COSNE: [Step 2/2] --- Optimizing the embedding ---
0%| | 0/500 [00:00<?, ?it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 3.46e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 0%| | 0/500 [00:00<?, ?it/s]
[TorchDR] COSNE: Loss: 3.46e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 0%| | 1/500 [00:00<04:08, 2.01it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 3.30e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 0%| | 1/500 [00:00<04:08, 2.01it/s]
[TorchDR] COSNE: Loss: 3.30e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 0%| | 2/500 [00:00<04:09, 2.00it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 3.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 0%| | 2/500 [00:01<04:09, 2.00it/s]
[TorchDR] COSNE: Loss: 3.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 3/500 [00:01<03:45, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 3.01e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 3/500 [00:01<03:45, 2.20it/s]
[TorchDR] COSNE: Loss: 3.01e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 4/500 [00:01<03:15, 2.54it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 2.85e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 4/500 [00:02<03:15, 2.54it/s]
[TorchDR] COSNE: Loss: 2.85e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 5/500 [00:02<03:15, 2.53it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 2.70e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 5/500 [00:02<03:15, 2.53it/s]
[TorchDR] COSNE: Loss: 2.70e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 6/500 [00:02<03:32, 2.32it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 2.54e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%| | 6/500 [00:03<03:32, 2.32it/s]
[TorchDR] COSNE: Loss: 2.54e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%|▏ | 7/500 [00:03<03:43, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 2.37e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 1%|▏ | 7/500 [00:03<03:43, 2.20it/s]
[TorchDR] COSNE: Loss: 2.37e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 8/500 [00:03<03:50, 2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 2.20e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 8/500 [00:04<03:50, 2.14it/s]
[TorchDR] COSNE: Loss: 2.20e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 9/500 [00:04<03:39, 2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 2.04e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 9/500 [00:04<03:39, 2.23it/s]
[TorchDR] COSNE: Loss: 2.04e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 10/500 [00:04<03:32, 2.30it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.87e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 10/500 [00:04<03:32, 2.30it/s]
[TorchDR] COSNE: Loss: 1.87e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 11/500 [00:04<03:41, 2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.72e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 11/500 [00:05<03:41, 2.21it/s]
[TorchDR] COSNE: Loss: 1.72e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 12/500 [00:05<03:34, 2.27it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.58e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 2%|▏ | 12/500 [00:05<03:34, 2.27it/s]
[TorchDR] COSNE: Loss: 1.58e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 13/500 [00:05<03:56, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.45e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 13/500 [00:06<03:56, 2.06it/s]
[TorchDR] COSNE: Loss: 1.45e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 14/500 [00:06<03:57, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.34e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 14/500 [00:07<03:57, 2.04it/s]
[TorchDR] COSNE: Loss: 1.34e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 15/500 [00:07<04:14, 1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 15/500 [00:07<04:14, 1.91it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 16/500 [00:07<04:09, 1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 16/500 [00:08<04:09, 1.94it/s]
[TorchDR] COSNE: Loss: 1.15e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 17/500 [00:08<04:06, 1.96it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.09e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 3%|▎ | 17/500 [00:08<04:06, 1.96it/s]
[TorchDR] COSNE: Loss: 1.09e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▎ | 18/500 [00:08<03:50, 2.09it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.03e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▎ | 18/500 [00:08<03:50, 2.09it/s]
[TorchDR] COSNE: Loss: 1.03e+01 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 19/500 [00:08<03:38, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 9.84e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 19/500 [00:09<03:38, 2.20it/s]
[TorchDR] COSNE: Loss: 9.84e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 20/500 [00:09<03:30, 2.28it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 9.46e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 20/500 [00:09<03:30, 2.28it/s]
[TorchDR] COSNE: Loss: 9.46e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 21/500 [00:09<03:24, 2.35it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 9.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 21/500 [00:09<03:24, 2.35it/s]
[TorchDR] COSNE: Loss: 9.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 22/500 [00:09<03:19, 2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 8.89e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 4%|▍ | 22/500 [00:10<03:19, 2.39it/s]
[TorchDR] COSNE: Loss: 8.89e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▍ | 23/500 [00:10<03:02, 2.61it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 8.67e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▍ | 23/500 [00:10<03:02, 2.61it/s]
[TorchDR] COSNE: Loss: 8.67e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▍ | 24/500 [00:10<03:18, 2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 8.50e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▍ | 24/500 [00:11<03:18, 2.39it/s]
[TorchDR] COSNE: Loss: 8.50e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▌ | 25/500 [00:11<03:30, 2.25it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 8.35e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▌ | 25/500 [00:11<03:30, 2.25it/s]
[TorchDR] COSNE: Loss: 8.35e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▌ | 26/500 [00:11<03:23, 2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 8.22e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▌ | 26/500 [00:12<03:23, 2.33it/s]
[TorchDR] COSNE: Loss: 8.22e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▌ | 27/500 [00:12<03:05, 2.55it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 8.07e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 5%|▌ | 27/500 [00:12<03:05, 2.55it/s]
[TorchDR] COSNE: Loss: 8.07e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 28/500 [00:12<03:05, 2.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 7.92e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 28/500 [00:12<03:05, 2.54it/s]
[TorchDR] COSNE: Loss: 7.92e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 29/500 [00:12<03:06, 2.53it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 7.76e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 29/500 [00:13<03:06, 2.53it/s]
[TorchDR] COSNE: Loss: 7.76e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 30/500 [00:13<03:06, 2.51it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 7.59e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 30/500 [00:13<03:06, 2.51it/s]
[TorchDR] COSNE: Loss: 7.59e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 31/500 [00:13<03:20, 2.34it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 7.42e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▌ | 31/500 [00:14<03:20, 2.34it/s]
[TorchDR] COSNE: Loss: 7.42e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▋ | 32/500 [00:14<03:16, 2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 7.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 6%|▋ | 32/500 [00:14<03:16, 2.39it/s]
[TorchDR] COSNE: Loss: 7.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 33/500 [00:14<03:12, 2.42it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 7.13e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 33/500 [00:14<03:12, 2.42it/s]
[TorchDR] COSNE: Loss: 7.13e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 34/500 [00:14<02:43, 2.85it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 7.01e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 34/500 [00:15<02:43, 2.85it/s]
[TorchDR] COSNE: Loss: 7.01e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 35/500 [00:15<02:49, 2.75it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.91e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 35/500 [00:15<02:49, 2.75it/s]
[TorchDR] COSNE: Loss: 6.91e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 36/500 [00:15<02:40, 2.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.82e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 36/500 [00:15<02:40, 2.89it/s]
[TorchDR] COSNE: Loss: 6.82e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 37/500 [00:15<03:01, 2.56it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 6.75e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 7%|▋ | 37/500 [00:16<03:01, 2.56it/s]
[TorchDR] COSNE: Loss: 6.75e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 38/500 [00:16<03:02, 2.53it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.69e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 38/500 [00:16<03:02, 2.53it/s]
[TorchDR] COSNE: Loss: 6.69e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 39/500 [00:16<03:16, 2.35it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 6.63e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 39/500 [00:17<03:16, 2.35it/s]
[TorchDR] COSNE: Loss: 6.63e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 40/500 [00:17<02:59, 2.56it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.57e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 40/500 [00:17<02:59, 2.56it/s]
[TorchDR] COSNE: Loss: 6.57e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 41/500 [00:17<03:27, 2.21it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.52e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 41/500 [00:18<03:27, 2.21it/s]
[TorchDR] COSNE: Loss: 6.52e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 42/500 [00:18<03:19, 2.29it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.47e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 8%|▊ | 42/500 [00:18<03:19, 2.29it/s]
[TorchDR] COSNE: Loss: 6.47e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▊ | 43/500 [00:18<03:27, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 6.41e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▊ | 43/500 [00:18<03:27, 2.20it/s]
[TorchDR] COSNE: Loss: 6.41e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 44/500 [00:18<03:19, 2.29it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.36e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 44/500 [00:19<03:19, 2.29it/s]
[TorchDR] COSNE: Loss: 6.36e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 45/500 [00:19<03:00, 2.52it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 6.32e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 45/500 [00:19<03:00, 2.52it/s]
[TorchDR] COSNE: Loss: 6.32e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 46/500 [00:19<03:00, 2.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 6.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 46/500 [00:20<03:00, 2.51it/s]
[TorchDR] COSNE: Loss: 6.27e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 47/500 [00:20<03:00, 2.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.23e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 9%|▉ | 47/500 [00:20<03:00, 2.51it/s]
[TorchDR] COSNE: Loss: 6.23e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 10%|▉ | 48/500 [00:20<02:47, 2.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 6.18e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 10%|▉ | 48/500 [00:20<02:47, 2.70it/s]
[TorchDR] COSNE: Loss: 6.18e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 10%|▉ | 49/500 [00:20<03:03, 2.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 6.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 10%|▉ | 49/500 [00:21<03:03, 2.46it/s]
[TorchDR] COSNE: Loss: 6.15e+00 | Grad norm: 2.15e+00 | LR: 1.00e-01: 10%|█ | 50/500 [00:21<02:48, 2.66it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.11e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 10%|█ | 50/500 [00:21<02:48, 2.66it/s]
[TorchDR] COSNE: Loss: 6.11e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 10%|█ | 51/500 [00:21<02:38, 2.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.08e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 10%|█ | 51/500 [00:21<02:38, 2.83it/s]
[TorchDR] COSNE: Loss: 6.08e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 10%|█ | 52/500 [00:21<02:44, 2.72it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.05e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 10%|█ | 52/500 [00:22<02:44, 2.72it/s]
[TorchDR] COSNE: Loss: 6.05e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 53/500 [00:22<02:48, 2.65it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.03e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 53/500 [00:22<02:48, 2.65it/s]
[TorchDR] COSNE: Loss: 6.03e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 54/500 [00:22<02:38, 2.82it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 6.01e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 54/500 [00:22<02:38, 2.82it/s]
[TorchDR] COSNE: Loss: 6.01e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 55/500 [00:22<02:43, 2.72it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.98e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 55/500 [00:23<02:43, 2.72it/s]
[TorchDR] COSNE: Loss: 5.98e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 56/500 [00:23<02:47, 2.65it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.96e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█ | 56/500 [00:23<02:47, 2.65it/s]
[TorchDR] COSNE: Loss: 5.96e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█▏ | 57/500 [00:23<02:49, 2.61it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.93e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 11%|█▏ | 57/500 [00:24<02:49, 2.61it/s]
[TorchDR] COSNE: Loss: 5.93e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 58/500 [00:24<02:51, 2.57it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.91e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 58/500 [00:24<02:51, 2.57it/s]
[TorchDR] COSNE: Loss: 5.91e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 59/500 [00:24<02:53, 2.54it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.89e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 59/500 [00:25<02:53, 2.54it/s]
[TorchDR] COSNE: Loss: 5.89e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 60/500 [00:25<03:06, 2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.87e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 60/500 [00:25<03:06, 2.36it/s]
[TorchDR] COSNE: Loss: 5.87e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 61/500 [00:25<03:15, 2.24it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.85e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 61/500 [00:25<03:15, 2.24it/s]
[TorchDR] COSNE: Loss: 5.85e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 62/500 [00:25<02:57, 2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.84e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 12%|█▏ | 62/500 [00:26<02:57, 2.47it/s]
[TorchDR] COSNE: Loss: 5.84e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 63/500 [00:26<02:55, 2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.83e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 63/500 [00:26<02:55, 2.49it/s]
[TorchDR] COSNE: Loss: 5.83e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 64/500 [00:26<02:54, 2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.82e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 64/500 [00:27<02:54, 2.49it/s]
[TorchDR] COSNE: Loss: 5.82e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 65/500 [00:27<02:54, 2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.80e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 65/500 [00:27<02:54, 2.49it/s]
[TorchDR] COSNE: Loss: 5.80e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 66/500 [00:27<03:06, 2.32it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.79e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 66/500 [00:28<03:06, 2.32it/s]
[TorchDR] COSNE: Loss: 5.79e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 67/500 [00:28<03:02, 2.37it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.78e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 13%|█▎ | 67/500 [00:28<03:02, 2.37it/s]
[TorchDR] COSNE: Loss: 5.78e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▎ | 68/500 [00:28<03:12, 2.25it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.77e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▎ | 68/500 [00:28<03:12, 2.25it/s]
[TorchDR] COSNE: Loss: 5.77e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 69/500 [00:28<02:53, 2.48it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.76e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 69/500 [00:29<02:53, 2.48it/s]
[TorchDR] COSNE: Loss: 5.76e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 70/500 [00:29<02:52, 2.49it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.75e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 70/500 [00:29<02:52, 2.49it/s]
[TorchDR] COSNE: Loss: 5.75e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 71/500 [00:29<03:04, 2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.74e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 71/500 [00:30<03:04, 2.33it/s]
[TorchDR] COSNE: Loss: 5.74e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 72/500 [00:30<02:47, 2.55it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 14%|█▍ | 72/500 [00:30<02:47, 2.55it/s]
[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▍ | 73/500 [00:30<03:13, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▍ | 73/500 [00:31<03:13, 2.20it/s]
[TorchDR] COSNE: Loss: 5.73e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▍ | 74/500 [00:31<03:31, 2.01it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.72e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▍ | 74/500 [00:31<03:31, 2.01it/s]
[TorchDR] COSNE: Loss: 5.72e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▌ | 75/500 [00:31<03:44, 1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▌ | 75/500 [00:32<03:44, 1.89it/s]
[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▌ | 76/500 [00:32<03:15, 2.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▌ | 76/500 [00:32<03:15, 2.17it/s]
[TorchDR] COSNE: Loss: 5.71e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▌ | 77/500 [00:32<03:06, 2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 15%|█▌ | 77/500 [00:32<03:06, 2.27it/s]
[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 78/500 [00:32<03:01, 2.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 78/500 [00:33<03:01, 2.32it/s]
[TorchDR] COSNE: Loss: 5.70e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 79/500 [00:33<03:09, 2.22it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 79/500 [00:33<03:09, 2.22it/s]
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 80/500 [00:33<03:15, 2.15it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 80/500 [00:34<03:15, 2.15it/s]
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 81/500 [00:34<03:07, 2.24it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▌ | 81/500 [00:34<03:07, 2.24it/s]
[TorchDR] COSNE: Loss: 5.69e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▋ | 82/500 [00:34<03:25, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 16%|█▋ | 82/500 [00:35<03:25, 2.04it/s]
[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 83/500 [00:35<03:26, 2.02it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 83/500 [00:35<03:26, 2.02it/s]
[TorchDR] COSNE: Loss: 5.68e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 84/500 [00:35<03:14, 2.14it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 84/500 [00:36<03:14, 2.14it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 85/500 [00:36<03:17, 2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 85/500 [00:36<03:17, 2.10it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 86/500 [00:36<03:07, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 86/500 [00:37<03:07, 2.20it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 87/500 [00:37<03:24, 2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 17%|█▋ | 87/500 [00:37<03:24, 2.02it/s]
[TorchDR] COSNE: Loss: 5.67e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 88/500 [00:37<03:13, 2.13it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 88/500 [00:38<03:13, 2.13it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 89/500 [00:38<03:16, 2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 89/500 [00:38<03:16, 2.10it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 90/500 [00:38<02:54, 2.35it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 90/500 [00:39<02:54, 2.35it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 91/500 [00:39<03:03, 2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 91/500 [00:39<03:03, 2.23it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 92/500 [00:39<03:20, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 18%|█▊ | 92/500 [00:40<03:20, 2.03it/s]
[TorchDR] COSNE: Loss: 5.66e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▊ | 93/500 [00:40<03:21, 2.02it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▊ | 93/500 [00:40<03:21, 2.02it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 94/500 [00:40<03:21, 2.01it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 94/500 [00:41<03:21, 2.01it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 95/500 [00:41<03:20, 2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 95/500 [00:41<03:20, 2.02it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 96/500 [00:41<03:08, 2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 96/500 [00:41<03:08, 2.14it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 97/500 [00:41<02:48, 2.39it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 19%|█▉ | 97/500 [00:42<02:48, 2.39it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 20%|█▉ | 98/500 [00:42<02:46, 2.42it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 20%|█▉ | 98/500 [00:42<02:46, 2.42it/s]
[TorchDR] COSNE: Loss: 5.65e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 20%|█▉ | 99/500 [00:42<02:43, 2.45it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 20%|█▉ | 99/500 [00:42<02:43, 2.45it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 5.03e-01 | LR: 1.00e-01: 20%|██ | 100/500 [00:42<02:30, 2.66it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 20%|██ | 100/500 [00:43<02:30, 2.66it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 20%|██ | 101/500 [00:43<02:32, 2.61it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 20%|██ | 101/500 [00:43<02:32, 2.61it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 20%|██ | 102/500 [00:43<02:34, 2.58it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 20%|██ | 102/500 [00:44<02:34, 2.58it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 103/500 [00:44<02:35, 2.55it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 103/500 [00:44<02:35, 2.55it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 104/500 [00:44<02:47, 2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 104/500 [00:45<02:47, 2.36it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 105/500 [00:45<03:08, 2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 105/500 [00:45<03:08, 2.10it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 106/500 [00:45<02:46, 2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██ | 106/500 [00:45<02:46, 2.36it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██▏ | 107/500 [00:45<02:43, 2.40it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 21%|██▏ | 107/500 [00:46<02:43, 2.40it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 108/500 [00:46<02:41, 2.43it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 108/500 [00:46<02:41, 2.43it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 109/500 [00:46<02:39, 2.45it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 109/500 [00:47<02:39, 2.45it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 110/500 [00:47<02:38, 2.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 110/500 [00:47<02:38, 2.46it/s]
[TorchDR] COSNE: Loss: 5.64e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 111/500 [00:47<02:37, 2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 111/500 [00:47<02:37, 2.47it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 112/500 [00:47<02:47, 2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 22%|██▏ | 112/500 [00:48<02:47, 2.31it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 113/500 [00:48<02:44, 2.35it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 113/500 [00:48<02:44, 2.35it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 114/500 [00:48<02:40, 2.40it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 114/500 [00:49<02:40, 2.40it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 115/500 [00:49<02:49, 2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 115/500 [00:49<02:49, 2.27it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 116/500 [00:49<02:33, 2.51it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 116/500 [00:49<02:33, 2.51it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 117/500 [00:49<02:32, 2.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 23%|██▎ | 117/500 [00:50<02:32, 2.51it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▎ | 118/500 [00:50<02:43, 2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▎ | 118/500 [00:50<02:43, 2.33it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 119/500 [00:50<02:18, 2.75it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 119/500 [00:51<02:18, 2.75it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 120/500 [00:51<02:21, 2.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 120/500 [00:51<02:21, 2.68it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 121/500 [00:51<02:35, 2.44it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 121/500 [00:52<02:35, 2.44it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 122/500 [00:52<02:34, 2.45it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 24%|██▍ | 122/500 [00:52<02:34, 2.45it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▍ | 123/500 [00:52<02:54, 2.15it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▍ | 123/500 [00:53<02:54, 2.15it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▍ | 124/500 [00:53<02:58, 2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▍ | 124/500 [00:53<02:58, 2.10it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▌ | 125/500 [00:53<02:50, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▌ | 125/500 [00:53<02:50, 2.20it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▌ | 126/500 [00:53<02:43, 2.29it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▌ | 126/500 [00:54<02:43, 2.29it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▌ | 127/500 [00:54<02:38, 2.35it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 25%|██▌ | 127/500 [00:54<02:38, 2.35it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 128/500 [00:54<02:24, 2.57it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 128/500 [00:54<02:24, 2.57it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 129/500 [00:54<02:14, 2.77it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 129/500 [00:55<02:14, 2.77it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 130/500 [00:55<02:07, 2.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 130/500 [00:55<02:07, 2.91it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 131/500 [00:55<02:12, 2.78it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▌ | 131/500 [00:56<02:12, 2.78it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▋ | 132/500 [00:56<02:17, 2.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 26%|██▋ | 132/500 [00:56<02:17, 2.68it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 133/500 [00:56<02:19, 2.63it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 133/500 [00:56<02:19, 2.63it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 134/500 [00:56<02:10, 2.81it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 134/500 [00:57<02:10, 2.81it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 135/500 [00:57<02:26, 2.50it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 135/500 [00:57<02:26, 2.50it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 136/500 [00:57<02:36, 2.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 136/500 [00:58<02:36, 2.32it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 137/500 [00:58<02:43, 2.22it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 27%|██▋ | 137/500 [00:58<02:43, 2.22it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 138/500 [00:58<02:48, 2.15it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 138/500 [00:59<02:48, 2.15it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 139/500 [00:59<02:40, 2.25it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 139/500 [00:59<02:40, 2.25it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 140/500 [00:59<02:25, 2.48it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 140/500 [00:59<02:25, 2.48it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 141/500 [00:59<02:24, 2.49it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 141/500 [01:00<02:24, 2.49it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 142/500 [01:00<02:33, 2.32it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 28%|██▊ | 142/500 [01:00<02:33, 2.32it/s]
[TorchDR] COSNE: Loss: 5.63e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▊ | 143/500 [01:00<02:30, 2.38it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▊ | 143/500 [01:01<02:30, 2.38it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 144/500 [01:01<02:27, 2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 144/500 [01:01<02:27, 2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 145/500 [01:01<02:05, 2.83it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 145/500 [01:01<02:05, 2.83it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 146/500 [01:01<02:19, 2.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 146/500 [01:02<02:19, 2.54it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 147/500 [01:02<02:09, 2.72it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 29%|██▉ | 147/500 [01:02<02:09, 2.72it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 30%|██▉ | 148/500 [01:02<02:12, 2.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 30%|██▉ | 148/500 [01:02<02:12, 2.65it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 30%|██▉ | 149/500 [01:02<02:14, 2.60it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 30%|██▉ | 149/500 [01:03<02:14, 2.60it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.65e-02 | LR: 1.00e-01: 30%|███ | 150/500 [01:03<02:26, 2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 30%|███ | 150/500 [01:03<02:26, 2.39it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 30%|███ | 151/500 [01:03<02:23, 2.43it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 30%|███ | 151/500 [01:04<02:23, 2.43it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 30%|███ | 152/500 [01:04<02:22, 2.45it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 30%|███ | 152/500 [01:04<02:22, 2.45it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 153/500 [01:04<02:21, 2.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 153/500 [01:05<02:21, 2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 154/500 [01:05<02:20, 2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 154/500 [01:05<02:20, 2.47it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 155/500 [01:05<02:29, 2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 155/500 [01:05<02:29, 2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 156/500 [01:05<02:25, 2.36it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███ | 156/500 [01:06<02:25, 2.36it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███▏ | 157/500 [01:06<02:22, 2.40it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 31%|███▏ | 157/500 [01:06<02:22, 2.40it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 158/500 [01:06<02:20, 2.43it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 158/500 [01:07<02:20, 2.43it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 159/500 [01:07<02:19, 2.45it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 159/500 [01:07<02:19, 2.45it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 160/500 [01:07<02:17, 2.47it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 160/500 [01:07<02:17, 2.47it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 161/500 [01:07<02:26, 2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 161/500 [01:08<02:26, 2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 162/500 [01:08<02:23, 2.35it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 32%|███▏ | 162/500 [01:08<02:23, 2.35it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 163/500 [01:08<02:30, 2.24it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 163/500 [01:09<02:30, 2.24it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 164/500 [01:09<02:35, 2.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 164/500 [01:09<02:35, 2.17it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 165/500 [01:09<02:19, 2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 165/500 [01:10<02:19, 2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 166/500 [01:10<02:36, 2.13it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 166/500 [01:10<02:36, 2.13it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 167/500 [01:10<02:39, 2.09it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 33%|███▎ | 167/500 [01:11<02:39, 2.09it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▎ | 168/500 [01:11<02:40, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▎ | 168/500 [01:11<02:40, 2.06it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 169/500 [01:11<02:42, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 169/500 [01:12<02:42, 2.04it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 170/500 [01:12<02:52, 1.91it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 170/500 [01:12<02:52, 1.91it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 171/500 [01:12<02:50, 1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 171/500 [01:13<02:50, 1.94it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 172/500 [01:13<02:37, 2.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 34%|███▍ | 172/500 [01:13<02:37, 2.08it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▍ | 173/500 [01:13<02:29, 2.19it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▍ | 173/500 [01:14<02:29, 2.19it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▍ | 174/500 [01:14<02:23, 2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▍ | 174/500 [01:14<02:23, 2.27it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▌ | 175/500 [01:14<02:18, 2.34it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▌ | 175/500 [01:14<02:18, 2.34it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▌ | 176/500 [01:14<02:15, 2.39it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▌ | 176/500 [01:15<02:15, 2.39it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▌ | 177/500 [01:15<02:13, 2.42it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 35%|███▌ | 177/500 [01:15<02:13, 2.42it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 178/500 [01:15<02:12, 2.44it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 178/500 [01:16<02:12, 2.44it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 179/500 [01:16<02:10, 2.46it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 179/500 [01:16<02:10, 2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 180/500 [01:16<02:09, 2.47it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 180/500 [01:16<02:09, 2.47it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 181/500 [01:16<01:59, 2.67it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▌ | 181/500 [01:17<01:59, 2.67it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▋ | 182/500 [01:17<02:01, 2.63it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 36%|███▋ | 182/500 [01:17<02:01, 2.63it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 183/500 [01:17<01:53, 2.79it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 183/500 [01:17<01:53, 2.79it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 184/500 [01:17<01:56, 2.71it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 184/500 [01:18<01:56, 2.71it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 185/500 [01:18<01:59, 2.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 185/500 [01:18<01:59, 2.65it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 186/500 [01:18<02:00, 2.60it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 186/500 [01:19<02:00, 2.60it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 187/500 [01:19<02:02, 2.56it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 37%|███▋ | 187/500 [01:19<02:02, 2.56it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 188/500 [01:19<01:53, 2.76it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 188/500 [01:19<01:53, 2.76it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 189/500 [01:19<01:47, 2.89it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 189/500 [01:20<01:47, 2.89it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 190/500 [01:20<01:51, 2.78it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 190/500 [01:20<01:51, 2.78it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 191/500 [01:20<01:55, 2.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 191/500 [01:20<01:55, 2.68it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 192/500 [01:20<01:57, 2.62it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 38%|███▊ | 192/500 [01:21<01:57, 2.62it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▊ | 193/500 [01:21<01:58, 2.59it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▊ | 193/500 [01:21<01:58, 2.59it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 194/500 [01:21<01:59, 2.56it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 194/500 [01:22<01:59, 2.56it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 195/500 [01:22<02:00, 2.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 195/500 [01:22<02:00, 2.54it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 196/500 [01:22<02:00, 2.53it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 196/500 [01:22<02:00, 2.53it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 197/500 [01:22<01:51, 2.72it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 39%|███▉ | 197/500 [01:23<01:51, 2.72it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 40%|███▉ | 198/500 [01:23<01:53, 2.66it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 40%|███▉ | 198/500 [01:23<01:53, 2.66it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 40%|███▉ | 199/500 [01:23<02:04, 2.42it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 40%|███▉ | 199/500 [01:23<02:04, 2.42it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 7.24e-03 | LR: 1.00e-01: 40%|████ | 200/500 [01:23<01:53, 2.64it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 40%|████ | 200/500 [01:24<01:53, 2.64it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 40%|████ | 201/500 [01:24<02:04, 2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 40%|████ | 201/500 [01:24<02:04, 2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 40%|████ | 202/500 [01:24<02:02, 2.43it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 40%|████ | 202/500 [01:25<02:02, 2.43it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 203/500 [01:25<02:01, 2.45it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 203/500 [01:25<02:01, 2.45it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 204/500 [01:25<02:00, 2.46it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 204/500 [01:26<02:00, 2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 205/500 [01:26<02:25, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 205/500 [01:26<02:25, 2.03it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 206/500 [01:26<02:17, 2.15it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████ | 206/500 [01:27<02:17, 2.15it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████▏ | 207/500 [01:27<02:10, 2.24it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 41%|████▏ | 207/500 [01:27<02:10, 2.24it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 208/500 [01:27<02:06, 2.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 208/500 [01:28<02:06, 2.32it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 209/500 [01:28<02:11, 2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 209/500 [01:28<02:11, 2.21it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 210/500 [01:28<01:58, 2.46it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 210/500 [01:28<01:58, 2.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 211/500 [01:28<01:56, 2.48it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 211/500 [01:29<01:56, 2.48it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 212/500 [01:29<02:04, 2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 42%|████▏ | 212/500 [01:29<02:04, 2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 213/500 [01:29<02:01, 2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 213/500 [01:30<02:01, 2.36it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 214/500 [01:30<02:07, 2.24it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 214/500 [01:30<02:07, 2.24it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 215/500 [01:30<01:54, 2.48it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 215/500 [01:31<01:54, 2.48it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 216/500 [01:31<02:02, 2.31it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 216/500 [01:31<02:02, 2.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 217/500 [01:31<01:59, 2.37it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 43%|████▎ | 217/500 [01:31<01:59, 2.37it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▎ | 218/500 [01:31<01:57, 2.41it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▎ | 218/500 [01:32<01:57, 2.41it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 219/500 [01:32<02:04, 2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 219/500 [01:32<02:04, 2.26it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 220/500 [01:32<02:16, 2.05it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 220/500 [01:33<02:16, 2.05it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 221/500 [01:33<02:09, 2.16it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 221/500 [01:33<02:09, 2.16it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 222/500 [01:33<02:03, 2.25it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 44%|████▍ | 222/500 [01:34<02:03, 2.25it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▍ | 223/500 [01:34<02:07, 2.17it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▍ | 223/500 [01:34<02:07, 2.17it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▍ | 224/500 [01:34<02:02, 2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▍ | 224/500 [01:35<02:02, 2.26it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▌ | 225/500 [01:35<02:06, 2.18it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▌ | 225/500 [01:35<02:06, 2.18it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▌ | 226/500 [01:35<02:00, 2.27it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▌ | 226/500 [01:35<02:00, 2.27it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▌ | 227/500 [01:35<02:05, 2.18it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 45%|████▌ | 227/500 [01:36<02:05, 2.18it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 228/500 [01:36<02:00, 2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 228/500 [01:36<02:00, 2.26it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 229/500 [01:36<01:56, 2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 229/500 [01:37<01:56, 2.33it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 230/500 [01:37<01:45, 2.56it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 230/500 [01:37<01:45, 2.56it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 231/500 [01:37<01:53, 2.37it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▌ | 231/500 [01:37<01:53, 2.37it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▋ | 232/500 [01:37<01:43, 2.58it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 46%|████▋ | 232/500 [01:38<01:43, 2.58it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 233/500 [01:38<01:52, 2.38it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 233/500 [01:39<01:52, 2.38it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 234/500 [01:39<02:06, 2.11it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 234/500 [01:39<02:06, 2.11it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 235/500 [01:39<02:07, 2.07it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 235/500 [01:40<02:07, 2.07it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 236/500 [01:40<02:08, 2.05it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 236/500 [01:40<02:08, 2.05it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 237/500 [01:40<02:17, 1.92it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 47%|████▋ | 237/500 [01:41<02:17, 1.92it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 238/500 [01:41<02:22, 1.84it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 238/500 [01:41<02:22, 1.84it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 239/500 [01:41<02:18, 1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 239/500 [01:42<02:18, 1.88it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 240/500 [01:42<02:39, 1.63it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 240/500 [01:43<02:39, 1.63it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 241/500 [01:43<02:29, 1.73it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 241/500 [01:43<02:29, 1.73it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 242/500 [01:43<02:15, 1.90it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 48%|████▊ | 242/500 [01:43<02:15, 1.90it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▊ | 243/500 [01:43<02:05, 2.05it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▊ | 243/500 [01:44<02:05, 2.05it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 244/500 [01:44<02:20, 1.82it/s][TorchDR] CauchyAffinity: Computed in 0.11s.
[TorchDR] CauchyAffinity: Computed in 0.89s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 244/500 [01:46<02:20, 1.82it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 245/500 [01:46<03:33, 1.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 245/500 [01:46<03:33, 1.20it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 246/500 [01:46<03:14, 1.31it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 246/500 [01:47<03:14, 1.31it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 247/500 [01:47<02:53, 1.46it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 49%|████▉ | 247/500 [01:47<02:53, 1.46it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 50%|████▉ | 248/500 [01:47<02:31, 1.67it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 50%|████▉ | 248/500 [01:47<02:31, 1.67it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 50%|████▉ | 249/500 [01:47<02:15, 1.85it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 50%|████▉ | 249/500 [01:48<02:15, 1.85it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 4.99e-03 | LR: 1.00e-01: 50%|█████ | 250/500 [01:48<02:12, 1.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 1.52e-03 | LR: 1.00e-01: 50%|█████ | 250/500 [01:49<02:12, 1.89it/s]
[TorchDR] COSNE: Loss: 5.62e+00 | Grad norm: 1.52e-03 | LR: 1.00e-01: 50%|█████ | 251/500 [01:49<02:46, 1.50it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.29e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 50%|█████ | 251/500 [01:49<02:46, 1.50it/s]
[TorchDR] COSNE: Loss: 1.29e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 50%|█████ | 252/500 [01:49<02:33, 1.62it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.28e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 50%|█████ | 252/500 [01:50<02:33, 1.62it/s]
[TorchDR] COSNE: Loss: 1.28e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 253/500 [01:50<02:31, 1.63it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.27e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 253/500 [01:50<02:31, 1.63it/s]
[TorchDR] COSNE: Loss: 1.27e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 254/500 [01:50<02:22, 1.73it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.25e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 254/500 [01:51<02:22, 1.73it/s]
[TorchDR] COSNE: Loss: 1.25e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 255/500 [01:51<02:09, 1.90it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 255/500 [01:51<02:09, 1.90it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 256/500 [01:51<02:06, 1.93it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████ | 256/500 [01:52<02:06, 1.93it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████▏ | 257/500 [01:52<02:04, 1.95it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 51%|█████▏ | 257/500 [01:52<02:04, 1.95it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 258/500 [01:52<02:02, 1.97it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 258/500 [01:53<02:02, 1.97it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 259/500 [01:53<02:01, 1.98it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.11s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 259/500 [01:53<02:01, 1.98it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 260/500 [01:53<02:01, 1.98it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 260/500 [01:54<02:01, 1.98it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 261/500 [01:54<02:00, 1.99it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 261/500 [01:54<02:00, 1.99it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 262/500 [01:54<01:59, 2.00it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 52%|█████▏ | 262/500 [01:55<01:59, 2.00it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 263/500 [01:55<01:51, 2.12it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 263/500 [01:55<01:51, 2.12it/s]
[TorchDR] COSNE: Loss: 1.24e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 264/500 [01:55<02:00, 1.96it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 264/500 [01:56<02:00, 1.96it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 265/500 [01:56<01:52, 2.09it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 265/500 [01:56<01:52, 2.09it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 266/500 [01:56<02:00, 1.95it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 266/500 [01:57<02:00, 1.95it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 267/500 [01:57<01:58, 1.96it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 53%|█████▎ | 267/500 [01:57<01:58, 1.96it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▎ | 268/500 [01:57<01:57, 1.98it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▎ | 268/500 [01:58<01:57, 1.98it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 269/500 [01:58<01:49, 2.11it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 269/500 [01:58<01:49, 2.11it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 270/500 [01:58<01:44, 2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 270/500 [01:59<01:44, 2.21it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 271/500 [01:59<01:47, 2.14it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 271/500 [01:59<01:47, 2.14it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 272/500 [01:59<01:48, 2.09it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 54%|█████▍ | 272/500 [02:00<01:48, 2.09it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▍ | 273/500 [02:00<01:56, 1.95it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▍ | 273/500 [02:00<01:56, 1.95it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▍ | 274/500 [02:00<01:48, 2.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▍ | 274/500 [02:01<01:48, 2.08it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▌ | 275/500 [02:01<01:49, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▌ | 275/500 [02:01<01:49, 2.06it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▌ | 276/500 [02:01<01:49, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▌ | 276/500 [02:02<01:49, 2.04it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▌ | 277/500 [02:02<01:43, 2.16it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 55%|█████▌ | 277/500 [02:02<01:43, 2.16it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 278/500 [02:02<01:38, 2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 278/500 [02:02<01:38, 2.26it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 279/500 [02:02<01:41, 2.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 279/500 [02:03<01:41, 2.17it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 280/500 [02:03<01:43, 2.12it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 280/500 [02:04<01:43, 2.12it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 281/500 [02:04<01:45, 2.07it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▌ | 281/500 [02:04<01:45, 2.07it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▋ | 282/500 [02:04<01:45, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 56%|█████▋ | 282/500 [02:04<01:45, 2.06it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 283/500 [02:04<01:39, 2.17it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 283/500 [02:05<01:39, 2.17it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 284/500 [02:05<01:35, 2.26it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 284/500 [02:05<01:35, 2.26it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 285/500 [02:05<01:32, 2.33it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 285/500 [02:06<01:32, 2.33it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 286/500 [02:06<01:36, 2.22it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 286/500 [02:06<01:36, 2.22it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 287/500 [02:06<01:26, 2.47it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 57%|█████▋ | 287/500 [02:07<01:26, 2.47it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 288/500 [02:07<01:32, 2.29it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.11s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 288/500 [02:07<01:32, 2.29it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 289/500 [02:07<01:54, 1.84it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 289/500 [02:08<01:54, 1.84it/s]
[TorchDR] COSNE: Loss: 1.23e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 290/500 [02:08<01:51, 1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 290/500 [02:08<01:51, 1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 291/500 [02:08<01:42, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 291/500 [02:09<01:42, 2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 292/500 [02:09<01:36, 2.15it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 58%|█████▊ | 292/500 [02:09<01:36, 2.15it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▊ | 293/500 [02:09<01:32, 2.25it/s][TorchDR] CauchyAffinity: Computed in 0.29s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▊ | 293/500 [02:10<01:32, 2.25it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 294/500 [02:10<01:41, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 294/500 [02:10<01:41, 2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 295/500 [02:10<01:41, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 295/500 [02:11<01:41, 2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 296/500 [02:11<01:40, 2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 296/500 [02:11<01:40, 2.02it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 297/500 [02:11<01:40, 2.02it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 59%|█████▉ | 297/500 [02:12<01:40, 2.02it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 60%|█████▉ | 298/500 [02:12<01:34, 2.13it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 60%|█████▉ | 298/500 [02:12<01:34, 2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 60%|█████▉ | 299/500 [02:12<01:41, 1.97it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 60%|█████▉ | 299/500 [02:13<01:41, 1.97it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.52e-03 | LR: 1.00e-01: 60%|██████ | 300/500 [02:13<01:35, 2.10it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 60%|██████ | 300/500 [02:13<01:35, 2.10it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 60%|██████ | 301/500 [02:13<01:29, 2.21it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 60%|██████ | 301/500 [02:13<01:29, 2.21it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 60%|██████ | 302/500 [02:13<01:38, 2.02it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 60%|██████ | 302/500 [02:14<01:38, 2.02it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 303/500 [02:14<01:32, 2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 303/500 [02:14<01:32, 2.14it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 304/500 [02:14<01:33, 2.09it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 304/500 [02:15<01:33, 2.09it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 305/500 [02:15<01:34, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 305/500 [02:15<01:34, 2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 306/500 [02:15<01:40, 1.93it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████ | 306/500 [02:16<01:40, 1.93it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████▏ | 307/500 [02:16<01:39, 1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 61%|██████▏ | 307/500 [02:17<01:39, 1.94it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 308/500 [02:17<01:43, 1.86it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 308/500 [02:17<01:43, 1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 309/500 [02:17<01:35, 2.01it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 309/500 [02:17<01:35, 2.01it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 310/500 [02:17<01:29, 2.13it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 310/500 [02:18<01:29, 2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 311/500 [02:18<01:30, 2.10it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 311/500 [02:18<01:30, 2.10it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 312/500 [02:18<01:19, 2.36it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 62%|██████▏ | 312/500 [02:19<01:19, 2.36it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 313/500 [02:19<01:18, 2.39it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 313/500 [02:19<01:18, 2.39it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 314/500 [02:19<01:22, 2.26it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 314/500 [02:20<01:22, 2.26it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 315/500 [02:20<01:24, 2.18it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 315/500 [02:20<01:24, 2.18it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 316/500 [02:20<01:37, 1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 316/500 [02:21<01:37, 1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 317/500 [02:21<01:35, 1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 63%|██████▎ | 317/500 [02:21<01:35, 1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▎ | 318/500 [02:21<01:39, 1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▎ | 318/500 [02:22<01:39, 1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 319/500 [02:22<01:47, 1.69it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 319/500 [02:23<01:47, 1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 320/500 [02:23<01:36, 1.87it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 320/500 [02:23<01:36, 1.87it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 321/500 [02:23<01:33, 1.90it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 321/500 [02:24<01:33, 1.90it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 322/500 [02:24<01:31, 1.94it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 64%|██████▍ | 322/500 [02:24<01:31, 1.94it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▍ | 323/500 [02:24<01:25, 2.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▍ | 323/500 [02:24<01:25, 2.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▍ | 324/500 [02:24<01:20, 2.19it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▍ | 324/500 [02:25<01:20, 2.19it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▌ | 325/500 [02:25<01:22, 2.13it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▌ | 325/500 [02:25<01:22, 2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▌ | 326/500 [02:25<01:18, 2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▌ | 326/500 [02:26<01:18, 2.23it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▌ | 327/500 [02:26<01:15, 2.30it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 65%|██████▌ | 327/500 [02:26<01:15, 2.30it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 328/500 [02:26<01:18, 2.20it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 328/500 [02:27<01:18, 2.20it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 329/500 [02:27<01:19, 2.14it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 329/500 [02:27<01:19, 2.14it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 330/500 [02:27<01:16, 2.23it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 330/500 [02:27<01:16, 2.23it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 331/500 [02:27<01:18, 2.16it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▌ | 331/500 [02:28<01:18, 2.16it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▋ | 332/500 [02:28<01:19, 2.11it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 66%|██████▋ | 332/500 [02:28<01:19, 2.11it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 333/500 [02:28<01:20, 2.08it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 333/500 [02:29<01:20, 2.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 334/500 [02:29<01:15, 2.19it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 334/500 [02:29<01:15, 2.19it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 335/500 [02:29<01:17, 2.13it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 335/500 [02:30<01:17, 2.13it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 336/500 [02:30<01:18, 2.09it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 336/500 [02:30<01:18, 2.09it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 337/500 [02:30<01:19, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 67%|██████▋ | 337/500 [02:31<01:19, 2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 338/500 [02:31<01:19, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 338/500 [02:31<01:19, 2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 339/500 [02:31<01:19, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 339/500 [02:32<01:19, 2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 340/500 [02:32<01:28, 1.80it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 340/500 [02:33<01:28, 1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 341/500 [02:33<01:35, 1.67it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 341/500 [02:33<01:35, 1.67it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 342/500 [02:33<01:29, 1.76it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 68%|██████▊ | 342/500 [02:34<01:29, 1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▊ | 343/500 [02:34<01:26, 1.82it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▊ | 343/500 [02:34<01:26, 1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 344/500 [02:34<01:23, 1.87it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 344/500 [02:35<01:23, 1.87it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 345/500 [02:35<01:21, 1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 345/500 [02:35<01:21, 1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 346/500 [02:35<01:24, 1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 346/500 [02:36<01:24, 1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 347/500 [02:36<01:26, 1.78it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 69%|██████▉ | 347/500 [02:37<01:26, 1.78it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 70%|██████▉ | 348/500 [02:37<01:36, 1.58it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.29s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 70%|██████▉ | 348/500 [02:38<01:36, 1.58it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 70%|██████▉ | 349/500 [02:38<01:52, 1.34it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 70%|██████▉ | 349/500 [02:39<01:52, 1.34it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.47e-02 | LR: 1.00e-01: 70%|███████ | 350/500 [02:39<01:58, 1.27it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 70%|███████ | 350/500 [02:39<01:58, 1.27it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 70%|███████ | 351/500 [02:39<01:49, 1.37it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 70%|███████ | 351/500 [02:40<01:49, 1.37it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 70%|███████ | 352/500 [02:40<01:38, 1.51it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 70%|███████ | 352/500 [02:40<01:38, 1.51it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 353/500 [02:40<01:26, 1.71it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 353/500 [02:41<01:26, 1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 354/500 [02:41<01:26, 1.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 354/500 [02:41<01:26, 1.70it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 355/500 [02:41<01:25, 1.69it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 355/500 [02:42<01:25, 1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 356/500 [02:42<01:25, 1.68it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████ | 356/500 [02:42<01:25, 1.68it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████▏ | 357/500 [02:42<01:20, 1.77it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 71%|███████▏ | 357/500 [02:43<01:20, 1.77it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 358/500 [02:43<01:17, 1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 358/500 [02:43<01:17, 1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 359/500 [02:43<01:15, 1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 359/500 [02:44<01:15, 1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 360/500 [02:44<01:09, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 360/500 [02:45<01:09, 2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 361/500 [02:45<01:17, 1.80it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 361/500 [02:45<01:17, 1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 362/500 [02:45<01:14, 1.86it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 72%|███████▏ | 362/500 [02:46<01:14, 1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 363/500 [02:46<01:16, 1.80it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 363/500 [02:46<01:16, 1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 364/500 [02:46<01:13, 1.85it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 364/500 [02:47<01:13, 1.85it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 365/500 [02:47<01:11, 1.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 365/500 [02:47<01:11, 1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 366/500 [02:47<01:13, 1.82it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 366/500 [02:48<01:13, 1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 367/500 [02:48<01:15, 1.77it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 73%|███████▎ | 367/500 [02:49<01:15, 1.77it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▎ | 368/500 [02:49<01:15, 1.74it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▎ | 368/500 [02:49<01:15, 1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 369/500 [02:49<01:16, 1.72it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 369/500 [02:50<01:16, 1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 370/500 [02:50<01:12, 1.79it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 370/500 [02:50<01:12, 1.79it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 371/500 [02:50<01:13, 1.76it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 371/500 [02:51<01:13, 1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 372/500 [02:51<01:14, 1.73it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 74%|███████▍ | 372/500 [02:51<01:14, 1.73it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▍ | 373/500 [02:51<01:14, 1.71it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▍ | 373/500 [02:52<01:14, 1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▍ | 374/500 [02:52<01:10, 1.78it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▍ | 374/500 [02:52<01:10, 1.78it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▌ | 375/500 [02:52<01:07, 1.84it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▌ | 375/500 [02:53<01:07, 1.84it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▌ | 376/500 [02:53<01:05, 1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▌ | 376/500 [02:53<01:05, 1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▌ | 377/500 [02:53<01:07, 1.82it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 75%|███████▌ | 377/500 [02:54<01:07, 1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 378/500 [02:54<01:01, 1.97it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 378/500 [02:54<01:01, 1.97it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 379/500 [02:54<01:00, 1.99it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 379/500 [02:55<01:00, 1.99it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 380/500 [02:55<01:00, 1.99it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 380/500 [02:55<01:00, 1.99it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 381/500 [02:55<01:03, 1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▌ | 381/500 [02:56<01:03, 1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▋ | 382/500 [02:56<01:01, 1.91it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 76%|███████▋ | 382/500 [02:57<01:01, 1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 383/500 [02:57<01:03, 1.83it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 383/500 [02:57<01:03, 1.83it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 384/500 [02:57<01:01, 1.88it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 384/500 [02:58<01:01, 1.88it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 385/500 [02:58<00:56, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 385/500 [02:58<00:56, 2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 386/500 [02:58<00:53, 2.15it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 386/500 [02:58<00:53, 2.15it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 387/500 [02:58<00:53, 2.10it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 77%|███████▋ | 387/500 [02:59<00:53, 2.10it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 388/500 [02:59<01:10, 1.58it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.71s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 388/500 [03:04<01:10, 1.58it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 389/500 [03:04<03:09, 1.70s/it][TorchDR] CauchyAffinity: Computed in 0.50s.
[TorchDR] CauchyAffinity: Computed in 0.99s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 389/500 [03:07<03:09, 1.70s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 390/500 [03:07<04:00, 2.18s/it][TorchDR] CauchyAffinity: Computed in 0.60s.
[TorchDR] CauchyAffinity: Computed in 0.70s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 390/500 [03:10<04:00, 2.18s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 391/500 [03:10<04:31, 2.49s/it][TorchDR] CauchyAffinity: Computed in 0.50s.
[TorchDR] CauchyAffinity: Computed in 0.90s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 391/500 [03:13<04:31, 2.49s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 392/500 [03:13<04:55, 2.73s/it][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.50s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 78%|███████▊ | 392/500 [03:15<04:55, 2.73s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▊ | 393/500 [03:15<04:12, 2.36s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.50s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▊ | 393/500 [03:17<04:12, 2.36s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 394/500 [03:17<03:49, 2.16s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.39s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 394/500 [03:18<03:49, 2.16s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 395/500 [03:18<03:22, 1.93s/it][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.60s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 395/500 [03:20<03:22, 1.93s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 396/500 [03:20<03:10, 1.84s/it][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.40s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 396/500 [03:21<03:10, 1.84s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 397/500 [03:21<03:01, 1.76s/it][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 79%|███████▉ | 397/500 [03:23<03:01, 1.76s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 80%|███████▉ | 398/500 [03:23<02:54, 1.71s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.59s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 80%|███████▉ | 398/500 [03:25<02:54, 1.71s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 80%|███████▉ | 399/500 [03:25<03:11, 1.89s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.51s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 80%|███████▉ | 399/500 [03:28<03:11, 1.89s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 1.85e-03 | LR: 1.00e-01: 80%|████████ | 400/500 [03:28<03:36, 2.16s/it][TorchDR] CauchyAffinity: Computed in 0.60s.
[TorchDR] CauchyAffinity: Computed in 0.59s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 80%|████████ | 400/500 [03:32<03:36, 2.16s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 80%|████████ | 401/500 [03:32<04:28, 2.71s/it][TorchDR] CauchyAffinity: Computed in 0.31s.
[TorchDR] CauchyAffinity: Computed in 0.81s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 80%|████████ | 401/500 [03:37<04:28, 2.71s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 80%|████████ | 402/500 [03:37<05:24, 3.31s/it][TorchDR] CauchyAffinity: Computed in 1.99s.
[TorchDR] CauchyAffinity: Computed in 1.30s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 80%|████████ | 402/500 [03:43<05:24, 3.31s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 403/500 [03:43<06:59, 4.33s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.60s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 403/500 [03:46<06:59, 4.33s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 404/500 [03:46<06:02, 3.78s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.49s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 404/500 [03:47<06:02, 3.78s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 405/500 [03:47<04:45, 3.01s/it][TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] CauchyAffinity: Computed in 0.40s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 405/500 [03:48<04:45, 3.01s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 406/500 [03:48<03:51, 2.46s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████ | 406/500 [03:49<03:51, 2.46s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████▏ | 407/500 [03:49<02:54, 1.87s/it][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 81%|████████▏ | 407/500 [03:49<02:54, 1.87s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 408/500 [03:49<02:14, 1.46s/it][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 408/500 [03:50<02:14, 1.46s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 409/500 [03:50<01:49, 1.21s/it][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 409/500 [03:50<01:49, 1.21s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 410/500 [03:50<01:32, 1.02s/it][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 410/500 [03:51<01:32, 1.02s/it]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 411/500 [03:51<01:27, 1.01it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 411/500 [03:52<01:27, 1.01it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 412/500 [03:52<01:21, 1.08it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 82%|████████▏ | 412/500 [03:53<01:21, 1.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 413/500 [03:53<01:20, 1.09it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 413/500 [03:54<01:20, 1.09it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 414/500 [03:54<01:13, 1.17it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 414/500 [03:54<01:13, 1.17it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 415/500 [03:54<01:03, 1.34it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 415/500 [03:55<01:03, 1.34it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 416/500 [03:55<00:56, 1.48it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 416/500 [03:55<00:56, 1.48it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 417/500 [03:55<00:54, 1.54it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.11s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 83%|████████▎ | 417/500 [03:56<00:54, 1.54it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▎ | 418/500 [03:56<00:54, 1.50it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▎ | 418/500 [03:57<00:54, 1.50it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 419/500 [03:57<00:50, 1.62it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 419/500 [03:57<00:50, 1.62it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 420/500 [03:57<00:48, 1.63it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 420/500 [03:58<00:48, 1.63it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 421/500 [03:58<00:50, 1.57it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 421/500 [03:58<00:50, 1.57it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 422/500 [03:58<00:48, 1.60it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 84%|████████▍ | 422/500 [03:59<00:48, 1.60it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▍ | 423/500 [03:59<00:49, 1.54it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▍ | 423/500 [04:00<00:49, 1.54it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▍ | 424/500 [04:00<00:45, 1.66it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▍ | 424/500 [04:00<00:45, 1.66it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▌ | 425/500 [04:00<00:47, 1.58it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▌ | 425/500 [04:01<00:47, 1.58it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▌ | 426/500 [04:01<00:43, 1.69it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▌ | 426/500 [04:02<00:43, 1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▌ | 427/500 [04:02<00:45, 1.60it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 85%|████████▌ | 427/500 [04:02<00:45, 1.60it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 428/500 [04:02<00:48, 1.48it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 428/500 [04:03<00:48, 1.48it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 429/500 [04:03<00:46, 1.53it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 429/500 [04:04<00:46, 1.53it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 430/500 [04:04<00:48, 1.44it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 430/500 [04:04<00:48, 1.44it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 431/500 [04:04<00:44, 1.56it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▌ | 431/500 [04:05<00:44, 1.56it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▋ | 432/500 [04:05<00:40, 1.68it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 86%|████████▋ | 432/500 [04:05<00:40, 1.68it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 433/500 [04:05<00:40, 1.67it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 433/500 [04:06<00:40, 1.67it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 434/500 [04:06<00:37, 1.76it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 434/500 [04:06<00:37, 1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 435/500 [04:06<00:37, 1.73it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 435/500 [04:07<00:37, 1.73it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 436/500 [04:07<00:39, 1.62it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 436/500 [04:08<00:39, 1.62it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 437/500 [04:08<00:38, 1.64it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.30s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 87%|████████▋ | 437/500 [04:08<00:38, 1.64it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 438/500 [04:08<00:37, 1.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 438/500 [04:09<00:37, 1.65it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 439/500 [04:09<00:36, 1.65it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 439/500 [04:09<00:36, 1.65it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 440/500 [04:09<00:34, 1.74it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 440/500 [04:10<00:34, 1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 441/500 [04:10<00:34, 1.72it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 441/500 [04:11<00:34, 1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 442/500 [04:11<00:35, 1.62it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 88%|████████▊ | 442/500 [04:11<00:35, 1.62it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▊ | 443/500 [04:11<00:34, 1.64it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▊ | 443/500 [04:12<00:34, 1.64it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 444/500 [04:12<00:32, 1.73it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 444/500 [04:13<00:32, 1.73it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 445/500 [04:13<00:33, 1.63it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 445/500 [04:13<00:33, 1.63it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 446/500 [04:13<00:31, 1.72it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 446/500 [04:13<00:31, 1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 447/500 [04:13<00:29, 1.80it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 89%|████████▉ | 447/500 [04:14<00:29, 1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 90%|████████▉ | 448/500 [04:14<00:28, 1.85it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 90%|████████▉ | 448/500 [04:14<00:28, 1.85it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 90%|████████▉ | 449/500 [04:14<00:26, 1.90it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 90%|████████▉ | 449/500 [04:15<00:26, 1.90it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 2.13e-04 | LR: 1.00e-01: 90%|█████████ | 450/500 [04:15<00:24, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 90%|█████████ | 450/500 [04:16<00:24, 2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 90%|█████████ | 451/500 [04:16<00:25, 1.91it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 90%|█████████ | 451/500 [04:16<00:25, 1.91it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 90%|█████████ | 452/500 [04:16<00:26, 1.84it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 90%|█████████ | 452/500 [04:17<00:26, 1.84it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 453/500 [04:17<00:25, 1.87it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 453/500 [04:17<00:25, 1.87it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 454/500 [04:17<00:23, 1.92it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 454/500 [04:18<00:23, 1.92it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 455/500 [04:18<00:21, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 455/500 [04:18<00:21, 2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 456/500 [04:18<00:26, 1.64it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.39s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████ | 456/500 [04:20<00:26, 1.64it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████▏| 457/500 [04:20<00:32, 1.32it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 91%|█████████▏| 457/500 [04:20<00:32, 1.32it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 458/500 [04:20<00:29, 1.41it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 458/500 [04:21<00:29, 1.41it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 459/500 [04:21<00:26, 1.55it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 459/500 [04:21<00:26, 1.55it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 460/500 [04:21<00:24, 1.66it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 460/500 [04:21<00:24, 1.66it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 461/500 [04:22<00:21, 1.85it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 461/500 [04:22<00:21, 1.85it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 462/500 [04:22<00:20, 1.89it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 92%|█████████▏| 462/500 [04:22<00:20, 1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 463/500 [04:22<00:18, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 463/500 [04:23<00:18, 2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 464/500 [04:23<00:20, 1.72it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 464/500 [04:24<00:20, 1.72it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 465/500 [04:24<00:19, 1.79it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 465/500 [04:24<00:19, 1.79it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 466/500 [04:24<00:17, 1.96it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.29s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 466/500 [04:25<00:17, 1.96it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 467/500 [04:25<00:17, 1.86it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 93%|█████████▎| 467/500 [04:25<00:17, 1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▎| 468/500 [04:25<00:15, 2.01it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▎| 468/500 [04:26<00:15, 2.01it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 469/500 [04:26<00:16, 1.89it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 469/500 [04:26<00:16, 1.89it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 470/500 [04:26<00:16, 1.82it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 470/500 [04:27<00:16, 1.82it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 471/500 [04:27<00:17, 1.69it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 471/500 [04:28<00:17, 1.69it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 472/500 [04:28<00:16, 1.68it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 94%|█████████▍| 472/500 [04:28<00:16, 1.68it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▍| 473/500 [04:28<00:16, 1.60it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▍| 473/500 [04:29<00:16, 1.60it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▍| 474/500 [04:29<00:15, 1.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▍| 474/500 [04:30<00:15, 1.70it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▌| 475/500 [04:30<00:15, 1.61it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▌| 475/500 [04:30<00:15, 1.61it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▌| 476/500 [04:30<00:14, 1.70it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▌| 476/500 [04:30<00:14, 1.70it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▌| 477/500 [04:30<00:12, 1.79it/s][TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 95%|█████████▌| 477/500 [04:31<00:12, 1.79it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 478/500 [04:31<00:11, 1.95it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 478/500 [04:31<00:11, 1.95it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 479/500 [04:31<00:10, 2.08it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 479/500 [04:32<00:10, 2.08it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 480/500 [04:32<00:09, 2.06it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 480/500 [04:32<00:09, 2.06it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 481/500 [04:32<00:09, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▌| 481/500 [04:33<00:09, 2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▋| 482/500 [04:33<00:08, 2.16it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 96%|█████████▋| 482/500 [04:33<00:08, 2.16it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 483/500 [04:33<00:07, 2.26it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 483/500 [04:34<00:07, 2.26it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 484/500 [04:34<00:07, 2.04it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 484/500 [04:34<00:07, 2.04it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 485/500 [04:34<00:07, 2.03it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 485/500 [04:35<00:07, 2.03it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 486/500 [04:35<00:06, 2.14it/s][TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 486/500 [04:35<00:06, 2.14it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 487/500 [04:35<00:07, 1.76it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 97%|█████████▋| 487/500 [04:36<00:07, 1.76it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 488/500 [04:36<00:07, 1.65it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 488/500 [04:37<00:07, 1.65it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 489/500 [04:37<00:06, 1.74it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 489/500 [04:37<00:06, 1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 490/500 [04:37<00:05, 1.81it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 490/500 [04:38<00:05, 1.81it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 491/500 [04:38<00:04, 1.86it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 491/500 [04:38<00:04, 1.86it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 492/500 [04:38<00:04, 1.80it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 98%|█████████▊| 492/500 [04:39<00:04, 1.80it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▊| 493/500 [04:39<00:03, 1.96it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▊| 493/500 [04:39<00:03, 1.96it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 494/500 [04:39<00:03, 1.98it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 494/500 [04:40<00:03, 1.98it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 495/500 [04:40<00:02, 1.77it/s][TorchDR] CauchyAffinity: Computed in 0.19s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 495/500 [04:40<00:02, 1.77it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 496/500 [04:40<00:02, 1.74it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 496/500 [04:41<00:02, 1.74it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 497/500 [04:41<00:01, 1.71it/s][TorchDR] CauchyAffinity: Computed in 0.09s.
[TorchDR] CauchyAffinity: Computed in 0.20s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 99%|█████████▉| 497/500 [04:42<00:01, 1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 498/500 [04:42<00:01, 1.71it/s][TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] CauchyAffinity: Computed in 0.01s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 498/500 [04:42<00:01, 1.71it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 499/500 [04:42<00:00, 1.78it/s][TorchDR] CauchyAffinity: Computed in 0.00s.
[TorchDR] CauchyAffinity: Computed in 0.10s.
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|█████████▉| 499/500 [04:43<00:00, 1.78it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|██████████| 500/500 [04:43<00:00, 1.84it/s]
[TorchDR] COSNE: Loss: 1.22e+01 | Grad norm: 9.91e-04 | LR: 1.00e-01: 100%|██████████| 500/500 [04:43<00:00, 1.77it/s]
[TorchDR] COSNE: Computed in 283.60s.
/home/circleci/project/torchdr/utils/visu.py:29: RuntimeWarning: invalid value encountered in arccosh
hypDistance = np.arccosh(1 + 2 * (distance) / (1 - distance + 1e-10))
Total running time of the script: (6 minutes 24.716 seconds)