TSNE vs COSNE : Euclidean vs Hyperbolic#

We compare in this example two dimensionalty reduction methods: T-SNE and CO-SNE on a synthetic hierarchical toy dataset and on singlecell data. The first method computes an embedding in a 2D Euclidean space while the second one operates in the Hyperbolic Poincaré Ball model.

Designing the synthetic hierarchical dataset#

We first construct a synthetic hierarchical dataset with the following class

import numpy as np
from torchdr.utils.visu import plotGrid
from torchdr import TSNE, COSNE
from torchdr import pairwise_distances
import torch
import itertools
import urllib.request
import matplotlib.pylab as plt
from torchdr.utils import geoopt


class SyntheticDataset(torch.utils.data.Dataset):
    """
    Implementation of a synthetic dataset by hierarchical diffusion.

    Adopted from https://github.com/emilemathieu/pvae/

    Parameters
    ----------
    ball : torchdr.utils.geoopt.PoincareBall
        The Poincaré ball used for generating the dataset.
    dim : int
        Dimension of the input sample.
    depth : int
        Depth of the tree; the root corresponds to the depth 0.
    num_children : int
        Number of children of each node in the tree.
    dist_children : float
        Distance parameter for children nodes.
    sigma_sibling : float
        Noise parameter for sibling nodes.
    num_siblings : int
        Number of noisy observations obtained from the nodes of the tree.
    """

    def __init__(
        self,
        ball,
        dim,
        depth,
        num_children=2,
        dist_children=1,
        sigma_sibling=2,
        num_siblings=1,
    ):
        assert num_children == 2
        self.dim = int(dim)
        self.ball = ball
        self.root = ball.origin(self.dim)
        self.sigma_sibling = sigma_sibling
        self.depth = int(depth)
        self.dist_children = dist_children
        self.num_children = int(num_children)
        self.num_siblings = int(num_siblings)
        self.__class_counter = itertools.count()
        self.origin_data, self.origin_labels, self.data, self.labels = map(
            torch.detach, self.bst()
        )
        self.num_classes = self.origin_labels.max().item() + 1

    def __len__(self):
        """
        Return the total number of samples/nodes.

        Returns
        -------
        int
            Number of samples in the dataset.
        """
        return len(self.data)

    def __getitem__(self, idx):
        """
        Generate one sample.

        Parameters
        ----------
        idx : int
            Index of the sample to retrieve.

        Returns
        -------
        tuple
            Contains (data, labels, max_label) for the requested index.
        """
        data, labels = self.data[idx], self.labels[idx]
        return data, labels, labels.max(-1).values

    def get_children(self, parent_value, parent_label, current_depth, offspring=True):
        """
        Generate children nodes or noisy observations from a parent node.

        Parameters
        ----------
        parent_value : torch.Tensor
            1D array representing the parent node value.
        parent_label : torch.Tensor
            1D array representing the parent node label.
        current_depth : int
            Current depth in the tree.
        offspring : bool, default=True
            If True, the parent node gives birth to num_children nodes.
            If False, the parent node gives birth to num_siblings noisy observations.

        Returns
        -------
        list
            List of 2-tuples containing the value and label of each child of the
            parent node. Length depends on offspring parameter.
        """
        if offspring:
            num_children = self.num_children
            sigma = self.dist_children
        else:
            num_children = self.num_siblings
            sigma = self.sigma_sibling
        if offspring:
            direction = torch.randn_like(parent_value)
            parent_value_n = parent_value / parent_value.norm().clamp_min(1e-15)
            direction -= parent_value_n @ direction * parent_value_n
            child_value_1 = self.ball.geodesic_unit(
                torch.tensor(sigma), parent_value, direction
            )
            child_value_2 = self.ball.geodesic_unit(
                torch.tensor(sigma), parent_value, -direction
            )
            child_label_1 = parent_label.clone()
            child_label_1[current_depth] = next(self.__class_counter)
            child_label_2 = parent_label.clone()
            child_label_2[current_depth] = next(self.__class_counter)
            children = [(child_value_1, child_label_1), (child_value_2, child_label_2)]
        else:
            children = []
            for i in range(num_children):
                child_value = self.ball.random(
                    self.dim, mean=parent_value, std=sigma**0.5
                )
                child_label = parent_label.clone()
                children.append((child_value, child_label))
        return children

    def bst(self):
        """
        Generate all nodes of a level before proceeding to the next level.

        This method builds the hierarchical tree structure level by level.

        Returns
        -------
        tuple
            Contains (images, labels_visited, values_clones, labels_clones)
            representing the original data points, their labels, and the
            noisy observations with their labels.
        """
        label = -torch.ones(self.depth + 1, dtype=torch.long)
        label[0] = next(self.__class_counter)
        queue = [(self.root, label, 0)]
        visited = []
        labels_visited = []
        values_clones = []
        labels_clones = []
        while len(queue) > 0:
            current_node, current_label, current_depth = queue.pop(0)
            visited.append(current_node)
            labels_visited.append(current_label)
            if current_depth < self.depth:
                children = self.get_children(current_node, current_label, current_depth)
                for child in children:
                    queue.append((child[0], child[1], current_depth + 1))
            if current_depth <= self.depth:
                clones = self.get_children(
                    current_node, current_label, current_depth, False
                )
                for clone in clones:
                    values_clones.append(clone[0])
                    labels_clones.append(clone[1])
        length = int(
            ((self.num_children) ** (self.depth + 1) - 1) / (self.num_children - 1)
        )
        images = torch.cat([i for i in visited]).reshape(length, self.dim)
        labels_visited = torch.cat([i for i in labels_visited]).reshape(
            length, self.depth + 1
        )[:, : self.depth]
        values_clones = torch.cat([i for i in values_clones]).reshape(
            self.num_siblings * length, self.dim
        )
        labels_clones = torch.cat([i for i in labels_clones]).reshape(
            self.num_siblings * length, self.depth + 1
        )
        return images, labels_visited, values_clones, labels_clones

Generating the data#

Let us now generate some data of interest. The dimension of the input space is set to 50

ball = geoopt.PoincareBall()

dataset = SyntheticDataset(
    ball, 50, 2, num_siblings=100, sigma_sibling=0.05, dist_children=0.7
)
data_points = dataset.data
data_points = data_points - data_points.mean(axis=0)

labels = dataset.labels
colors = dataset.labels.max(-1).values

Visualization of the original similarities#

We can observe the hierarchical nature of the input data by examining the pairwaise distance matrix in the input space

dist_matrix, _ = pairwise_distances(data_points, data_points, metric="sqeuclidean")

plt.figure()
plt.imshow(dist_matrix)
plt.title("Distance matrix in the input space")
plt.show()
Distance matrix in the input space

Computing TSNE and COSNE#

We can now proceed to computing the two DR methods and visualizing the results

tsne_model = TSNE(verbose=True, max_iter=500)
out_tsne = tsne_model.fit_transform(data_points)

cosne_model = COSNE(lr=1e-1, verbose=True, gamma=0.5, lambda1=0.01, max_iter=500)
out_cosne = cosne_model.fit_transform(data_points)


fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(8, 4))
axes[0].scatter(*out_tsne.T, c=colors, cmap=plt.get_cmap("rainbow"))
axes[0].set_xticks([])
axes[0].set_yticks([])
axes[0].set_title("T-SNE", fontsize=24)
plotGrid(axes[1])
axes[1].scatter(*out_cosne.T, c=colors, cmap=plt.get_cmap("rainbow"))
axes[1].axis("off")
axes[1].set_title("CO-SNE", fontsize=24)
plt.show()
T-SNE, CO-SNE
Random state is None
[TorchDR] Initializing DR model TSNE.
[TorchDR] Affinity : computing the Entropic Affinity matrix.
[TorchDR] Affinity : sparsity mode enabled, computing 90 nearest neighbors. If this step is too slow, consider reducing the dimensionality of the data or disabling sparsity.

  0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  5.27e-01 (std =  5.68e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  3.72e-01 (std =  5.93e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  2.42e-01 (std =  5.54e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  1.48e-01 (std =  4.64e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  8.68e-02 (std =  3.58e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  5.01e-02 (std =  2.60e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  5.01e-02 (std =  2.60e-02) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  2.88e-02 (std =  1.82e-02) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  1.66e-02 (std =  1.24e-02) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  9.57e-03 (std =  8.43e-03) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  5.58e-03 (std =  5.69e-03) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  3.28e-03 (std =  3.84e-03) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  1.95e-03 (std =  2.60e-03) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  1.17e-03 (std =  1.76e-03) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  7.05e-04 (std =  1.20e-03) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  4.30e-04 (std =  8.23e-04) :   6%|▌         | 6/100 [00:00<00:01, 59.68it/s]
[TorchDR] Root search : mean abs value =  4.30e-04 (std =  8.23e-04) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  2.64e-04 (std =  5.66e-04) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  1.64e-04 (std =  3.92e-04) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  1.02e-04 (std =  2.72e-04) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  6.47e-05 (std =  1.90e-04) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  4.11e-05 (std =  1.33e-04) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  2.63e-05 (std =  9.35e-05) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  1.70e-05 (std =  6.59e-05) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  1.11e-05 (std =  4.65e-05) :  15%|█▌        | 15/100 [00:00<00:01, 77.16it/s]
[TorchDR] Root search : mean abs value =  1.11e-05 (std =  4.65e-05) :  23%|██▎       | 23/100 [00:00<00:01, 54.35it/s]
[TorchDR] Root search : mean abs value =  1.11e-05 (std =  4.65e-05) :  23%|██▎       | 23/100 [00:00<00:01, 57.65it/s]
Random state is None

  0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   0%|          | 2/500 [00:00<00:49, 10.05it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   0%|          | 2/500 [00:00<00:49, 10.05it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   0%|          | 2/500 [00:00<00:49, 10.05it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   1%|          | 4/500 [00:00<00:49, 10.04it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   1%|          | 4/500 [00:00<00:49, 10.04it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   1%|          | 4/500 [00:00<00:49, 10.04it/s]
[TorchDR] DR Loss : 1.31e+01 | Grad norm : 2.50e-05 :   1%|          | 6/500 [00:00<00:38, 12.86it/s]
[TorchDR] DR Loss : 1.40e+01 | Grad norm : 2.50e-05 :   1%|          | 6/500 [00:00<00:38, 12.86it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 2.50e-05 :   1%|          | 6/500 [00:00<00:38, 12.86it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 2.50e-05 :   2%|▏         | 8/500 [00:00<00:42, 11.61it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 2.50e-05 :   2%|▏         | 8/500 [00:00<00:42, 11.61it/s]
[TorchDR] DR Loss : 1.85e+01 | Grad norm : 2.50e-05 :   2%|▏         | 8/500 [00:00<00:42, 11.61it/s]
[TorchDR] DR Loss : 1.85e+01 | Grad norm : 2.50e-05 :   2%|▏         | 10/500 [00:00<00:44, 10.95it/s]
[TorchDR] DR Loss : 1.79e+01 | Grad norm : 2.50e-05 :   2%|▏         | 10/500 [00:00<00:44, 10.95it/s]
[TorchDR] DR Loss : 1.84e+01 | Grad norm : 2.50e-05 :   2%|▏         | 10/500 [00:01<00:44, 10.95it/s]
[TorchDR] DR Loss : 1.84e+01 | Grad norm : 2.50e-05 :   2%|▏         | 12/500 [00:01<00:45, 10.66it/s]
[TorchDR] DR Loss : 1.88e+01 | Grad norm : 2.50e-05 :   2%|▏         | 12/500 [00:01<00:45, 10.66it/s]
[TorchDR] DR Loss : 1.78e+01 | Grad norm : 2.50e-05 :   2%|▏         | 12/500 [00:01<00:45, 10.66it/s]
[TorchDR] DR Loss : 1.78e+01 | Grad norm : 2.50e-05 :   3%|▎         | 14/500 [00:01<00:39, 12.45it/s]
[TorchDR] DR Loss : 1.90e+01 | Grad norm : 2.50e-05 :   3%|▎         | 14/500 [00:01<00:39, 12.45it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   3%|▎         | 14/500 [00:01<00:39, 12.45it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   3%|▎         | 16/500 [00:01<00:41, 11.60it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   3%|▎         | 16/500 [00:01<00:41, 11.60it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   3%|▎         | 16/500 [00:01<00:41, 11.60it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   4%|▎         | 18/500 [00:01<00:43, 11.11it/s]
[TorchDR] DR Loss : 1.84e+01 | Grad norm : 2.50e-05 :   4%|▎         | 18/500 [00:01<00:43, 11.11it/s]
[TorchDR] DR Loss : 1.78e+01 | Grad norm : 2.50e-05 :   4%|▎         | 18/500 [00:01<00:43, 11.11it/s]
[TorchDR] DR Loss : 1.78e+01 | Grad norm : 2.50e-05 :   4%|▍         | 20/500 [00:01<00:37, 12.77it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   4%|▍         | 20/500 [00:01<00:37, 12.77it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   4%|▍         | 20/500 [00:01<00:37, 12.77it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   4%|▍         | 22/500 [00:01<00:40, 11.81it/s]
[TorchDR] DR Loss : 1.80e+01 | Grad norm : 2.50e-05 :   4%|▍         | 22/500 [00:01<00:40, 11.81it/s]
[TorchDR] DR Loss : 1.83e+01 | Grad norm : 2.50e-05 :   4%|▍         | 22/500 [00:02<00:40, 11.81it/s]
[TorchDR] DR Loss : 1.83e+01 | Grad norm : 2.50e-05 :   5%|▍         | 24/500 [00:02<00:35, 13.44it/s]
[TorchDR] DR Loss : 1.79e+01 | Grad norm : 2.50e-05 :   5%|▍         | 24/500 [00:02<00:35, 13.44it/s]
[TorchDR] DR Loss : 1.86e+01 | Grad norm : 2.50e-05 :   5%|▍         | 24/500 [00:02<00:35, 13.44it/s]
[TorchDR] DR Loss : 1.86e+01 | Grad norm : 2.50e-05 :   5%|▌         | 26/500 [00:02<00:38, 12.24it/s]
[TorchDR] DR Loss : 1.77e+01 | Grad norm : 2.50e-05 :   5%|▌         | 26/500 [00:02<00:38, 12.24it/s]
[TorchDR] DR Loss : 1.84e+01 | Grad norm : 2.50e-05 :   5%|▌         | 26/500 [00:02<00:38, 12.24it/s]
[TorchDR] DR Loss : 1.84e+01 | Grad norm : 2.50e-05 :   6%|▌         | 28/500 [00:02<00:34, 13.77it/s]
[TorchDR] DR Loss : 1.83e+01 | Grad norm : 2.50e-05 :   6%|▌         | 28/500 [00:02<00:34, 13.77it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 2.50e-05 :   6%|▌         | 28/500 [00:02<00:34, 13.77it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 2.50e-05 :   6%|▌         | 30/500 [00:02<00:37, 12.42it/s]
[TorchDR] DR Loss : 1.87e+01 | Grad norm : 2.50e-05 :   6%|▌         | 30/500 [00:02<00:37, 12.42it/s]
[TorchDR] DR Loss : 1.78e+01 | Grad norm : 2.50e-05 :   6%|▌         | 30/500 [00:02<00:37, 12.42it/s]
[TorchDR] DR Loss : 1.78e+01 | Grad norm : 2.50e-05 :   6%|▋         | 32/500 [00:02<00:40, 11.64it/s]
[TorchDR] DR Loss : 1.81e+01 | Grad norm : 2.50e-05 :   6%|▋         | 32/500 [00:02<00:40, 11.64it/s]
[TorchDR] DR Loss : 1.76e+01 | Grad norm : 2.50e-05 :   6%|▋         | 32/500 [00:02<00:40, 11.64it/s]
[TorchDR] DR Loss : 1.76e+01 | Grad norm : 2.50e-05 :   7%|▋         | 34/500 [00:02<00:35, 13.19it/s]
[TorchDR] DR Loss : 1.79e+01 | Grad norm : 2.50e-05 :   7%|▋         | 34/500 [00:02<00:35, 13.19it/s]
[TorchDR] DR Loss : 1.76e+01 | Grad norm : 2.50e-05 :   7%|▋         | 34/500 [00:02<00:35, 13.19it/s]
[TorchDR] DR Loss : 1.76e+01 | Grad norm : 2.50e-05 :   7%|▋         | 36/500 [00:02<00:38, 12.11it/s]
[TorchDR] DR Loss : 1.76e+01 | Grad norm : 2.50e-05 :   7%|▋         | 36/500 [00:03<00:38, 12.11it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 2.50e-05 :   7%|▋         | 36/500 [00:03<00:38, 12.11it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 2.50e-05 :   8%|▊         | 38/500 [00:03<00:40, 11.33it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 2.50e-05 :   8%|▊         | 38/500 [00:03<00:40, 11.33it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 2.50e-05 :   8%|▊         | 38/500 [00:03<00:40, 11.33it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 2.50e-05 :   8%|▊         | 40/500 [00:03<00:42, 10.92it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 2.50e-05 :   8%|▊         | 40/500 [00:03<00:42, 10.92it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 2.50e-05 :   8%|▊         | 40/500 [00:03<00:42, 10.92it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 2.50e-05 :   8%|▊         | 42/500 [00:03<00:43, 10.63it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 2.50e-05 :   8%|▊         | 42/500 [00:03<00:43, 10.63it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 2.50e-05 :   8%|▊         | 42/500 [00:03<00:43, 10.63it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 2.50e-05 :   9%|▉         | 44/500 [00:03<00:43, 10.43it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 2.50e-05 :   9%|▉         | 44/500 [00:03<00:43, 10.43it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 2.50e-05 :   9%|▉         | 44/500 [00:03<00:43, 10.43it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 2.50e-05 :   9%|▉         | 46/500 [00:03<00:37, 12.09it/s]
[TorchDR] DR Loss : 1.75e+01 | Grad norm : 2.50e-05 :   9%|▉         | 46/500 [00:04<00:37, 12.09it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 2.50e-05 :   9%|▉         | 46/500 [00:04<00:37, 12.09it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 2.50e-05 :  10%|▉         | 48/500 [00:04<00:39, 11.37it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 2.50e-05 :  10%|▉         | 48/500 [00:04<00:39, 11.37it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 2.50e-05 :  10%|▉         | 48/500 [00:04<00:39, 11.37it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 2.50e-05 :  10%|█         | 50/500 [00:04<00:41, 10.96it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  10%|█         | 50/500 [00:04<00:41, 10.96it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 6.50e-01 :  10%|█         | 50/500 [00:04<00:41, 10.96it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 6.50e-01 :  10%|█         | 52/500 [00:04<00:41, 10.71it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 6.50e-01 :  10%|█         | 52/500 [00:04<00:41, 10.71it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 6.50e-01 :  10%|█         | 52/500 [00:04<00:41, 10.71it/s]
[TorchDR] DR Loss : 1.74e+01 | Grad norm : 6.50e-01 :  11%|█         | 54/500 [00:04<00:42, 10.48it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.50e-01 :  11%|█         | 54/500 [00:04<00:42, 10.48it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  11%|█         | 54/500 [00:04<00:42, 10.48it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  11%|█         | 56/500 [00:04<00:42, 10.33it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 6.50e-01 :  11%|█         | 56/500 [00:04<00:42, 10.33it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  11%|█         | 56/500 [00:05<00:42, 10.33it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 58/500 [00:05<00:43, 10.18it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 58/500 [00:05<00:43, 10.18it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 58/500 [00:05<00:43, 10.18it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 60/500 [00:05<00:43, 10.11it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 60/500 [00:05<00:43, 10.11it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 60/500 [00:05<00:43, 10.11it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 62/500 [00:05<00:43, 10.09it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 62/500 [00:05<00:43, 10.09it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  12%|█▏        | 62/500 [00:05<00:43, 10.09it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  13%|█▎        | 64/500 [00:05<00:43, 10.05it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.50e-01 :  13%|█▎        | 64/500 [00:05<00:43, 10.05it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  13%|█▎        | 64/500 [00:05<00:43, 10.05it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  13%|█▎        | 66/500 [00:05<00:43, 10.06it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 6.50e-01 :  13%|█▎        | 66/500 [00:06<00:43, 10.06it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.50e-01 :  13%|█▎        | 66/500 [00:06<00:43, 10.06it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.50e-01 :  14%|█▎        | 68/500 [00:06<00:47,  9.12it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.50e-01 :  14%|█▎        | 68/500 [00:06<00:47,  9.12it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 69/500 [00:06<00:48,  8.84it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 69/500 [00:06<00:48,  8.84it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 70/500 [00:06<00:47,  9.04it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 70/500 [00:06<00:47,  9.04it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 71/500 [00:06<00:46,  9.22it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 71/500 [00:06<00:46,  9.22it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 72/500 [00:06<00:45,  9.36it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 72/500 [00:06<00:45,  9.36it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  14%|█▍        | 72/500 [00:06<00:45,  9.36it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  15%|█▍        | 74/500 [00:06<00:44,  9.64it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  15%|█▍        | 74/500 [00:06<00:44,  9.64it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  15%|█▌        | 75/500 [00:06<00:43,  9.70it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  15%|█▌        | 75/500 [00:07<00:43,  9.70it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  15%|█▌        | 76/500 [00:07<00:43,  9.74it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  15%|█▌        | 76/500 [00:07<00:43,  9.74it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  15%|█▌        | 76/500 [00:07<00:43,  9.74it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  16%|█▌        | 78/500 [00:07<00:42,  9.86it/s]
[TorchDR] DR Loss : 1.61e+01 | Grad norm : 6.50e-01 :  16%|█▌        | 78/500 [00:07<00:42,  9.86it/s]
[TorchDR] DR Loss : 1.61e+01 | Grad norm : 6.50e-01 :  16%|█▌        | 79/500 [00:07<00:48,  8.60it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 6.50e-01 :  16%|█▌        | 79/500 [00:07<00:48,  8.60it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 6.50e-01 :  16%|█▌        | 80/500 [00:07<00:47,  8.89it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.50e-01 :  16%|█▌        | 80/500 [00:07<00:47,  8.89it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  16%|█▌        | 80/500 [00:07<00:47,  8.89it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  16%|█▋        | 82/500 [00:07<00:47,  8.80it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  16%|█▋        | 82/500 [00:07<00:47,  8.80it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 83/500 [00:07<00:46,  9.03it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 83/500 [00:07<00:46,  9.03it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 84/500 [00:07<00:45,  9.20it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 84/500 [00:08<00:45,  9.20it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 84/500 [00:08<00:45,  9.20it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 86/500 [00:08<00:43,  9.52it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 86/500 [00:08<00:43,  9.52it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  17%|█▋        | 86/500 [00:08<00:43,  9.52it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 88/500 [00:08<00:42,  9.73it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 88/500 [00:08<00:42,  9.73it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 89/500 [00:08<00:47,  8.58it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 89/500 [00:08<00:47,  8.58it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 89/500 [00:08<00:47,  8.58it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 91/500 [00:08<00:47,  8.59it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 91/500 [00:08<00:47,  8.59it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  18%|█▊        | 91/500 [00:08<00:47,  8.59it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.50e-01 :  19%|█▊        | 93/500 [00:08<00:44,  9.05it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.50e-01 :  19%|█▊        | 93/500 [00:08<00:44,  9.05it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 6.50e-01 :  19%|█▊        | 93/500 [00:09<00:44,  9.05it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 6.50e-01 :  19%|█▉        | 95/500 [00:09<00:43,  9.33it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  19%|█▉        | 95/500 [00:09<00:43,  9.33it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  19%|█▉        | 96/500 [00:09<00:42,  9.45it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.50e-01 :  19%|█▉        | 96/500 [00:09<00:42,  9.45it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.50e-01 :  19%|█▉        | 96/500 [00:09<00:42,  9.45it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.50e-01 :  20%|█▉        | 98/500 [00:09<00:41,  9.65it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.50e-01 :  20%|█▉        | 98/500 [00:09<00:41,  9.65it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.50e-01 :  20%|█▉        | 98/500 [00:09<00:41,  9.65it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.50e-01 :  20%|██        | 100/500 [00:09<00:40,  9.79it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  20%|██        | 100/500 [00:09<00:40,  9.79it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  20%|██        | 101/500 [00:09<00:40,  9.83it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  20%|██        | 101/500 [00:09<00:40,  9.83it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  20%|██        | 102/500 [00:09<00:40,  9.86it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  20%|██        | 102/500 [00:09<00:40,  9.86it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  20%|██        | 102/500 [00:09<00:40,  9.86it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  21%|██        | 104/500 [00:09<00:32, 12.15it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  21%|██        | 104/500 [00:10<00:32, 12.15it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  21%|██        | 104/500 [00:10<00:32, 12.15it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  21%|██        | 106/500 [00:10<00:34, 11.33it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  21%|██        | 106/500 [00:10<00:34, 11.33it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  21%|██        | 106/500 [00:10<00:34, 11.33it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 108/500 [00:10<00:36, 10.88it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 108/500 [00:10<00:36, 10.88it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 108/500 [00:10<00:36, 10.88it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 110/500 [00:10<00:30, 12.68it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 110/500 [00:10<00:30, 12.68it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 110/500 [00:10<00:30, 12.68it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 112/500 [00:10<00:33, 11.73it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 112/500 [00:10<00:33, 11.73it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  22%|██▏       | 112/500 [00:10<00:33, 11.73it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  23%|██▎       | 114/500 [00:10<00:34, 11.17it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.44e-01 :  23%|██▎       | 114/500 [00:10<00:34, 11.17it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  23%|██▎       | 114/500 [00:10<00:34, 11.17it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  23%|██▎       | 116/500 [00:10<00:29, 12.82it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  23%|██▎       | 116/500 [00:11<00:29, 12.82it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.44e-01 :  23%|██▎       | 116/500 [00:11<00:29, 12.82it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.44e-01 :  24%|██▎       | 118/500 [00:11<00:32, 11.81it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  24%|██▎       | 118/500 [00:11<00:32, 11.81it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  24%|██▎       | 118/500 [00:11<00:32, 11.81it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  24%|██▍       | 120/500 [00:11<00:33, 11.21it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  24%|██▍       | 120/500 [00:11<00:33, 11.21it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  24%|██▍       | 120/500 [00:11<00:33, 11.21it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  24%|██▍       | 122/500 [00:11<00:34, 10.87it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.44e-01 :  24%|██▍       | 122/500 [00:11<00:34, 10.87it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  24%|██▍       | 122/500 [00:11<00:34, 10.87it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  25%|██▍       | 124/500 [00:11<00:30, 12.53it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  25%|██▍       | 124/500 [00:11<00:30, 12.53it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  25%|██▍       | 124/500 [00:11<00:30, 12.53it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  25%|██▌       | 126/500 [00:11<00:31, 11.71it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  25%|██▌       | 126/500 [00:11<00:31, 11.71it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  25%|██▌       | 126/500 [00:11<00:31, 11.71it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  26%|██▌       | 128/500 [00:11<00:28, 13.28it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.44e-01 :  26%|██▌       | 128/500 [00:11<00:28, 13.28it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  26%|██▌       | 128/500 [00:12<00:28, 13.28it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  26%|██▌       | 130/500 [00:12<00:30, 12.12it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  26%|██▌       | 130/500 [00:12<00:30, 12.12it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  26%|██▌       | 130/500 [00:12<00:30, 12.12it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  26%|██▋       | 132/500 [00:12<00:26, 13.69it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  26%|██▋       | 132/500 [00:12<00:26, 13.69it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  26%|██▋       | 132/500 [00:12<00:26, 13.69it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  27%|██▋       | 134/500 [00:12<00:29, 12.35it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.44e-01 :  27%|██▋       | 134/500 [00:12<00:29, 12.35it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  27%|██▋       | 134/500 [00:12<00:29, 12.35it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  27%|██▋       | 136/500 [00:12<00:31, 11.60it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.44e-01 :  27%|██▋       | 136/500 [00:12<00:31, 11.60it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.44e-01 :  27%|██▋       | 136/500 [00:12<00:31, 11.60it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 138/500 [00:12<00:27, 13.18it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 138/500 [00:12<00:27, 13.18it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 138/500 [00:12<00:27, 13.18it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 140/500 [00:12<00:29, 12.11it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 140/500 [00:12<00:29, 12.11it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 140/500 [00:13<00:29, 12.11it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 142/500 [00:13<00:26, 13.64it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 142/500 [00:13<00:26, 13.64it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  28%|██▊       | 142/500 [00:13<00:26, 13.64it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.44e-01 :  29%|██▉       | 144/500 [00:13<00:28, 12.34it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.44e-01 :  29%|██▉       | 144/500 [00:13<00:28, 12.34it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  29%|██▉       | 144/500 [00:13<00:28, 12.34it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.44e-01 :  29%|██▉       | 146/500 [00:13<00:25, 13.87it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.44e-01 :  29%|██▉       | 146/500 [00:13<00:25, 13.87it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  29%|██▉       | 146/500 [00:13<00:25, 13.87it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.44e-01 :  30%|██▉       | 148/500 [00:13<00:28, 12.48it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.44e-01 :  30%|██▉       | 148/500 [00:13<00:28, 12.48it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.44e-01 :  30%|██▉       | 148/500 [00:13<00:28, 12.48it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.44e-01 :  30%|███       | 150/500 [00:13<00:25, 13.96it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  30%|███       | 150/500 [00:13<00:25, 13.96it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  30%|███       | 150/500 [00:13<00:25, 13.96it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  30%|███       | 152/500 [00:13<00:27, 12.55it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  30%|███       | 152/500 [00:13<00:27, 12.55it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  30%|███       | 152/500 [00:13<00:27, 12.55it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  31%|███       | 154/500 [00:13<00:24, 14.03it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  31%|███       | 154/500 [00:14<00:24, 14.03it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  31%|███       | 154/500 [00:14<00:24, 14.03it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  31%|███       | 156/500 [00:14<00:27, 12.57it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  31%|███       | 156/500 [00:14<00:27, 12.57it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  31%|███       | 156/500 [00:14<00:27, 12.57it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 158/500 [00:14<00:29, 11.72it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 158/500 [00:14<00:29, 11.72it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 158/500 [00:14<00:29, 11.72it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 160/500 [00:14<00:25, 13.26it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 160/500 [00:14<00:25, 13.26it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 160/500 [00:14<00:25, 13.26it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 162/500 [00:14<00:27, 12.12it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 162/500 [00:14<00:27, 12.12it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  32%|███▏      | 162/500 [00:14<00:27, 12.12it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  33%|███▎      | 164/500 [00:14<00:29, 11.45it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  33%|███▎      | 164/500 [00:14<00:29, 11.45it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  33%|███▎      | 164/500 [00:14<00:29, 11.45it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  33%|███▎      | 166/500 [00:14<00:25, 13.05it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  33%|███▎      | 166/500 [00:15<00:25, 13.05it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  33%|███▎      | 166/500 [00:15<00:25, 13.05it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  34%|███▎      | 168/500 [00:15<00:27, 12.02it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  34%|███▎      | 168/500 [00:15<00:27, 12.02it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  34%|███▎      | 168/500 [00:15<00:27, 12.02it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  34%|███▍      | 170/500 [00:15<00:24, 13.57it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  34%|███▍      | 170/500 [00:15<00:24, 13.57it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  34%|███▍      | 170/500 [00:15<00:24, 13.57it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  34%|███▍      | 172/500 [00:15<00:26, 12.35it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  34%|███▍      | 172/500 [00:15<00:26, 12.35it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  34%|███▍      | 172/500 [00:15<00:26, 12.35it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  35%|███▍      | 174/500 [00:15<00:23, 13.86it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  35%|███▍      | 174/500 [00:15<00:23, 13.86it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  35%|███▍      | 174/500 [00:15<00:23, 13.86it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  35%|███▌      | 176/500 [00:15<00:26, 12.46it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  35%|███▌      | 176/500 [00:15<00:26, 12.46it/s]
[TorchDR] DR Loss : 1.61e+01 | Grad norm : 6.45e-01 :  35%|███▌      | 176/500 [00:15<00:26, 12.46it/s]
[TorchDR] DR Loss : 1.61e+01 | Grad norm : 6.45e-01 :  36%|███▌      | 178/500 [00:15<00:23, 13.93it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  36%|███▌      | 178/500 [00:15<00:23, 13.93it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  36%|███▌      | 178/500 [00:15<00:23, 13.93it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  36%|███▌      | 180/500 [00:15<00:25, 12.54it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  36%|███▌      | 180/500 [00:16<00:25, 12.54it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  36%|███▌      | 180/500 [00:16<00:25, 12.54it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  36%|███▋      | 182/500 [00:16<00:27, 11.70it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  36%|███▋      | 182/500 [00:16<00:27, 11.70it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  36%|███▋      | 182/500 [00:16<00:27, 11.70it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  37%|███▋      | 184/500 [00:16<00:23, 13.28it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  37%|███▋      | 184/500 [00:16<00:23, 13.28it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  37%|███▋      | 184/500 [00:16<00:23, 13.28it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  37%|███▋      | 186/500 [00:16<00:21, 14.67it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  37%|███▋      | 186/500 [00:16<00:21, 14.67it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  37%|███▋      | 186/500 [00:16<00:21, 14.67it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 188/500 [00:16<00:24, 12.92it/s]
[TorchDR] DR Loss : 1.62e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 188/500 [00:16<00:24, 12.92it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 188/500 [00:16<00:24, 12.92it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 190/500 [00:16<00:25, 11.93it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 190/500 [00:16<00:25, 11.93it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 190/500 [00:16<00:25, 11.93it/s]
[TorchDR] DR Loss : 1.73e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 192/500 [00:16<00:22, 13.49it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 192/500 [00:17<00:22, 13.49it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  38%|███▊      | 192/500 [00:17<00:22, 13.49it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  39%|███▉      | 194/500 [00:17<00:24, 12.27it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  39%|███▉      | 194/500 [00:17<00:24, 12.27it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  39%|███▉      | 194/500 [00:17<00:24, 12.27it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  39%|███▉      | 196/500 [00:17<00:22, 13.77it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  39%|███▉      | 196/500 [00:17<00:22, 13.77it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  39%|███▉      | 196/500 [00:17<00:22, 13.77it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  40%|███▉      | 198/500 [00:17<00:24, 12.41it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  40%|███▉      | 198/500 [00:17<00:24, 12.41it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  40%|███▉      | 198/500 [00:17<00:24, 12.41it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  40%|████      | 200/500 [00:17<00:25, 11.62it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  40%|████      | 200/500 [00:17<00:25, 11.62it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  40%|████      | 200/500 [00:17<00:25, 11.62it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  40%|████      | 202/500 [00:17<00:22, 13.18it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  40%|████      | 202/500 [00:17<00:22, 13.18it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  40%|████      | 202/500 [00:17<00:22, 13.18it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  41%|████      | 204/500 [00:17<00:24, 12.13it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  41%|████      | 204/500 [00:17<00:24, 12.13it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  41%|████      | 204/500 [00:18<00:24, 12.13it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  41%|████      | 206/500 [00:18<00:25, 11.39it/s]
[TorchDR] DR Loss : 1.59e+01 | Grad norm : 6.45e-01 :  41%|████      | 206/500 [00:18<00:25, 11.39it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 6.45e-01 :  41%|████      | 206/500 [00:18<00:25, 11.39it/s]
[TorchDR] DR Loss : 1.72e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 208/500 [00:18<00:22, 13.00it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 208/500 [00:18<00:22, 13.00it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 208/500 [00:18<00:22, 13.00it/s]
[TorchDR] DR Loss : 1.71e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 210/500 [00:18<00:24, 12.02it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 210/500 [00:18<00:24, 12.02it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 210/500 [00:18<00:24, 12.02it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 212/500 [00:18<00:21, 13.53it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 212/500 [00:18<00:21, 13.53it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  42%|████▏     | 212/500 [00:18<00:21, 13.53it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  43%|████▎     | 214/500 [00:18<00:23, 12.27it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  43%|████▎     | 214/500 [00:18<00:23, 12.27it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  43%|████▎     | 214/500 [00:18<00:23, 12.27it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  43%|████▎     | 216/500 [00:18<00:24, 11.57it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  43%|████▎     | 216/500 [00:18<00:24, 11.57it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  43%|████▎     | 216/500 [00:19<00:24, 11.57it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  44%|████▎     | 218/500 [00:19<00:21, 13.16it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  44%|████▎     | 218/500 [00:19<00:21, 13.16it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  44%|████▎     | 218/500 [00:19<00:21, 13.16it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  44%|████▍     | 220/500 [00:19<00:23, 12.10it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  44%|████▍     | 220/500 [00:19<00:23, 12.10it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  44%|████▍     | 220/500 [00:19<00:23, 12.10it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  44%|████▍     | 222/500 [00:19<00:20, 13.62it/s]
[TorchDR] DR Loss : 1.61e+01 | Grad norm : 6.45e-01 :  44%|████▍     | 222/500 [00:19<00:20, 13.62it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  44%|████▍     | 222/500 [00:19<00:20, 13.62it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  45%|████▍     | 224/500 [00:19<00:22, 12.36it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  45%|████▍     | 224/500 [00:19<00:22, 12.36it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  45%|████▍     | 224/500 [00:19<00:22, 12.36it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  45%|████▌     | 226/500 [00:19<00:19, 13.85it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  45%|████▌     | 226/500 [00:19<00:19, 13.85it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  45%|████▌     | 226/500 [00:19<00:19, 13.85it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  46%|████▌     | 228/500 [00:19<00:21, 12.51it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  46%|████▌     | 228/500 [00:19<00:21, 12.51it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  46%|████▌     | 228/500 [00:19<00:21, 12.51it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  46%|████▌     | 230/500 [00:19<00:19, 13.96it/s]
[TorchDR] DR Loss : 1.62e+01 | Grad norm : 6.45e-01 :  46%|████▌     | 230/500 [00:19<00:19, 13.96it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  46%|████▌     | 230/500 [00:20<00:19, 13.96it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  46%|████▋     | 232/500 [00:20<00:21, 12.56it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  46%|████▋     | 232/500 [00:20<00:21, 12.56it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  46%|████▋     | 232/500 [00:20<00:21, 12.56it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  47%|████▋     | 234/500 [00:20<00:18, 14.04it/s]
[TorchDR] DR Loss : 1.65e+01 | Grad norm : 6.45e-01 :  47%|████▋     | 234/500 [00:20<00:18, 14.04it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  47%|████▋     | 234/500 [00:20<00:18, 14.04it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  47%|████▋     | 236/500 [00:20<00:20, 12.60it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  47%|████▋     | 236/500 [00:20<00:20, 12.60it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  47%|████▋     | 236/500 [00:20<00:20, 12.60it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 238/500 [00:20<00:18, 14.06it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 238/500 [00:20<00:18, 14.06it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 238/500 [00:20<00:18, 14.06it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 240/500 [00:20<00:20, 12.63it/s]
[TorchDR] DR Loss : 1.66e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 240/500 [00:20<00:20, 12.63it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 240/500 [00:20<00:20, 12.63it/s]
[TorchDR] DR Loss : 1.63e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 242/500 [00:20<00:18, 14.09it/s]
[TorchDR] DR Loss : 1.64e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 242/500 [00:20<00:18, 14.09it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  48%|████▊     | 242/500 [00:20<00:18, 14.09it/s]
[TorchDR] DR Loss : 1.67e+01 | Grad norm : 6.45e-01 :  49%|████▉     | 244/500 [00:20<00:20, 12.62it/s]
[TorchDR] DR Loss : 1.62e+01 | Grad norm : 6.45e-01 :  49%|████▉     | 244/500 [00:21<00:20, 12.62it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  49%|████▉     | 244/500 [00:21<00:20, 12.62it/s]
[TorchDR] DR Loss : 1.68e+01 | Grad norm : 6.45e-01 :  49%|████▉     | 246/500 [00:21<00:18, 14.08it/s]
[TorchDR] DR Loss : 1.62e+01 | Grad norm : 6.45e-01 :  49%|████▉     | 246/500 [00:21<00:18, 14.08it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.45e-01 :  49%|████▉     | 246/500 [00:21<00:18, 14.08it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.45e-01 :  50%|████▉     | 248/500 [00:21<00:19, 12.61it/s]
[TorchDR] DR Loss : 1.61e+01 | Grad norm : 6.45e-01 :  50%|████▉     | 248/500 [00:21<00:19, 12.61it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.45e-01 :  50%|████▉     | 248/500 [00:21<00:19, 12.61it/s]
[TorchDR] DR Loss : 1.70e+01 | Grad norm : 6.45e-01 :  50%|█████     | 250/500 [00:21<00:21, 11.74it/s]
[TorchDR] DR Loss : 1.62e+01 | Grad norm : 6.42e-01 :  50%|█████     | 250/500 [00:21<00:21, 11.74it/s]
[TorchDR] DR Loss : 1.16e+01 | Grad norm : 6.42e-01 :  50%|█████     | 250/500 [00:21<00:21, 11.74it/s]
[TorchDR] DR Loss : 1.16e+01 | Grad norm : 6.42e-01 :  50%|█████     | 252/500 [00:21<00:18, 13.31it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 6.42e-01 :  50%|█████     | 252/500 [00:21<00:18, 13.31it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 6.42e-01 :  50%|█████     | 252/500 [00:21<00:18, 13.31it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 6.42e-01 :  51%|█████     | 254/500 [00:21<00:20, 12.14it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 6.42e-01 :  51%|█████     | 254/500 [00:21<00:20, 12.14it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  51%|█████     | 254/500 [00:21<00:20, 12.14it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  51%|█████     | 256/500 [00:21<00:17, 13.73it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  51%|█████     | 256/500 [00:21<00:17, 13.73it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  51%|█████     | 256/500 [00:22<00:17, 13.73it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 258/500 [00:22<00:19, 12.40it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 258/500 [00:22<00:19, 12.40it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 258/500 [00:22<00:19, 12.40it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 260/500 [00:22<00:17, 13.89it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 260/500 [00:22<00:17, 13.89it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 260/500 [00:22<00:17, 13.89it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 262/500 [00:22<00:19, 12.50it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 262/500 [00:22<00:19, 12.50it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  52%|█████▏    | 262/500 [00:22<00:19, 12.50it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  53%|█████▎    | 264/500 [00:22<00:16, 14.00it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  53%|█████▎    | 264/500 [00:22<00:16, 14.00it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  53%|█████▎    | 264/500 [00:22<00:16, 14.00it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  53%|█████▎    | 266/500 [00:22<00:18, 12.55it/s]
[TorchDR] DR Loss : 1.13e+01 | Grad norm : 6.42e-01 :  53%|█████▎    | 266/500 [00:22<00:18, 12.55it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  53%|█████▎    | 266/500 [00:22<00:18, 12.55it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▎    | 268/500 [00:22<00:19, 11.71it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▎    | 268/500 [00:22<00:19, 11.71it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▎    | 268/500 [00:23<00:19, 11.71it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▍    | 270/500 [00:23<00:17, 13.25it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▍    | 270/500 [00:23<00:17, 13.25it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▍    | 270/500 [00:23<00:17, 13.25it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▍    | 272/500 [00:23<00:18, 12.11it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▍    | 272/500 [00:23<00:18, 12.11it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  54%|█████▍    | 272/500 [00:23<00:18, 12.11it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  55%|█████▍    | 274/500 [00:23<00:19, 11.43it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  55%|█████▍    | 274/500 [00:23<00:19, 11.43it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  55%|█████▍    | 274/500 [00:23<00:19, 11.43it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  55%|█████▌    | 276/500 [00:23<00:17, 12.97it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  55%|█████▌    | 276/500 [00:23<00:17, 12.97it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  55%|█████▌    | 276/500 [00:23<00:17, 12.97it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▌    | 278/500 [00:23<00:18, 11.99it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▌    | 278/500 [00:23<00:18, 11.99it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▌    | 278/500 [00:23<00:18, 11.99it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▌    | 280/500 [00:23<00:18, 11.96it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▌    | 280/500 [00:23<00:18, 11.96it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▌    | 280/500 [00:24<00:18, 11.96it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▋    | 282/500 [00:24<00:17, 12.74it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▋    | 282/500 [00:24<00:17, 12.74it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  56%|█████▋    | 282/500 [00:24<00:17, 12.74it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  57%|█████▋    | 284/500 [00:24<00:18, 11.79it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  57%|█████▋    | 284/500 [00:24<00:18, 11.79it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  57%|█████▋    | 284/500 [00:24<00:18, 11.79it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  57%|█████▋    | 286/500 [00:24<00:19, 11.18it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  57%|█████▋    | 286/500 [00:24<00:19, 11.18it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  57%|█████▋    | 286/500 [00:24<00:19, 11.18it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 288/500 [00:24<00:19, 10.86it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 288/500 [00:24<00:19, 10.86it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 288/500 [00:24<00:19, 10.86it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 290/500 [00:24<00:16, 12.48it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 290/500 [00:24<00:16, 12.48it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 290/500 [00:24<00:16, 12.48it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 292/500 [00:24<00:17, 11.70it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 292/500 [00:24<00:17, 11.70it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  58%|█████▊    | 292/500 [00:25<00:17, 11.70it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  59%|█████▉    | 294/500 [00:25<00:18, 11.08it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  59%|█████▉    | 294/500 [00:25<00:18, 11.08it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  59%|█████▉    | 294/500 [00:25<00:18, 11.08it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  59%|█████▉    | 296/500 [00:25<00:18, 10.80it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  59%|█████▉    | 296/500 [00:25<00:18, 10.80it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  59%|█████▉    | 296/500 [00:25<00:18, 10.80it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  60%|█████▉    | 298/500 [00:25<00:16, 12.45it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  60%|█████▉    | 298/500 [00:25<00:16, 12.45it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  60%|█████▉    | 298/500 [00:25<00:16, 12.45it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 6.42e-01 :  60%|██████    | 300/500 [00:25<00:17, 11.67it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  60%|██████    | 300/500 [00:25<00:17, 11.67it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  60%|██████    | 300/500 [00:25<00:17, 11.67it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  60%|██████    | 302/500 [00:25<00:14, 13.22it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  60%|██████    | 302/500 [00:25<00:14, 13.22it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  60%|██████    | 302/500 [00:25<00:14, 13.22it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  61%|██████    | 304/500 [00:25<00:16, 12.12it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  61%|██████    | 304/500 [00:25<00:16, 12.12it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  61%|██████    | 304/500 [00:26<00:16, 12.12it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  61%|██████    | 306/500 [00:26<00:14, 13.62it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  61%|██████    | 306/500 [00:26<00:14, 13.62it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  61%|██████    | 306/500 [00:26<00:14, 13.62it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 308/500 [00:26<00:15, 12.33it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 308/500 [00:26<00:15, 12.33it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 308/500 [00:26<00:15, 12.33it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 310/500 [00:26<00:16, 11.58it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 310/500 [00:26<00:16, 11.58it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 310/500 [00:26<00:16, 11.58it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 312/500 [00:26<00:14, 13.16it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 312/500 [00:26<00:14, 13.16it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  62%|██████▏   | 312/500 [00:26<00:14, 13.16it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  63%|██████▎   | 314/500 [00:26<00:15, 12.03it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  63%|██████▎   | 314/500 [00:26<00:15, 12.03it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  63%|██████▎   | 314/500 [00:26<00:15, 12.03it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  63%|██████▎   | 316/500 [00:26<00:16, 11.38it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  63%|██████▎   | 316/500 [00:26<00:16, 11.38it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  63%|██████▎   | 316/500 [00:27<00:16, 11.38it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  64%|██████▎   | 318/500 [00:27<00:14, 13.00it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  64%|██████▎   | 318/500 [00:27<00:14, 13.00it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  64%|██████▎   | 318/500 [00:27<00:14, 13.00it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  64%|██████▍   | 320/500 [00:27<00:15, 11.93it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  64%|██████▍   | 320/500 [00:27<00:15, 11.93it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  64%|██████▍   | 320/500 [00:27<00:15, 11.93it/s]
[TorchDR] DR Loss : 1.12e+01 | Grad norm : 3.16e-03 :  64%|██████▍   | 322/500 [00:27<00:15, 11.28it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  64%|██████▍   | 322/500 [00:27<00:15, 11.28it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  64%|██████▍   | 322/500 [00:27<00:15, 11.28it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  65%|██████▍   | 324/500 [00:27<00:16, 10.87it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  65%|██████▍   | 324/500 [00:27<00:16, 10.87it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  65%|██████▍   | 324/500 [00:27<00:16, 10.87it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  65%|██████▌   | 326/500 [00:27<00:16, 10.64it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  65%|██████▌   | 326/500 [00:27<00:16, 10.64it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  65%|██████▌   | 326/500 [00:27<00:16, 10.64it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▌   | 328/500 [00:27<00:13, 12.29it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▌   | 328/500 [00:28<00:13, 12.29it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▌   | 328/500 [00:28<00:13, 12.29it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▌   | 330/500 [00:28<00:14, 11.53it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▌   | 330/500 [00:28<00:14, 11.53it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▌   | 330/500 [00:28<00:14, 11.53it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▋   | 332/500 [00:28<00:15, 11.06it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▋   | 332/500 [00:28<00:15, 11.06it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  66%|██████▋   | 332/500 [00:28<00:15, 11.06it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  67%|██████▋   | 334/500 [00:28<00:13, 12.69it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  67%|██████▋   | 334/500 [00:28<00:13, 12.69it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  67%|██████▋   | 334/500 [00:28<00:13, 12.69it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  67%|██████▋   | 336/500 [00:28<00:13, 11.78it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  67%|██████▋   | 336/500 [00:28<00:13, 11.78it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  67%|██████▋   | 336/500 [00:28<00:13, 11.78it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 338/500 [00:28<00:14, 11.24it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 338/500 [00:28<00:14, 11.24it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 338/500 [00:28<00:14, 11.24it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 340/500 [00:28<00:12, 12.82it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 340/500 [00:29<00:12, 12.82it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 340/500 [00:29<00:12, 12.82it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 342/500 [00:29<00:13, 11.87it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 342/500 [00:29<00:13, 11.87it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  68%|██████▊   | 342/500 [00:29<00:13, 11.87it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  69%|██████▉   | 344/500 [00:29<00:13, 11.28it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  69%|██████▉   | 344/500 [00:29<00:13, 11.28it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  69%|██████▉   | 344/500 [00:29<00:13, 11.28it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  69%|██████▉   | 346/500 [00:29<00:11, 12.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  69%|██████▉   | 346/500 [00:29<00:11, 12.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  69%|██████▉   | 346/500 [00:29<00:11, 12.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  70%|██████▉   | 348/500 [00:29<00:12, 11.91it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  70%|██████▉   | 348/500 [00:29<00:12, 11.91it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  70%|██████▉   | 348/500 [00:29<00:12, 11.91it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 3.16e-03 :  70%|███████   | 350/500 [00:29<00:13, 11.32it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  70%|███████   | 350/500 [00:29<00:13, 11.32it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  70%|███████   | 350/500 [00:29<00:13, 11.32it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  70%|███████   | 352/500 [00:29<00:11, 12.90it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  70%|███████   | 352/500 [00:29<00:11, 12.90it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  70%|███████   | 352/500 [00:30<00:11, 12.90it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  71%|███████   | 354/500 [00:30<00:12, 11.92it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  71%|███████   | 354/500 [00:30<00:12, 11.92it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  71%|███████   | 354/500 [00:30<00:12, 11.92it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  71%|███████   | 356/500 [00:30<00:10, 13.45it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  71%|███████   | 356/500 [00:30<00:10, 13.45it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  71%|███████   | 356/500 [00:30<00:10, 13.45it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 358/500 [00:30<00:11, 12.23it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 358/500 [00:30<00:11, 12.23it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 358/500 [00:30<00:11, 12.23it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 360/500 [00:30<00:12, 11.50it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 360/500 [00:30<00:12, 11.50it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 360/500 [00:30<00:12, 11.50it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 362/500 [00:30<00:10, 13.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 362/500 [00:30<00:10, 13.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  72%|███████▏  | 362/500 [00:30<00:10, 13.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  73%|███████▎  | 364/500 [00:30<00:11, 12.03it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  73%|███████▎  | 364/500 [00:30<00:11, 12.03it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  73%|███████▎  | 364/500 [00:31<00:11, 12.03it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  73%|███████▎  | 366/500 [00:31<00:11, 11.39it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  73%|███████▎  | 366/500 [00:31<00:11, 11.39it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  73%|███████▎  | 366/500 [00:31<00:11, 11.39it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▎  | 368/500 [00:31<00:10, 12.97it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▎  | 368/500 [00:31<00:10, 12.97it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▎  | 368/500 [00:31<00:10, 12.97it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▍  | 370/500 [00:31<00:10, 11.95it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▍  | 370/500 [00:31<00:10, 11.95it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▍  | 370/500 [00:31<00:10, 11.95it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▍  | 372/500 [00:31<00:09, 13.49it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▍  | 372/500 [00:31<00:09, 13.49it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  74%|███████▍  | 372/500 [00:31<00:09, 13.49it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  75%|███████▍  | 374/500 [00:31<00:10, 12.26it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  75%|███████▍  | 374/500 [00:31<00:10, 12.26it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  75%|███████▍  | 374/500 [00:31<00:10, 12.26it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  75%|███████▌  | 376/500 [00:31<00:10, 11.53it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  75%|███████▌  | 376/500 [00:31<00:10, 11.53it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  75%|███████▌  | 376/500 [00:32<00:10, 11.53it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▌  | 378/500 [00:32<00:09, 13.15it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▌  | 378/500 [00:32<00:09, 13.15it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▌  | 378/500 [00:32<00:09, 13.15it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▌  | 380/500 [00:32<00:09, 12.03it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▌  | 380/500 [00:32<00:09, 12.03it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▌  | 380/500 [00:32<00:09, 12.03it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▋  | 382/500 [00:32<00:10, 11.40it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▋  | 382/500 [00:32<00:10, 11.40it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  76%|███████▋  | 382/500 [00:32<00:10, 11.40it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  77%|███████▋  | 384/500 [00:32<00:08, 13.01it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  77%|███████▋  | 384/500 [00:32<00:08, 13.01it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  77%|███████▋  | 384/500 [00:32<00:08, 13.01it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  77%|███████▋  | 386/500 [00:32<00:07, 14.44it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  77%|███████▋  | 386/500 [00:32<00:07, 14.44it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  77%|███████▋  | 386/500 [00:32<00:07, 14.44it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 388/500 [00:32<00:08, 12.83it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 388/500 [00:32<00:08, 12.83it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 388/500 [00:32<00:08, 12.83it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 390/500 [00:32<00:07, 14.25it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 390/500 [00:33<00:07, 14.25it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 390/500 [00:33<00:07, 14.25it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 392/500 [00:33<00:08, 12.74it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 392/500 [00:33<00:08, 12.74it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  78%|███████▊  | 392/500 [00:33<00:08, 12.74it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  79%|███████▉  | 394/500 [00:33<00:08, 11.79it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  79%|███████▉  | 394/500 [00:33<00:08, 11.79it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  79%|███████▉  | 394/500 [00:33<00:08, 11.79it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  79%|███████▉  | 396/500 [00:33<00:09, 11.20it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  79%|███████▉  | 396/500 [00:33<00:09, 11.20it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  79%|███████▉  | 396/500 [00:33<00:09, 11.20it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  80%|███████▉  | 398/500 [00:33<00:08, 12.71it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  80%|███████▉  | 398/500 [00:33<00:08, 12.71it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  80%|███████▉  | 398/500 [00:33<00:08, 12.71it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 2.25e-03 :  80%|████████  | 400/500 [00:33<00:08, 11.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  80%|████████  | 400/500 [00:33<00:08, 11.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  80%|████████  | 400/500 [00:33<00:08, 11.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  80%|████████  | 402/500 [00:33<00:08, 11.30it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  80%|████████  | 402/500 [00:34<00:08, 11.30it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  80%|████████  | 402/500 [00:34<00:08, 11.30it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  81%|████████  | 404/500 [00:34<00:07, 12.91it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  81%|████████  | 404/500 [00:34<00:07, 12.91it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  81%|████████  | 404/500 [00:34<00:07, 12.91it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  81%|████████  | 406/500 [00:34<00:06, 14.35it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  81%|████████  | 406/500 [00:34<00:06, 14.35it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  81%|████████  | 406/500 [00:34<00:06, 14.35it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 408/500 [00:34<00:07, 12.80it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 408/500 [00:34<00:07, 12.80it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 408/500 [00:34<00:07, 12.80it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 410/500 [00:34<00:06, 14.26it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 410/500 [00:34<00:06, 14.26it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 410/500 [00:34<00:06, 14.26it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 412/500 [00:34<00:06, 12.72it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 412/500 [00:34<00:06, 12.72it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  82%|████████▏ | 412/500 [00:34<00:06, 12.72it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  83%|████████▎ | 414/500 [00:34<00:06, 14.20it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  83%|████████▎ | 414/500 [00:34<00:06, 14.20it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  83%|████████▎ | 414/500 [00:34<00:06, 14.20it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  83%|████████▎ | 416/500 [00:34<00:06, 12.65it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  83%|████████▎ | 416/500 [00:35<00:06, 12.65it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  83%|████████▎ | 416/500 [00:35<00:06, 12.65it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▎ | 418/500 [00:35<00:05, 14.16it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▎ | 418/500 [00:35<00:05, 14.16it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▎ | 418/500 [00:35<00:05, 14.16it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▍ | 420/500 [00:35<00:06, 12.66it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▍ | 420/500 [00:35<00:06, 12.66it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▍ | 420/500 [00:35<00:06, 12.66it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▍ | 422/500 [00:35<00:05, 14.15it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▍ | 422/500 [00:35<00:05, 14.15it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  84%|████████▍ | 422/500 [00:35<00:05, 14.15it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  85%|████████▍ | 424/500 [00:35<00:06, 12.66it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  85%|████████▍ | 424/500 [00:35<00:06, 12.66it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  85%|████████▍ | 424/500 [00:35<00:06, 12.66it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  85%|████████▌ | 426/500 [00:35<00:05, 14.13it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  85%|████████▌ | 426/500 [00:35<00:05, 14.13it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  85%|████████▌ | 426/500 [00:35<00:05, 14.13it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▌ | 428/500 [00:35<00:05, 12.65it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▌ | 428/500 [00:35<00:05, 12.65it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▌ | 428/500 [00:36<00:05, 12.65it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▌ | 430/500 [00:36<00:04, 14.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▌ | 430/500 [00:36<00:04, 14.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▌ | 430/500 [00:36<00:04, 14.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▋ | 432/500 [00:36<00:05, 12.62it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▋ | 432/500 [00:36<00:05, 12.62it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  86%|████████▋ | 432/500 [00:36<00:05, 12.62it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  87%|████████▋ | 434/500 [00:36<00:04, 14.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  87%|████████▋ | 434/500 [00:36<00:04, 14.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  87%|████████▋ | 434/500 [00:36<00:04, 14.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  87%|████████▋ | 436/500 [00:36<00:05, 12.63it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  87%|████████▋ | 436/500 [00:36<00:05, 12.63it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  87%|████████▋ | 436/500 [00:36<00:05, 12.63it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 438/500 [00:36<00:04, 14.08it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 438/500 [00:36<00:04, 14.08it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 438/500 [00:36<00:04, 14.08it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 440/500 [00:36<00:04, 12.52it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 440/500 [00:36<00:04, 12.52it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 440/500 [00:37<00:04, 12.52it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 442/500 [00:37<00:04, 11.70it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 442/500 [00:37<00:04, 11.70it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  88%|████████▊ | 442/500 [00:37<00:04, 11.70it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  89%|████████▉ | 444/500 [00:37<00:05, 11.14it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  89%|████████▉ | 444/500 [00:37<00:05, 11.14it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  89%|████████▉ | 444/500 [00:37<00:05, 11.14it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  89%|████████▉ | 446/500 [00:37<00:04, 10.81it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  89%|████████▉ | 446/500 [00:37<00:04, 10.81it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  89%|████████▉ | 446/500 [00:37<00:04, 10.81it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  90%|████████▉ | 448/500 [00:37<00:04, 12.44it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  90%|████████▉ | 448/500 [00:37<00:04, 12.44it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  90%|████████▉ | 448/500 [00:37<00:04, 12.44it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.93e-03 :  90%|█████████ | 450/500 [00:37<00:04, 11.63it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  90%|█████████ | 450/500 [00:37<00:04, 11.63it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  90%|█████████ | 450/500 [00:37<00:04, 11.63it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  90%|█████████ | 452/500 [00:37<00:04, 11.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  90%|█████████ | 452/500 [00:37<00:04, 11.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  90%|█████████ | 452/500 [00:38<00:04, 11.11it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  91%|█████████ | 454/500 [00:38<00:04, 10.77it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  91%|█████████ | 454/500 [00:38<00:04, 10.77it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  91%|█████████ | 454/500 [00:38<00:04, 10.77it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  91%|█████████ | 456/500 [00:38<00:03, 11.05it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  91%|█████████ | 456/500 [00:38<00:03, 11.05it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  91%|█████████ | 456/500 [00:38<00:03, 11.05it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 458/500 [00:38<00:03, 12.00it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 458/500 [00:38<00:03, 12.00it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 458/500 [00:38<00:03, 12.00it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 460/500 [00:38<00:03, 11.35it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 460/500 [00:38<00:03, 11.35it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 460/500 [00:38<00:03, 11.35it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 462/500 [00:38<00:03, 10.93it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 462/500 [00:38<00:03, 10.93it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  92%|█████████▏| 462/500 [00:38<00:03, 10.93it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  93%|█████████▎| 464/500 [00:38<00:02, 12.59it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  93%|█████████▎| 464/500 [00:39<00:02, 12.59it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  93%|█████████▎| 464/500 [00:39<00:02, 12.59it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  93%|█████████▎| 466/500 [00:39<00:02, 11.78it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  93%|█████████▎| 466/500 [00:39<00:02, 11.78it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  93%|█████████▎| 466/500 [00:39<00:02, 11.78it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▎| 468/500 [00:39<00:02, 13.33it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▎| 468/500 [00:39<00:02, 13.33it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▎| 468/500 [00:39<00:02, 13.33it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▍| 470/500 [00:39<00:02, 12.17it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▍| 470/500 [00:39<00:02, 12.17it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▍| 470/500 [00:39<00:02, 12.17it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▍| 472/500 [00:39<00:02, 11.39it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▍| 472/500 [00:39<00:02, 11.39it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  94%|█████████▍| 472/500 [00:39<00:02, 11.39it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  95%|█████████▍| 474/500 [00:39<00:02, 10.98it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  95%|█████████▍| 474/500 [00:39<00:02, 10.98it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  95%|█████████▍| 474/500 [00:39<00:02, 10.98it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  95%|█████████▌| 476/500 [00:39<00:01, 12.60it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  95%|█████████▌| 476/500 [00:40<00:01, 12.60it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  95%|█████████▌| 476/500 [00:40<00:01, 12.60it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▌| 478/500 [00:40<00:01, 11.73it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▌| 478/500 [00:40<00:01, 11.73it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▌| 478/500 [00:40<00:01, 11.73it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▌| 480/500 [00:40<00:01, 11.18it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▌| 480/500 [00:40<00:01, 11.18it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▌| 480/500 [00:40<00:01, 11.18it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▋| 482/500 [00:40<00:01, 12.80it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▋| 482/500 [00:40<00:01, 12.80it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  96%|█████████▋| 482/500 [00:40<00:01, 12.80it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  97%|█████████▋| 484/500 [00:40<00:01, 11.86it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  97%|█████████▋| 484/500 [00:40<00:01, 11.86it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  97%|█████████▋| 484/500 [00:40<00:01, 11.86it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  97%|█████████▋| 486/500 [00:40<00:01, 13.42it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  97%|█████████▋| 486/500 [00:40<00:01, 13.42it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  97%|█████████▋| 486/500 [00:40<00:01, 13.42it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 488/500 [00:40<00:00, 12.22it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 488/500 [00:40<00:00, 12.22it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 488/500 [00:41<00:00, 12.22it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 490/500 [00:41<00:00, 12.13it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 490/500 [00:41<00:00, 12.13it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 490/500 [00:41<00:00, 12.13it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 492/500 [00:41<00:00, 12.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 492/500 [00:41<00:00, 12.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  98%|█████████▊| 492/500 [00:41<00:00, 12.88it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  99%|█████████▉| 494/500 [00:41<00:00, 11.93it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  99%|█████████▉| 494/500 [00:41<00:00, 11.93it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  99%|█████████▉| 494/500 [00:41<00:00, 11.93it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  99%|█████████▉| 496/500 [00:41<00:00, 13.48it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  99%|█████████▉| 496/500 [00:41<00:00, 13.48it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 :  99%|█████████▉| 496/500 [00:41<00:00, 13.48it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 : 100%|█████████▉| 498/500 [00:41<00:00, 12.25it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 : 100%|█████████▉| 498/500 [00:41<00:00, 12.25it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 : 100%|█████████▉| 498/500 [00:41<00:00, 12.25it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 : 100%|██████████| 500/500 [00:41<00:00, 13.79it/s]
[TorchDR] DR Loss : 1.11e+01 | Grad norm : 1.72e-03 : 100%|██████████| 500/500 [00:41<00:00, 11.96it/s]
Random state is None
[TorchDR] Initializing DR model COSNE.
[TorchDR] Affinity : computing the Entropic Affinity matrix.
[TorchDR] Affinity : sparsity mode enabled, computing 90 nearest neighbors. If this step is too slow, consider reducing the dimensionality of the data or disabling sparsity.

  0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  5.27e-01 (std =  5.68e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  3.72e-01 (std =  5.93e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  2.42e-01 (std =  5.54e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  1.48e-01 (std =  4.64e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  8.68e-02 (std =  3.58e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  8.68e-02 (std =  3.58e-02) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  5.01e-02 (std =  2.60e-02) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  2.88e-02 (std =  1.82e-02) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  1.66e-02 (std =  1.24e-02) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  9.57e-03 (std =  8.43e-03) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  5.58e-03 (std =  5.69e-03) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  3.28e-03 (std =  3.84e-03) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  1.95e-03 (std =  2.60e-03) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  1.17e-03 (std =  1.76e-03) :   5%|▌         | 5/100 [00:00<00:01, 49.97it/s]
[TorchDR] Root search : mean abs value =  1.17e-03 (std =  1.76e-03) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  7.05e-04 (std =  1.20e-03) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  4.30e-04 (std =  8.23e-04) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  2.64e-04 (std =  5.66e-04) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  1.64e-04 (std =  3.92e-04) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  1.02e-04 (std =  2.72e-04) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  6.47e-05 (std =  1.90e-04) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  4.11e-05 (std =  1.33e-04) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  2.63e-05 (std =  9.35e-05) :  13%|█▎        | 13/100 [00:00<00:01, 67.60it/s]
[TorchDR] Root search : mean abs value =  2.63e-05 (std =  9.35e-05) :  21%|██        | 21/100 [00:00<00:01, 73.07it/s]
[TorchDR] Root search : mean abs value =  1.70e-05 (std =  6.59e-05) :  21%|██        | 21/100 [00:00<00:01, 73.07it/s]
[TorchDR] Root search : mean abs value =  1.11e-05 (std =  4.65e-05) :  21%|██        | 21/100 [00:00<00:01, 73.07it/s]
[TorchDR] Root search : mean abs value =  1.11e-05 (std =  4.65e-05) :  23%|██▎       | 23/100 [00:00<00:01, 58.00it/s]

  0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 3.32e+01 | Grad norm : 2.69e+00 :   0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 3.32e+01 | Grad norm : 2.69e+00 :   0%|          | 1/500 [00:00<02:29,  3.33it/s]
[TorchDR] DR Loss : 3.16e+01 | Grad norm : 2.69e+00 :   0%|          | 1/500 [00:00<02:29,  3.33it/s]
[TorchDR] DR Loss : 3.16e+01 | Grad norm : 2.69e+00 :   0%|          | 2/500 [00:00<02:29,  3.34it/s]
[TorchDR] DR Loss : 3.00e+01 | Grad norm : 2.69e+00 :   0%|          | 2/500 [00:00<02:29,  3.34it/s]
[TorchDR] DR Loss : 3.00e+01 | Grad norm : 2.69e+00 :   1%|          | 3/500 [00:00<02:07,  3.91it/s]
[TorchDR] DR Loss : 2.83e+01 | Grad norm : 2.69e+00 :   1%|          | 3/500 [00:01<02:07,  3.91it/s]
[TorchDR] DR Loss : 2.83e+01 | Grad norm : 2.69e+00 :   1%|          | 4/500 [00:01<02:14,  3.68it/s]
[TorchDR] DR Loss : 2.66e+01 | Grad norm : 2.69e+00 :   1%|          | 4/500 [00:01<02:14,  3.68it/s]
[TorchDR] DR Loss : 2.66e+01 | Grad norm : 2.69e+00 :   1%|          | 5/500 [00:01<02:19,  3.55it/s]
[TorchDR] DR Loss : 2.49e+01 | Grad norm : 2.69e+00 :   1%|          | 5/500 [00:01<02:19,  3.55it/s]
[TorchDR] DR Loss : 2.49e+01 | Grad norm : 2.69e+00 :   1%|          | 6/500 [00:01<02:06,  3.90it/s]
[TorchDR] DR Loss : 2.32e+01 | Grad norm : 2.69e+00 :   1%|          | 6/500 [00:01<02:06,  3.90it/s]
[TorchDR] DR Loss : 2.32e+01 | Grad norm : 2.69e+00 :   1%|▏         | 7/500 [00:01<02:12,  3.72it/s]
[TorchDR] DR Loss : 2.16e+01 | Grad norm : 2.69e+00 :   1%|▏         | 7/500 [00:02<02:12,  3.72it/s]
[TorchDR] DR Loss : 2.16e+01 | Grad norm : 2.69e+00 :   2%|▏         | 8/500 [00:02<02:17,  3.59it/s]
[TorchDR] DR Loss : 1.99e+01 | Grad norm : 2.69e+00 :   2%|▏         | 8/500 [00:02<02:17,  3.59it/s]
[TorchDR] DR Loss : 1.99e+01 | Grad norm : 2.69e+00 :   2%|▏         | 9/500 [00:02<02:14,  3.64it/s]
[TorchDR] DR Loss : 1.84e+01 | Grad norm : 2.69e+00 :   2%|▏         | 9/500 [00:02<02:14,  3.64it/s]
[TorchDR] DR Loss : 1.84e+01 | Grad norm : 2.69e+00 :   2%|▏         | 10/500 [00:02<02:09,  3.80it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 2.69e+00 :   2%|▏         | 10/500 [00:02<02:09,  3.80it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 2.69e+00 :   2%|▏         | 11/500 [00:02<01:59,  4.10it/s]
[TorchDR] DR Loss : 1.56e+01 | Grad norm : 2.69e+00 :   2%|▏         | 11/500 [00:03<01:59,  4.10it/s]
[TorchDR] DR Loss : 1.56e+01 | Grad norm : 2.69e+00 :   2%|▏         | 12/500 [00:03<02:06,  3.85it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 2.69e+00 :   2%|▏         | 12/500 [00:03<02:06,  3.85it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 2.69e+00 :   3%|▎         | 13/500 [00:03<02:12,  3.67it/s]
[TorchDR] DR Loss : 1.33e+01 | Grad norm : 2.69e+00 :   3%|▎         | 13/500 [00:03<02:12,  3.67it/s]
[TorchDR] DR Loss : 1.33e+01 | Grad norm : 2.69e+00 :   3%|▎         | 14/500 [00:03<02:16,  3.57it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 2.69e+00 :   3%|▎         | 14/500 [00:04<02:16,  3.57it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 2.69e+00 :   3%|▎         | 15/500 [00:04<02:18,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.69e+00 :   3%|▎         | 15/500 [00:04<02:18,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.69e+00 :   3%|▎         | 16/500 [00:04<02:06,  3.84it/s]
[TorchDR] DR Loss : 1.07e+01 | Grad norm : 2.69e+00 :   3%|▎         | 16/500 [00:04<02:06,  3.84it/s]
[TorchDR] DR Loss : 1.07e+01 | Grad norm : 2.69e+00 :   3%|▎         | 17/500 [00:04<02:11,  3.68it/s]
[TorchDR] DR Loss : 1.01e+01 | Grad norm : 2.69e+00 :   3%|▎         | 17/500 [00:04<02:11,  3.68it/s]
[TorchDR] DR Loss : 1.01e+01 | Grad norm : 2.69e+00 :   4%|▎         | 18/500 [00:04<02:01,  3.97it/s]
[TorchDR] DR Loss : 9.49e+00 | Grad norm : 2.69e+00 :   4%|▎         | 18/500 [00:05<02:01,  3.97it/s]
[TorchDR] DR Loss : 9.49e+00 | Grad norm : 2.69e+00 :   4%|▍         | 19/500 [00:05<02:07,  3.78it/s]
[TorchDR] DR Loss : 9.01e+00 | Grad norm : 2.69e+00 :   4%|▍         | 19/500 [00:05<02:07,  3.78it/s]
[TorchDR] DR Loss : 9.01e+00 | Grad norm : 2.69e+00 :   4%|▍         | 20/500 [00:05<02:12,  3.63it/s]
[TorchDR] DR Loss : 8.59e+00 | Grad norm : 2.69e+00 :   4%|▍         | 20/500 [00:05<02:12,  3.63it/s]
[TorchDR] DR Loss : 8.59e+00 | Grad norm : 2.69e+00 :   4%|▍         | 21/500 [00:05<02:15,  3.53it/s]
[TorchDR] DR Loss : 8.24e+00 | Grad norm : 2.69e+00 :   4%|▍         | 21/500 [00:06<02:15,  3.53it/s]
[TorchDR] DR Loss : 8.24e+00 | Grad norm : 2.69e+00 :   4%|▍         | 22/500 [00:06<02:31,  3.16it/s]
[TorchDR] DR Loss : 7.94e+00 | Grad norm : 2.69e+00 :   4%|▍         | 22/500 [00:06<02:31,  3.16it/s]
[TorchDR] DR Loss : 7.94e+00 | Grad norm : 2.69e+00 :   5%|▍         | 23/500 [00:06<02:15,  3.52it/s]
[TorchDR] DR Loss : 7.68e+00 | Grad norm : 2.69e+00 :   5%|▍         | 23/500 [00:06<02:15,  3.52it/s]
[TorchDR] DR Loss : 7.68e+00 | Grad norm : 2.69e+00 :   5%|▍         | 24/500 [00:06<02:16,  3.48it/s]
[TorchDR] DR Loss : 7.46e+00 | Grad norm : 2.69e+00 :   5%|▍         | 24/500 [00:06<02:16,  3.48it/s]
[TorchDR] DR Loss : 7.46e+00 | Grad norm : 2.69e+00 :   5%|▌         | 25/500 [00:06<02:18,  3.44it/s]
[TorchDR] DR Loss : 7.27e+00 | Grad norm : 2.69e+00 :   5%|▌         | 25/500 [00:07<02:18,  3.44it/s]
[TorchDR] DR Loss : 7.27e+00 | Grad norm : 2.69e+00 :   5%|▌         | 26/500 [00:07<02:05,  3.77it/s]
[TorchDR] DR Loss : 7.10e+00 | Grad norm : 2.69e+00 :   5%|▌         | 26/500 [00:07<02:05,  3.77it/s]
[TorchDR] DR Loss : 7.10e+00 | Grad norm : 2.69e+00 :   5%|▌         | 27/500 [00:07<02:23,  3.30it/s]
[TorchDR] DR Loss : 6.96e+00 | Grad norm : 2.69e+00 :   5%|▌         | 27/500 [00:07<02:23,  3.30it/s]
[TorchDR] DR Loss : 6.96e+00 | Grad norm : 2.69e+00 :   6%|▌         | 28/500 [00:07<02:09,  3.65it/s]
[TorchDR] DR Loss : 6.83e+00 | Grad norm : 2.69e+00 :   6%|▌         | 28/500 [00:07<02:09,  3.65it/s]
[TorchDR] DR Loss : 6.83e+00 | Grad norm : 2.69e+00 :   6%|▌         | 29/500 [00:07<01:58,  3.96it/s]
[TorchDR] DR Loss : 6.70e+00 | Grad norm : 2.69e+00 :   6%|▌         | 29/500 [00:08<01:58,  3.96it/s]
[TorchDR] DR Loss : 6.70e+00 | Grad norm : 2.69e+00 :   6%|▌         | 30/500 [00:08<02:04,  3.77it/s]
[TorchDR] DR Loss : 6.57e+00 | Grad norm : 2.69e+00 :   6%|▌         | 30/500 [00:08<02:04,  3.77it/s]
[TorchDR] DR Loss : 6.57e+00 | Grad norm : 2.69e+00 :   6%|▌         | 31/500 [00:08<02:08,  3.64it/s]
[TorchDR] DR Loss : 6.44e+00 | Grad norm : 2.69e+00 :   6%|▌         | 31/500 [00:08<02:08,  3.64it/s]
[TorchDR] DR Loss : 6.44e+00 | Grad norm : 2.69e+00 :   6%|▋         | 32/500 [00:08<01:58,  3.94it/s]
[TorchDR] DR Loss : 6.32e+00 | Grad norm : 2.69e+00 :   6%|▋         | 32/500 [00:08<01:58,  3.94it/s]
[TorchDR] DR Loss : 6.32e+00 | Grad norm : 2.69e+00 :   7%|▋         | 33/500 [00:08<01:51,  4.19it/s]
[TorchDR] DR Loss : 6.20e+00 | Grad norm : 2.69e+00 :   7%|▋         | 33/500 [00:09<01:51,  4.19it/s]
[TorchDR] DR Loss : 6.20e+00 | Grad norm : 2.69e+00 :   7%|▋         | 34/500 [00:09<01:45,  4.40it/s]
[TorchDR] DR Loss : 6.09e+00 | Grad norm : 2.69e+00 :   7%|▋         | 34/500 [00:09<01:45,  4.40it/s]
[TorchDR] DR Loss : 6.09e+00 | Grad norm : 2.69e+00 :   7%|▋         | 35/500 [00:09<01:55,  4.04it/s]
[TorchDR] DR Loss : 6.00e+00 | Grad norm : 2.69e+00 :   7%|▋         | 35/500 [00:09<01:55,  4.04it/s]
[TorchDR] DR Loss : 6.00e+00 | Grad norm : 2.69e+00 :   7%|▋         | 36/500 [00:09<01:49,  4.26it/s]
[TorchDR] DR Loss : 5.92e+00 | Grad norm : 2.69e+00 :   7%|▋         | 36/500 [00:09<01:49,  4.26it/s]
[TorchDR] DR Loss : 5.92e+00 | Grad norm : 2.69e+00 :   7%|▋         | 37/500 [00:09<01:57,  3.94it/s]
[TorchDR] DR Loss : 5.85e+00 | Grad norm : 2.69e+00 :   7%|▋         | 37/500 [00:10<01:57,  3.94it/s]
[TorchDR] DR Loss : 5.85e+00 | Grad norm : 2.69e+00 :   8%|▊         | 38/500 [00:10<02:31,  3.05it/s]
[TorchDR] DR Loss : 5.79e+00 | Grad norm : 2.69e+00 :   8%|▊         | 38/500 [00:10<02:31,  3.05it/s]
[TorchDR] DR Loss : 5.79e+00 | Grad norm : 2.69e+00 :   8%|▊         | 39/500 [00:10<02:50,  2.71it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 2.69e+00 :   8%|▊         | 39/500 [00:11<02:50,  2.71it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 2.69e+00 :   8%|▊         | 40/500 [00:11<02:45,  2.78it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 2.69e+00 :   8%|▊         | 40/500 [00:11<02:45,  2.78it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 2.69e+00 :   8%|▊         | 41/500 [00:11<03:13,  2.38it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 2.69e+00 :   8%|▊         | 41/500 [00:12<03:13,  2.38it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 2.69e+00 :   8%|▊         | 42/500 [00:12<03:14,  2.36it/s]
[TorchDR] DR Loss : 5.58e+00 | Grad norm : 2.69e+00 :   8%|▊         | 42/500 [00:12<03:14,  2.36it/s]
[TorchDR] DR Loss : 5.58e+00 | Grad norm : 2.69e+00 :   9%|▊         | 43/500 [00:12<03:10,  2.40it/s]
[TorchDR] DR Loss : 5.53e+00 | Grad norm : 2.69e+00 :   9%|▊         | 43/500 [00:12<03:10,  2.40it/s]
[TorchDR] DR Loss : 5.53e+00 | Grad norm : 2.69e+00 :   9%|▉         | 44/500 [00:12<02:54,  2.62it/s]
[TorchDR] DR Loss : 5.48e+00 | Grad norm : 2.69e+00 :   9%|▉         | 44/500 [00:13<02:54,  2.62it/s]
[TorchDR] DR Loss : 5.48e+00 | Grad norm : 2.69e+00 :   9%|▉         | 45/500 [00:13<02:42,  2.80it/s]
[TorchDR] DR Loss : 5.43e+00 | Grad norm : 2.69e+00 :   9%|▉         | 45/500 [00:13<02:42,  2.80it/s]
[TorchDR] DR Loss : 5.43e+00 | Grad norm : 2.69e+00 :   9%|▉         | 46/500 [00:13<02:34,  2.93it/s]
[TorchDR] DR Loss : 5.39e+00 | Grad norm : 2.69e+00 :   9%|▉         | 46/500 [00:13<02:34,  2.93it/s]
[TorchDR] DR Loss : 5.39e+00 | Grad norm : 2.69e+00 :   9%|▉         | 47/500 [00:13<02:28,  3.05it/s]
[TorchDR] DR Loss : 5.35e+00 | Grad norm : 2.69e+00 :   9%|▉         | 47/500 [00:14<02:28,  3.05it/s]
[TorchDR] DR Loss : 5.35e+00 | Grad norm : 2.69e+00 :  10%|▉         | 48/500 [00:14<02:24,  3.14it/s]
[TorchDR] DR Loss : 5.31e+00 | Grad norm : 2.69e+00 :  10%|▉         | 48/500 [00:14<02:24,  3.14it/s]
[TorchDR] DR Loss : 5.31e+00 | Grad norm : 2.69e+00 :  10%|▉         | 49/500 [00:14<02:21,  3.19it/s]
[TorchDR] DR Loss : 5.28e+00 | Grad norm : 2.69e+00 :  10%|▉         | 49/500 [00:14<02:21,  3.19it/s]
[TorchDR] DR Loss : 5.28e+00 | Grad norm : 2.69e+00 :  10%|█         | 50/500 [00:14<02:19,  3.22it/s]
[TorchDR] DR Loss : 5.25e+00 | Grad norm : 5.96e-01 :  10%|█         | 50/500 [00:15<02:19,  3.22it/s]
[TorchDR] DR Loss : 5.25e+00 | Grad norm : 5.96e-01 :  10%|█         | 51/500 [00:15<02:18,  3.25it/s]
[TorchDR] DR Loss : 5.21e+00 | Grad norm : 5.96e-01 :  10%|█         | 51/500 [00:15<02:18,  3.25it/s]
[TorchDR] DR Loss : 5.21e+00 | Grad norm : 5.96e-01 :  10%|█         | 52/500 [00:15<02:16,  3.27it/s]
[TorchDR] DR Loss : 5.18e+00 | Grad norm : 5.96e-01 :  10%|█         | 52/500 [00:15<02:16,  3.27it/s]
[TorchDR] DR Loss : 5.18e+00 | Grad norm : 5.96e-01 :  11%|█         | 53/500 [00:15<02:15,  3.29it/s]
[TorchDR] DR Loss : 5.14e+00 | Grad norm : 5.96e-01 :  11%|█         | 53/500 [00:15<02:15,  3.29it/s]
[TorchDR] DR Loss : 5.14e+00 | Grad norm : 5.96e-01 :  11%|█         | 54/500 [00:15<02:27,  3.02it/s]
[TorchDR] DR Loss : 5.11e+00 | Grad norm : 5.96e-01 :  11%|█         | 54/500 [00:16<02:27,  3.02it/s]
[TorchDR] DR Loss : 5.11e+00 | Grad norm : 5.96e-01 :  11%|█         | 55/500 [00:16<02:23,  3.09it/s]
[TorchDR] DR Loss : 5.08e+00 | Grad norm : 5.96e-01 :  11%|█         | 55/500 [00:16<02:23,  3.09it/s]
[TorchDR] DR Loss : 5.08e+00 | Grad norm : 5.96e-01 :  11%|█         | 56/500 [00:16<02:46,  2.66it/s]
[TorchDR] DR Loss : 5.05e+00 | Grad norm : 5.96e-01 :  11%|█         | 56/500 [00:17<02:46,  2.66it/s]
[TorchDR] DR Loss : 5.05e+00 | Grad norm : 5.96e-01 :  11%|█▏        | 57/500 [00:17<02:49,  2.62it/s]
[TorchDR] DR Loss : 5.02e+00 | Grad norm : 5.96e-01 :  11%|█▏        | 57/500 [00:17<02:49,  2.62it/s]
[TorchDR] DR Loss : 5.02e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 58/500 [00:17<02:33,  2.87it/s]
[TorchDR] DR Loss : 4.99e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 58/500 [00:17<02:33,  2.87it/s]
[TorchDR] DR Loss : 4.99e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 59/500 [00:17<02:45,  2.67it/s]
[TorchDR] DR Loss : 4.97e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 59/500 [00:18<02:45,  2.67it/s]
[TorchDR] DR Loss : 4.97e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 60/500 [00:18<02:35,  2.84it/s]
[TorchDR] DR Loss : 4.94e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 60/500 [00:18<02:35,  2.84it/s]
[TorchDR] DR Loss : 4.94e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 61/500 [00:18<02:53,  2.52it/s]
[TorchDR] DR Loss : 4.92e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 61/500 [00:18<02:53,  2.52it/s]
[TorchDR] DR Loss : 4.92e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 62/500 [00:18<02:40,  2.73it/s]
[TorchDR] DR Loss : 4.89e+00 | Grad norm : 5.96e-01 :  12%|█▏        | 62/500 [00:19<02:40,  2.73it/s]
[TorchDR] DR Loss : 4.89e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 63/500 [00:19<02:18,  3.15it/s]
[TorchDR] DR Loss : 4.86e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 63/500 [00:19<02:18,  3.15it/s]
[TorchDR] DR Loss : 4.86e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 64/500 [00:19<02:15,  3.21it/s]
[TorchDR] DR Loss : 4.84e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 64/500 [00:19<02:15,  3.21it/s]
[TorchDR] DR Loss : 4.84e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 65/500 [00:19<02:13,  3.25it/s]
[TorchDR] DR Loss : 4.81e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 65/500 [00:20<02:13,  3.25it/s]
[TorchDR] DR Loss : 4.81e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 66/500 [00:20<02:00,  3.61it/s]
[TorchDR] DR Loss : 4.79e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 66/500 [00:20<02:00,  3.61it/s]
[TorchDR] DR Loss : 4.79e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 67/500 [00:20<02:02,  3.54it/s]
[TorchDR] DR Loss : 4.77e+00 | Grad norm : 5.96e-01 :  13%|█▎        | 67/500 [00:20<02:02,  3.54it/s]
[TorchDR] DR Loss : 4.77e+00 | Grad norm : 5.96e-01 :  14%|█▎        | 68/500 [00:20<02:04,  3.48it/s]
[TorchDR] DR Loss : 4.75e+00 | Grad norm : 5.96e-01 :  14%|█▎        | 68/500 [00:20<02:04,  3.48it/s]
[TorchDR] DR Loss : 4.75e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 69/500 [00:20<01:53,  3.80it/s]
[TorchDR] DR Loss : 4.72e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 69/500 [00:21<01:53,  3.80it/s]
[TorchDR] DR Loss : 4.72e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 70/500 [00:21<01:57,  3.67it/s]
[TorchDR] DR Loss : 4.70e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 70/500 [00:21<01:57,  3.67it/s]
[TorchDR] DR Loss : 4.70e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 71/500 [00:21<01:48,  3.97it/s]
[TorchDR] DR Loss : 4.68e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 71/500 [00:21<01:48,  3.97it/s]
[TorchDR] DR Loss : 4.68e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 72/500 [00:21<01:41,  4.23it/s]
[TorchDR] DR Loss : 4.66e+00 | Grad norm : 5.96e-01 :  14%|█▍        | 72/500 [00:21<01:41,  4.23it/s]
[TorchDR] DR Loss : 4.66e+00 | Grad norm : 5.96e-01 :  15%|█▍        | 73/500 [00:21<01:48,  3.94it/s]
[TorchDR] DR Loss : 4.64e+00 | Grad norm : 5.96e-01 :  15%|█▍        | 73/500 [00:22<01:48,  3.94it/s]
[TorchDR] DR Loss : 4.64e+00 | Grad norm : 5.96e-01 :  15%|█▍        | 74/500 [00:22<01:53,  3.75it/s]
[TorchDR] DR Loss : 4.62e+00 | Grad norm : 5.96e-01 :  15%|█▍        | 74/500 [00:22<01:53,  3.75it/s]
[TorchDR] DR Loss : 4.62e+00 | Grad norm : 5.96e-01 :  15%|█▌        | 75/500 [00:22<01:45,  4.02it/s]
[TorchDR] DR Loss : 4.60e+00 | Grad norm : 5.96e-01 :  15%|█▌        | 75/500 [00:22<01:45,  4.02it/s]
[TorchDR] DR Loss : 4.60e+00 | Grad norm : 5.96e-01 :  15%|█▌        | 76/500 [00:22<01:51,  3.80it/s]
[TorchDR] DR Loss : 4.59e+00 | Grad norm : 5.96e-01 :  15%|█▌        | 76/500 [00:22<01:51,  3.80it/s]
[TorchDR] DR Loss : 4.59e+00 | Grad norm : 5.96e-01 :  15%|█▌        | 77/500 [00:22<01:55,  3.65it/s]
[TorchDR] DR Loss : 4.57e+00 | Grad norm : 5.96e-01 :  15%|█▌        | 77/500 [00:23<01:55,  3.65it/s]
[TorchDR] DR Loss : 4.57e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 78/500 [00:23<01:46,  3.96it/s]
[TorchDR] DR Loss : 4.55e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 78/500 [00:23<01:46,  3.96it/s]
[TorchDR] DR Loss : 4.55e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 79/500 [00:23<01:52,  3.75it/s]
[TorchDR] DR Loss : 4.53e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 79/500 [00:23<01:52,  3.75it/s]
[TorchDR] DR Loss : 4.53e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 80/500 [00:23<01:55,  3.63it/s]
[TorchDR] DR Loss : 4.52e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 80/500 [00:23<01:55,  3.63it/s]
[TorchDR] DR Loss : 4.52e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 81/500 [00:23<01:46,  3.93it/s]
[TorchDR] DR Loss : 4.50e+00 | Grad norm : 5.96e-01 :  16%|█▌        | 81/500 [00:24<01:46,  3.93it/s]
[TorchDR] DR Loss : 4.50e+00 | Grad norm : 5.96e-01 :  16%|█▋        | 82/500 [00:24<01:39,  4.19it/s]
[TorchDR] DR Loss : 4.49e+00 | Grad norm : 5.96e-01 :  16%|█▋        | 82/500 [00:24<01:39,  4.19it/s]
[TorchDR] DR Loss : 4.49e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 83/500 [00:24<01:46,  3.90it/s]
[TorchDR] DR Loss : 4.47e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 83/500 [00:24<01:46,  3.90it/s]
[TorchDR] DR Loss : 4.47e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 84/500 [00:24<01:52,  3.71it/s]
[TorchDR] DR Loss : 4.46e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 84/500 [00:24<01:52,  3.71it/s]
[TorchDR] DR Loss : 4.46e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 85/500 [00:24<01:43,  4.00it/s]
[TorchDR] DR Loss : 4.44e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 85/500 [00:25<01:43,  4.00it/s]
[TorchDR] DR Loss : 4.44e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 86/500 [00:25<01:49,  3.80it/s]
[TorchDR] DR Loss : 4.43e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 86/500 [00:25<01:49,  3.80it/s]
[TorchDR] DR Loss : 4.43e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 87/500 [00:25<01:41,  4.06it/s]
[TorchDR] DR Loss : 4.41e+00 | Grad norm : 5.96e-01 :  17%|█▋        | 87/500 [00:25<01:41,  4.06it/s]
[TorchDR] DR Loss : 4.41e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 88/500 [00:25<01:35,  4.31it/s]
[TorchDR] DR Loss : 4.40e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 88/500 [00:25<01:35,  4.31it/s]
[TorchDR] DR Loss : 4.40e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 89/500 [00:25<01:43,  3.96it/s]
[TorchDR] DR Loss : 4.39e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 89/500 [00:26<01:43,  3.96it/s]
[TorchDR] DR Loss : 4.39e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 90/500 [00:26<01:49,  3.76it/s]
[TorchDR] DR Loss : 4.37e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 90/500 [00:26<01:49,  3.76it/s]
[TorchDR] DR Loss : 4.37e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 91/500 [00:26<01:52,  3.63it/s]
[TorchDR] DR Loss : 4.36e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 91/500 [00:26<01:52,  3.63it/s]
[TorchDR] DR Loss : 4.36e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 92/500 [00:26<01:55,  3.54it/s]
[TorchDR] DR Loss : 4.35e+00 | Grad norm : 5.96e-01 :  18%|█▊        | 92/500 [00:27<01:55,  3.54it/s]
[TorchDR] DR Loss : 4.35e+00 | Grad norm : 5.96e-01 :  19%|█▊        | 93/500 [00:27<01:45,  3.85it/s]
[TorchDR] DR Loss : 4.34e+00 | Grad norm : 5.96e-01 :  19%|█▊        | 93/500 [00:27<01:45,  3.85it/s]
[TorchDR] DR Loss : 4.34e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 94/500 [00:27<01:49,  3.71it/s]
[TorchDR] DR Loss : 4.33e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 94/500 [00:27<01:49,  3.71it/s]
[TorchDR] DR Loss : 4.33e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 95/500 [00:27<01:53,  3.58it/s]
[TorchDR] DR Loss : 4.32e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 95/500 [00:27<01:53,  3.58it/s]
[TorchDR] DR Loss : 4.32e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 96/500 [00:27<01:55,  3.51it/s]
[TorchDR] DR Loss : 4.31e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 96/500 [00:28<01:55,  3.51it/s]
[TorchDR] DR Loss : 4.31e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 97/500 [00:28<01:56,  3.46it/s]
[TorchDR] DR Loss : 4.30e+00 | Grad norm : 5.96e-01 :  19%|█▉        | 97/500 [00:28<01:56,  3.46it/s]
[TorchDR] DR Loss : 4.30e+00 | Grad norm : 5.96e-01 :  20%|█▉        | 98/500 [00:28<01:45,  3.81it/s]
[TorchDR] DR Loss : 4.29e+00 | Grad norm : 5.96e-01 :  20%|█▉        | 98/500 [00:28<01:45,  3.81it/s]
[TorchDR] DR Loss : 4.29e+00 | Grad norm : 5.96e-01 :  20%|█▉        | 99/500 [00:28<01:38,  4.08it/s]
[TorchDR] DR Loss : 4.28e+00 | Grad norm : 5.96e-01 :  20%|█▉        | 99/500 [00:28<01:38,  4.08it/s]
[TorchDR] DR Loss : 4.28e+00 | Grad norm : 5.96e-01 :  20%|██        | 100/500 [00:28<01:44,  3.83it/s]
[TorchDR] DR Loss : 4.27e+00 | Grad norm : 1.13e-01 :  20%|██        | 100/500 [00:29<01:44,  3.83it/s]
[TorchDR] DR Loss : 4.27e+00 | Grad norm : 1.13e-01 :  20%|██        | 101/500 [00:29<01:49,  3.66it/s]
[TorchDR] DR Loss : 4.26e+00 | Grad norm : 1.13e-01 :  20%|██        | 101/500 [00:29<01:49,  3.66it/s]
[TorchDR] DR Loss : 4.26e+00 | Grad norm : 1.13e-01 :  20%|██        | 102/500 [00:29<01:51,  3.55it/s]
[TorchDR] DR Loss : 4.25e+00 | Grad norm : 1.13e-01 :  20%|██        | 102/500 [00:29<01:51,  3.55it/s]
[TorchDR] DR Loss : 4.25e+00 | Grad norm : 1.13e-01 :  21%|██        | 103/500 [00:29<01:53,  3.49it/s]
[TorchDR] DR Loss : 4.24e+00 | Grad norm : 1.13e-01 :  21%|██        | 103/500 [00:30<01:53,  3.49it/s]
[TorchDR] DR Loss : 4.24e+00 | Grad norm : 1.13e-01 :  21%|██        | 104/500 [00:30<02:06,  3.13it/s]
[TorchDR] DR Loss : 4.23e+00 | Grad norm : 1.13e-01 :  21%|██        | 104/500 [00:30<02:06,  3.13it/s]
[TorchDR] DR Loss : 4.23e+00 | Grad norm : 1.13e-01 :  21%|██        | 105/500 [00:30<01:52,  3.51it/s]
[TorchDR] DR Loss : 4.22e+00 | Grad norm : 1.13e-01 :  21%|██        | 105/500 [00:30<01:52,  3.51it/s]
[TorchDR] DR Loss : 4.22e+00 | Grad norm : 1.13e-01 :  21%|██        | 106/500 [00:30<01:53,  3.46it/s]
[TorchDR] DR Loss : 4.21e+00 | Grad norm : 1.13e-01 :  21%|██        | 106/500 [00:31<01:53,  3.46it/s]
[TorchDR] DR Loss : 4.21e+00 | Grad norm : 1.13e-01 :  21%|██▏       | 107/500 [00:31<01:55,  3.41it/s]
[TorchDR] DR Loss : 4.21e+00 | Grad norm : 1.13e-01 :  21%|██▏       | 107/500 [00:31<01:55,  3.41it/s]
[TorchDR] DR Loss : 4.21e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 108/500 [00:31<01:55,  3.39it/s]
[TorchDR] DR Loss : 4.20e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 108/500 [00:31<01:55,  3.39it/s]
[TorchDR] DR Loss : 4.20e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 109/500 [00:31<02:07,  3.07it/s]
[TorchDR] DR Loss : 4.19e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 109/500 [00:32<02:07,  3.07it/s]
[TorchDR] DR Loss : 4.19e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 110/500 [00:32<02:04,  3.13it/s]
[TorchDR] DR Loss : 4.19e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 110/500 [00:32<02:04,  3.13it/s]
[TorchDR] DR Loss : 4.19e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 111/500 [00:32<02:12,  2.93it/s]
[TorchDR] DR Loss : 4.18e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 111/500 [00:32<02:12,  2.93it/s]
[TorchDR] DR Loss : 4.18e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 112/500 [00:32<01:56,  3.32it/s]
[TorchDR] DR Loss : 4.17e+00 | Grad norm : 1.13e-01 :  22%|██▏       | 112/500 [00:32<01:56,  3.32it/s]
[TorchDR] DR Loss : 4.17e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 113/500 [00:32<01:44,  3.69it/s]
[TorchDR] DR Loss : 4.17e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 113/500 [00:33<01:44,  3.69it/s]
[TorchDR] DR Loss : 4.17e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 114/500 [00:33<01:58,  3.26it/s]
[TorchDR] DR Loss : 4.16e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 114/500 [00:33<01:58,  3.26it/s]
[TorchDR] DR Loss : 4.16e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 115/500 [00:33<01:46,  3.60it/s]
[TorchDR] DR Loss : 4.15e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 115/500 [00:33<01:46,  3.60it/s]
[TorchDR] DR Loss : 4.15e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 116/500 [00:33<01:48,  3.54it/s]
[TorchDR] DR Loss : 4.15e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 116/500 [00:33<01:48,  3.54it/s]
[TorchDR] DR Loss : 4.15e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 117/500 [00:33<01:49,  3.49it/s]
[TorchDR] DR Loss : 4.14e+00 | Grad norm : 1.13e-01 :  23%|██▎       | 117/500 [00:34<01:49,  3.49it/s]
[TorchDR] DR Loss : 4.14e+00 | Grad norm : 1.13e-01 :  24%|██▎       | 118/500 [00:34<01:40,  3.82it/s]
[TorchDR] DR Loss : 4.14e+00 | Grad norm : 1.13e-01 :  24%|██▎       | 118/500 [00:34<01:40,  3.82it/s]
[TorchDR] DR Loss : 4.14e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 119/500 [00:34<01:43,  3.67it/s]
[TorchDR] DR Loss : 4.13e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 119/500 [00:34<01:43,  3.67it/s]
[TorchDR] DR Loss : 4.13e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 120/500 [00:34<01:58,  3.21it/s]
[TorchDR] DR Loss : 4.13e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 120/500 [00:35<01:58,  3.21it/s]
[TorchDR] DR Loss : 4.13e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 121/500 [00:35<01:56,  3.25it/s]
[TorchDR] DR Loss : 4.12e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 121/500 [00:35<01:56,  3.25it/s]
[TorchDR] DR Loss : 4.12e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 122/500 [00:35<01:55,  3.27it/s]
[TorchDR] DR Loss : 4.11e+00 | Grad norm : 1.13e-01 :  24%|██▍       | 122/500 [00:35<01:55,  3.27it/s]
[TorchDR] DR Loss : 4.11e+00 | Grad norm : 1.13e-01 :  25%|██▍       | 123/500 [00:35<01:54,  3.29it/s]
[TorchDR] DR Loss : 4.11e+00 | Grad norm : 1.13e-01 :  25%|██▍       | 123/500 [00:36<01:54,  3.29it/s]
[TorchDR] DR Loss : 4.11e+00 | Grad norm : 1.13e-01 :  25%|██▍       | 124/500 [00:36<01:54,  3.29it/s]
[TorchDR] DR Loss : 4.11e+00 | Grad norm : 1.13e-01 :  25%|██▍       | 124/500 [00:36<01:54,  3.29it/s]
[TorchDR] DR Loss : 4.11e+00 | Grad norm : 1.13e-01 :  25%|██▌       | 125/500 [00:36<01:53,  3.31it/s]
[TorchDR] DR Loss : 4.10e+00 | Grad norm : 1.13e-01 :  25%|██▌       | 125/500 [00:36<01:53,  3.31it/s]
[TorchDR] DR Loss : 4.10e+00 | Grad norm : 1.13e-01 :  25%|██▌       | 126/500 [00:36<01:52,  3.33it/s]
[TorchDR] DR Loss : 4.10e+00 | Grad norm : 1.13e-01 :  25%|██▌       | 126/500 [00:36<01:52,  3.33it/s]
[TorchDR] DR Loss : 4.10e+00 | Grad norm : 1.13e-01 :  25%|██▌       | 127/500 [00:36<01:41,  3.68it/s]
[TorchDR] DR Loss : 4.09e+00 | Grad norm : 1.13e-01 :  25%|██▌       | 127/500 [00:37<01:41,  3.68it/s]
[TorchDR] DR Loss : 4.09e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 128/500 [00:37<01:43,  3.59it/s]
[TorchDR] DR Loss : 4.09e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 128/500 [00:37<01:43,  3.59it/s]
[TorchDR] DR Loss : 4.09e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 129/500 [00:37<01:35,  3.90it/s]
[TorchDR] DR Loss : 4.08e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 129/500 [00:37<01:35,  3.90it/s]
[TorchDR] DR Loss : 4.08e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 130/500 [00:37<01:39,  3.73it/s]
[TorchDR] DR Loss : 4.08e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 130/500 [00:37<01:39,  3.73it/s]
[TorchDR] DR Loss : 4.08e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 131/500 [00:37<01:32,  4.00it/s]
[TorchDR] DR Loss : 4.07e+00 | Grad norm : 1.13e-01 :  26%|██▌       | 131/500 [00:38<01:32,  4.00it/s]
[TorchDR] DR Loss : 4.07e+00 | Grad norm : 1.13e-01 :  26%|██▋       | 132/500 [00:38<01:37,  3.79it/s]
[TorchDR] DR Loss : 4.07e+00 | Grad norm : 1.13e-01 :  26%|██▋       | 132/500 [00:38<01:37,  3.79it/s]
[TorchDR] DR Loss : 4.07e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 133/500 [00:38<01:40,  3.66it/s]
[TorchDR] DR Loss : 4.07e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 133/500 [00:38<01:40,  3.66it/s]
[TorchDR] DR Loss : 4.07e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 134/500 [00:38<01:32,  3.96it/s]
[TorchDR] DR Loss : 4.06e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 134/500 [00:39<01:32,  3.96it/s]
[TorchDR] DR Loss : 4.06e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 135/500 [00:39<01:37,  3.74it/s]
[TorchDR] DR Loss : 4.06e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 135/500 [00:39<01:37,  3.74it/s]
[TorchDR] DR Loss : 4.06e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 136/500 [00:39<01:30,  4.04it/s]
[TorchDR] DR Loss : 4.06e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 136/500 [00:39<01:30,  4.04it/s]
[TorchDR] DR Loss : 4.06e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 137/500 [00:39<01:56,  3.11it/s]
[TorchDR] DR Loss : 4.05e+00 | Grad norm : 1.13e-01 :  27%|██▋       | 137/500 [00:39<01:56,  3.11it/s]
[TorchDR] DR Loss : 4.05e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 138/500 [00:39<01:50,  3.27it/s]
[TorchDR] DR Loss : 4.05e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 138/500 [00:40<01:50,  3.27it/s]
[TorchDR] DR Loss : 4.05e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 139/500 [00:40<01:32,  3.90it/s]
[TorchDR] DR Loss : 4.05e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 139/500 [00:40<01:32,  3.90it/s]
[TorchDR] DR Loss : 4.05e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 140/500 [00:40<01:46,  3.38it/s]
[TorchDR] DR Loss : 4.04e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 140/500 [00:40<01:46,  3.38it/s]
[TorchDR] DR Loss : 4.04e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 141/500 [00:40<01:47,  3.35it/s]
[TorchDR] DR Loss : 4.04e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 141/500 [00:41<01:47,  3.35it/s]
[TorchDR] DR Loss : 4.04e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 142/500 [00:41<01:58,  3.03it/s]
[TorchDR] DR Loss : 4.04e+00 | Grad norm : 1.13e-01 :  28%|██▊       | 142/500 [00:41<01:58,  3.03it/s]
[TorchDR] DR Loss : 4.04e+00 | Grad norm : 1.13e-01 :  29%|██▊       | 143/500 [00:41<02:04,  2.87it/s]
[TorchDR] DR Loss : 4.03e+00 | Grad norm : 1.13e-01 :  29%|██▊       | 143/500 [00:41<02:04,  2.87it/s]
[TorchDR] DR Loss : 4.03e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 144/500 [00:41<01:49,  3.26it/s]
[TorchDR] DR Loss : 4.03e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 144/500 [00:42<01:49,  3.26it/s]
[TorchDR] DR Loss : 4.03e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 145/500 [00:42<01:47,  3.30it/s]
[TorchDR] DR Loss : 4.03e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 145/500 [00:42<01:47,  3.30it/s]
[TorchDR] DR Loss : 4.03e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 146/500 [00:42<01:36,  3.66it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 146/500 [00:42<01:36,  3.66it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 147/500 [00:42<01:38,  3.57it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  29%|██▉       | 147/500 [00:42<01:38,  3.57it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  30%|██▉       | 148/500 [00:42<01:30,  3.90it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  30%|██▉       | 148/500 [00:43<01:30,  3.90it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  30%|██▉       | 149/500 [00:43<01:34,  3.71it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  30%|██▉       | 149/500 [00:43<01:34,  3.71it/s]
[TorchDR] DR Loss : 4.02e+00 | Grad norm : 1.13e-01 :  30%|███       | 150/500 [00:43<01:37,  3.59it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  30%|███       | 150/500 [00:43<01:37,  3.59it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  30%|███       | 151/500 [00:43<01:39,  3.51it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  30%|███       | 151/500 [00:43<01:39,  3.51it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  30%|███       | 152/500 [00:43<01:40,  3.46it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  30%|███       | 152/500 [00:44<01:40,  3.46it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  31%|███       | 153/500 [00:44<01:31,  3.79it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  31%|███       | 153/500 [00:44<01:31,  3.79it/s]
[TorchDR] DR Loss : 4.01e+00 | Grad norm : 6.01e-02 :  31%|███       | 154/500 [00:44<01:34,  3.66it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  31%|███       | 154/500 [00:44<01:34,  3.66it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  31%|███       | 155/500 [00:44<01:26,  3.97it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  31%|███       | 155/500 [00:45<01:26,  3.97it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  31%|███       | 156/500 [00:45<01:31,  3.75it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  31%|███       | 156/500 [00:45<01:31,  3.75it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  31%|███▏      | 157/500 [00:45<01:34,  3.62it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  31%|███▏      | 157/500 [00:45<01:34,  3.62it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 158/500 [00:45<01:27,  3.92it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 158/500 [00:45<01:27,  3.92it/s]
[TorchDR] DR Loss : 4.00e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 159/500 [00:45<01:30,  3.75it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 159/500 [00:46<01:30,  3.75it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 160/500 [00:46<01:44,  3.26it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 160/500 [00:46<01:44,  3.26it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 161/500 [00:46<01:53,  2.98it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 161/500 [00:47<01:53,  2.98it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 162/500 [00:47<02:09,  2.61it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  32%|███▏      | 162/500 [00:47<02:09,  2.61it/s]
[TorchDR] DR Loss : 3.99e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 163/500 [00:47<02:00,  2.79it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 163/500 [00:47<02:00,  2.79it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 164/500 [00:47<01:44,  3.21it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 164/500 [00:47<01:44,  3.21it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 165/500 [00:47<01:43,  3.24it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 165/500 [00:48<01:43,  3.24it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 166/500 [00:48<01:42,  3.27it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 166/500 [00:48<01:42,  3.27it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 167/500 [00:48<01:40,  3.30it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  33%|███▎      | 167/500 [00:48<01:40,  3.30it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  34%|███▎      | 168/500 [00:48<01:30,  3.66it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  34%|███▎      | 168/500 [00:48<01:30,  3.66it/s]
[TorchDR] DR Loss : 3.98e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 169/500 [00:48<01:23,  3.96it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 169/500 [00:49<01:23,  3.96it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 170/500 [00:49<01:27,  3.77it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 170/500 [00:49<01:27,  3.77it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 171/500 [00:49<01:30,  3.63it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 171/500 [00:49<01:30,  3.63it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 172/500 [00:49<01:23,  3.94it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  34%|███▍      | 172/500 [00:49<01:23,  3.94it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  35%|███▍      | 173/500 [00:49<01:26,  3.76it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  35%|███▍      | 173/500 [00:50<01:26,  3.76it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  35%|███▍      | 174/500 [00:50<01:20,  4.03it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  35%|███▍      | 174/500 [00:50<01:20,  4.03it/s]
[TorchDR] DR Loss : 3.97e+00 | Grad norm : 6.01e-02 :  35%|███▌      | 175/500 [00:50<01:25,  3.79it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  35%|███▌      | 175/500 [00:50<01:25,  3.79it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  35%|███▌      | 176/500 [00:50<01:28,  3.67it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  35%|███▌      | 176/500 [00:51<01:28,  3.67it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  35%|███▌      | 177/500 [00:51<01:30,  3.56it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  35%|███▌      | 177/500 [00:51<01:30,  3.56it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 178/500 [00:51<01:32,  3.50it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 178/500 [00:51<01:32,  3.50it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 179/500 [00:51<01:23,  3.84it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 179/500 [00:51<01:23,  3.84it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 180/500 [00:51<01:17,  4.11it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 180/500 [00:52<01:17,  4.11it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 181/500 [00:52<01:23,  3.84it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▌      | 181/500 [00:52<01:23,  3.84it/s]
[TorchDR] DR Loss : 3.96e+00 | Grad norm : 6.01e-02 :  36%|███▋      | 182/500 [00:52<01:26,  3.68it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  36%|███▋      | 182/500 [00:52<01:26,  3.68it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 183/500 [00:52<01:28,  3.56it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 183/500 [00:52<01:28,  3.56it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 184/500 [00:52<01:30,  3.50it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 184/500 [00:53<01:30,  3.50it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 185/500 [00:53<01:31,  3.45it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 185/500 [00:53<01:31,  3.45it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 186/500 [00:53<01:32,  3.40it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 186/500 [00:53<01:32,  3.40it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 187/500 [00:53<01:32,  3.39it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  37%|███▋      | 187/500 [00:54<01:32,  3.39it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 188/500 [00:54<01:29,  3.49it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 188/500 [00:55<01:29,  3.49it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 189/500 [00:55<03:35,  1.45it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 189/500 [00:56<03:35,  1.45it/s]
[TorchDR] DR Loss : 3.95e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 190/500 [00:56<02:57,  1.74it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 190/500 [00:56<02:57,  1.74it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 191/500 [00:56<02:31,  2.03it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 191/500 [00:56<02:31,  2.03it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 192/500 [00:56<02:13,  2.30it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  38%|███▊      | 192/500 [00:56<02:13,  2.30it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▊      | 193/500 [00:57<02:00,  2.54it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▊      | 193/500 [00:57<02:00,  2.54it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 194/500 [00:57<01:42,  2.97it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 194/500 [00:57<01:42,  2.97it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 195/500 [00:57<01:39,  3.08it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 195/500 [00:57<01:39,  3.08it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 196/500 [00:57<01:36,  3.15it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 196/500 [00:58<01:36,  3.15it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 197/500 [00:58<01:43,  2.94it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  39%|███▉      | 197/500 [00:58<01:43,  2.94it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  40%|███▉      | 198/500 [00:58<01:30,  3.34it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  40%|███▉      | 198/500 [00:58<01:30,  3.34it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  40%|███▉      | 199/500 [00:58<01:30,  3.32it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  40%|███▉      | 199/500 [00:59<01:30,  3.32it/s]
[TorchDR] DR Loss : 3.94e+00 | Grad norm : 6.01e-02 :  40%|████      | 200/500 [00:59<01:30,  3.33it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  40%|████      | 200/500 [00:59<01:30,  3.33it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  40%|████      | 201/500 [00:59<01:38,  3.03it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  40%|████      | 201/500 [00:59<01:38,  3.03it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  40%|████      | 202/500 [00:59<01:35,  3.11it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  40%|████      | 202/500 [01:00<01:35,  3.11it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 203/500 [01:00<01:42,  2.91it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 203/500 [01:00<01:42,  2.91it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 204/500 [01:00<01:37,  3.02it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 204/500 [01:00<01:37,  3.02it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 205/500 [01:00<01:34,  3.11it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 205/500 [01:00<01:34,  3.11it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 206/500 [01:00<01:32,  3.18it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████      | 206/500 [01:01<01:32,  3.18it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████▏     | 207/500 [01:01<01:22,  3.54it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  41%|████▏     | 207/500 [01:01<01:22,  3.54it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 208/500 [01:01<01:23,  3.50it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 208/500 [01:01<01:23,  3.50it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 209/500 [01:01<01:16,  3.82it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 209/500 [01:02<01:16,  3.82it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 210/500 [01:02<01:18,  3.67it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 210/500 [01:02<01:18,  3.67it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 211/500 [01:02<01:21,  3.56it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 211/500 [01:02<01:21,  3.56it/s]
[TorchDR] DR Loss : 3.93e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 212/500 [01:02<01:22,  3.49it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  42%|████▏     | 212/500 [01:02<01:22,  3.49it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 213/500 [01:02<01:23,  3.46it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 213/500 [01:03<01:23,  3.46it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 214/500 [01:03<01:23,  3.42it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 214/500 [01:03<01:23,  3.42it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 215/500 [01:03<01:15,  3.77it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 215/500 [01:03<01:15,  3.77it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 216/500 [01:03<01:10,  4.05it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 216/500 [01:03<01:10,  4.05it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 217/500 [01:03<01:14,  3.82it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  43%|████▎     | 217/500 [01:04<01:14,  3.82it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▎     | 218/500 [01:04<01:09,  4.08it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▎     | 218/500 [01:04<01:09,  4.08it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 219/500 [01:04<01:21,  3.45it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 219/500 [01:04<01:21,  3.45it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 220/500 [01:04<01:21,  3.42it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 220/500 [01:05<01:21,  3.42it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 221/500 [01:05<01:14,  3.76it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 221/500 [01:05<01:14,  3.76it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 222/500 [01:05<01:16,  3.63it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  44%|████▍     | 222/500 [01:05<01:16,  3.63it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▍     | 223/500 [01:05<01:18,  3.54it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▍     | 223/500 [01:05<01:18,  3.54it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▍     | 224/500 [01:05<01:19,  3.48it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▍     | 224/500 [01:06<01:19,  3.48it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▌     | 225/500 [01:06<01:12,  3.81it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▌     | 225/500 [01:06<01:12,  3.81it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▌     | 226/500 [01:06<01:07,  4.09it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▌     | 226/500 [01:06<01:07,  4.09it/s]
[TorchDR] DR Loss : 3.92e+00 | Grad norm : 3.70e-02 :  45%|████▌     | 227/500 [01:06<01:19,  3.45it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  45%|████▌     | 227/500 [01:07<01:19,  3.45it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 228/500 [01:07<01:19,  3.41it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 228/500 [01:07<01:19,  3.41it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 229/500 [01:07<01:19,  3.40it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 229/500 [01:07<01:19,  3.40it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 230/500 [01:07<01:12,  3.74it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 230/500 [01:07<01:12,  3.74it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 231/500 [01:07<01:06,  4.03it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▌     | 231/500 [01:08<01:06,  4.03it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▋     | 232/500 [01:08<01:10,  3.81it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  46%|████▋     | 232/500 [01:08<01:10,  3.81it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 233/500 [01:08<01:13,  3.65it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 233/500 [01:08<01:13,  3.65it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 234/500 [01:08<01:14,  3.55it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 234/500 [01:08<01:14,  3.55it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 235/500 [01:08<01:23,  3.17it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 235/500 [01:09<01:23,  3.17it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 236/500 [01:09<01:14,  3.54it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 236/500 [01:09<01:14,  3.54it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 237/500 [01:09<01:08,  3.86it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  47%|████▋     | 237/500 [01:09<01:08,  3.86it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 238/500 [01:09<01:03,  4.15it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 238/500 [01:09<01:03,  4.15it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 239/500 [01:09<01:07,  3.89it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 239/500 [01:10<01:07,  3.89it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 240/500 [01:10<01:10,  3.70it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 240/500 [01:10<01:10,  3.70it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 241/500 [01:10<01:12,  3.57it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 241/500 [01:10<01:12,  3.57it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 242/500 [01:10<01:13,  3.51it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  48%|████▊     | 242/500 [01:11<01:13,  3.51it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▊     | 243/500 [01:11<01:07,  3.83it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▊     | 243/500 [01:11<01:07,  3.83it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 244/500 [01:11<01:09,  3.70it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 244/500 [01:11<01:09,  3.70it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 245/500 [01:11<01:03,  3.99it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 245/500 [01:11<01:03,  3.99it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 246/500 [01:11<00:59,  4.24it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 246/500 [01:12<00:59,  4.24it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 247/500 [01:12<01:04,  3.92it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  49%|████▉     | 247/500 [01:12<01:04,  3.92it/s]
[TorchDR] DR Loss : 3.91e+00 | Grad norm : 3.70e-02 :  50%|████▉     | 248/500 [01:12<01:07,  3.74it/s]
[TorchDR] DR Loss : 3.90e+00 | Grad norm : 3.70e-02 :  50%|████▉     | 248/500 [01:12<01:07,  3.74it/s]
[TorchDR] DR Loss : 3.90e+00 | Grad norm : 3.70e-02 :  50%|████▉     | 249/500 [01:12<01:02,  4.02it/s]
[TorchDR] DR Loss : 3.90e+00 | Grad norm : 3.70e-02 :  50%|████▉     | 249/500 [01:12<01:02,  4.02it/s]
[TorchDR] DR Loss : 3.90e+00 | Grad norm : 3.70e-02 :  50%|█████     | 250/500 [01:12<01:05,  3.80it/s]
[TorchDR] DR Loss : 3.90e+00 | Grad norm : 2.32e-02 :  50%|█████     | 250/500 [01:13<01:05,  3.80it/s]
[TorchDR] DR Loss : 3.90e+00 | Grad norm : 2.32e-02 :  50%|█████     | 251/500 [01:13<01:01,  4.07it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  50%|█████     | 251/500 [01:13<01:01,  4.07it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  50%|█████     | 252/500 [01:13<01:04,  3.82it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  50%|█████     | 252/500 [01:13<01:04,  3.82it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  51%|█████     | 253/500 [01:13<01:07,  3.67it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  51%|█████     | 253/500 [01:13<01:07,  3.67it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  51%|█████     | 254/500 [01:13<01:09,  3.55it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  51%|█████     | 254/500 [01:14<01:09,  3.55it/s]
[TorchDR] DR Loss : 1.15e+01 | Grad norm : 2.32e-02 :  51%|█████     | 255/500 [01:14<01:10,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  51%|█████     | 255/500 [01:14<01:10,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  51%|█████     | 256/500 [01:14<01:03,  3.83it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  51%|█████     | 256/500 [01:14<01:03,  3.83it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  51%|█████▏    | 257/500 [01:14<01:05,  3.69it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  51%|█████▏    | 257/500 [01:14<01:05,  3.69it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 258/500 [01:14<01:00,  3.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 258/500 [01:15<01:00,  3.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 259/500 [01:15<01:03,  3.77it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 259/500 [01:15<01:03,  3.77it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 260/500 [01:15<01:06,  3.63it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 260/500 [01:15<01:06,  3.63it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 261/500 [01:15<01:07,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 261/500 [01:16<01:07,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 262/500 [01:16<01:08,  3.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  52%|█████▏    | 262/500 [01:16<01:08,  3.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 263/500 [01:16<01:02,  3.81it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 263/500 [01:16<01:02,  3.81it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 264/500 [01:16<01:04,  3.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 264/500 [01:16<01:04,  3.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 265/500 [01:16<00:59,  3.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 265/500 [01:17<00:59,  3.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 266/500 [01:17<01:02,  3.76it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 266/500 [01:17<01:02,  3.76it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 267/500 [01:17<01:02,  3.76it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  53%|█████▎    | 267/500 [01:17<01:02,  3.76it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▎    | 268/500 [01:17<00:59,  3.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▎    | 268/500 [01:17<00:59,  3.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 269/500 [01:17<00:55,  4.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 269/500 [01:18<00:55,  4.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 270/500 [01:18<00:59,  3.88it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 270/500 [01:18<00:59,  3.88it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 271/500 [01:18<01:01,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 271/500 [01:18<01:01,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 272/500 [01:18<01:03,  3.59it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  54%|█████▍    | 272/500 [01:18<01:03,  3.59it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▍    | 273/500 [01:18<01:04,  3.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▍    | 273/500 [01:19<01:04,  3.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▍    | 274/500 [01:19<00:59,  3.82it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▍    | 274/500 [01:19<00:59,  3.82it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▌    | 275/500 [01:19<01:01,  3.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▌    | 275/500 [01:19<01:01,  3.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▌    | 276/500 [01:19<01:03,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▌    | 276/500 [01:20<01:03,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▌    | 277/500 [01:20<01:03,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  55%|█████▌    | 277/500 [01:20<01:03,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 278/500 [01:20<00:57,  3.83it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 278/500 [01:20<00:57,  3.83it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 279/500 [01:20<01:06,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 279/500 [01:21<01:06,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 280/500 [01:21<01:12,  3.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 280/500 [01:21<01:12,  3.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 281/500 [01:21<01:10,  3.11it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▌    | 281/500 [01:21<01:10,  3.11it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▋    | 282/500 [01:21<01:08,  3.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  56%|█████▋    | 282/500 [01:21<01:08,  3.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 283/500 [01:21<01:07,  3.22it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 283/500 [01:22<01:07,  3.22it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 284/500 [01:22<01:06,  3.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 284/500 [01:22<01:06,  3.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 285/500 [01:22<01:03,  3.38it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 285/500 [01:22<01:03,  3.38it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 286/500 [01:22<00:59,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 286/500 [01:23<00:59,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 287/500 [01:23<00:54,  3.91it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  57%|█████▋    | 287/500 [01:23<00:54,  3.91it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 288/500 [01:23<00:56,  3.75it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 288/500 [01:23<00:56,  3.75it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 289/500 [01:23<00:52,  4.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 289/500 [01:23<00:52,  4.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 290/500 [01:23<00:55,  3.79it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 290/500 [01:24<00:55,  3.79it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 291/500 [01:24<00:57,  3.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 291/500 [01:24<00:57,  3.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 292/500 [01:24<01:04,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  58%|█████▊    | 292/500 [01:24<01:04,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▊    | 293/500 [01:24<01:03,  3.25it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▊    | 293/500 [01:25<01:03,  3.25it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 294/500 [01:25<01:02,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 294/500 [01:25<01:02,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 295/500 [01:25<01:02,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 295/500 [01:25<01:02,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 296/500 [01:25<01:07,  3.00it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 296/500 [01:26<01:07,  3.00it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 297/500 [01:26<01:11,  2.84it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  59%|█████▉    | 297/500 [01:26<01:11,  2.84it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  60%|█████▉    | 298/500 [01:26<01:14,  2.71it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  60%|█████▉    | 298/500 [01:27<01:14,  2.71it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  60%|█████▉    | 299/500 [01:27<01:15,  2.65it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  60%|█████▉    | 299/500 [01:27<01:15,  2.65it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.32e-02 :  60%|██████    | 300/500 [01:27<01:14,  2.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  60%|██████    | 300/500 [01:27<01:14,  2.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  60%|██████    | 301/500 [01:27<01:05,  3.02it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  60%|██████    | 301/500 [01:27<01:05,  3.02it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  60%|██████    | 302/500 [01:27<01:09,  2.84it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  60%|██████    | 302/500 [01:28<01:09,  2.84it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 303/500 [01:28<01:06,  2.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 303/500 [01:28<01:06,  2.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 304/500 [01:28<01:03,  3.07it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 304/500 [01:28<01:03,  3.07it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 305/500 [01:28<01:02,  3.14it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 305/500 [01:29<01:02,  3.14it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 306/500 [01:29<01:00,  3.20it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████    | 306/500 [01:29<01:00,  3.20it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████▏   | 307/500 [01:29<00:59,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  61%|██████▏   | 307/500 [01:29<00:59,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 308/500 [01:29<01:04,  2.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 308/500 [01:30<01:04,  2.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 309/500 [01:30<01:01,  3.09it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 309/500 [01:30<01:01,  3.09it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 310/500 [01:30<01:00,  3.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 310/500 [01:30<01:00,  3.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 311/500 [01:30<00:58,  3.21it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 311/500 [01:31<00:58,  3.21it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 312/500 [01:31<00:52,  3.56it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  62%|██████▏   | 312/500 [01:31<00:52,  3.56it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 313/500 [01:31<00:58,  3.19it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 313/500 [01:31<00:58,  3.19it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 314/500 [01:31<00:52,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 314/500 [01:31<00:52,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 315/500 [01:31<00:52,  3.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 315/500 [01:32<00:52,  3.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 316/500 [01:32<00:53,  3.44it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 316/500 [01:32<00:53,  3.44it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 317/500 [01:32<00:53,  3.41it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  63%|██████▎   | 317/500 [01:32<00:53,  3.41it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▎   | 318/500 [01:32<00:53,  3.39it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▎   | 318/500 [01:33<00:53,  3.39it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 319/500 [01:33<00:53,  3.38it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 319/500 [01:33<00:53,  3.38it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 320/500 [01:33<00:48,  3.71it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 320/500 [01:33<00:48,  3.71it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 321/500 [01:33<00:49,  3.61it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 321/500 [01:33<00:49,  3.61it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 322/500 [01:33<00:50,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  64%|██████▍   | 322/500 [01:34<00:50,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▍   | 323/500 [01:34<00:51,  3.47it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▍   | 323/500 [01:34<00:51,  3.47it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▍   | 324/500 [01:34<00:51,  3.43it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▍   | 324/500 [01:34<00:51,  3.43it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▌   | 325/500 [01:34<00:46,  3.76it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▌   | 325/500 [01:35<00:46,  3.76it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▌   | 326/500 [01:35<00:47,  3.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▌   | 326/500 [01:35<00:47,  3.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▌   | 327/500 [01:35<00:48,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  65%|██████▌   | 327/500 [01:35<00:48,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 328/500 [01:35<00:44,  3.87it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 328/500 [01:35<00:44,  3.87it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 329/500 [01:35<00:51,  3.35it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 329/500 [01:36<00:51,  3.35it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 330/500 [01:36<00:50,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 330/500 [01:36<00:50,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 331/500 [01:36<00:50,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▌   | 331/500 [01:36<00:50,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▋   | 332/500 [01:36<00:50,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  66%|██████▋   | 332/500 [01:37<00:50,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 333/500 [01:37<00:50,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 333/500 [01:37<00:50,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 334/500 [01:37<00:49,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 334/500 [01:37<00:49,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 335/500 [01:37<00:49,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 335/500 [01:38<00:49,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 336/500 [01:38<00:49,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 336/500 [01:38<00:49,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 337/500 [01:38<00:48,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  67%|██████▋   | 337/500 [01:38<00:48,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 338/500 [01:38<00:48,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 338/500 [01:38<00:48,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 339/500 [01:38<00:48,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 339/500 [01:39<00:48,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 340/500 [01:39<00:47,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 340/500 [01:39<00:47,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 341/500 [01:39<00:47,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 341/500 [01:39<00:47,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 342/500 [01:39<00:47,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  68%|██████▊   | 342/500 [01:40<00:47,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▊   | 343/500 [01:40<00:42,  3.69it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▊   | 343/500 [01:40<00:42,  3.69it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 344/500 [01:40<00:43,  3.58it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 344/500 [01:40<00:43,  3.58it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 345/500 [01:40<00:44,  3.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 345/500 [01:40<00:44,  3.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 346/500 [01:40<00:44,  3.47it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 346/500 [01:41<00:44,  3.47it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 347/500 [01:41<00:49,  3.10it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  69%|██████▉   | 347/500 [01:41<00:49,  3.10it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  70%|██████▉   | 348/500 [01:41<00:47,  3.18it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  70%|██████▉   | 348/500 [01:41<00:47,  3.18it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  70%|██████▉   | 349/500 [01:41<00:42,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  70%|██████▉   | 349/500 [01:42<00:42,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.47e-03 :  70%|███████   | 350/500 [01:42<00:47,  3.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  70%|███████   | 350/500 [01:42<00:47,  3.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  70%|███████   | 351/500 [01:42<00:50,  2.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  70%|███████   | 351/500 [01:42<00:50,  2.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  70%|███████   | 352/500 [01:42<00:48,  3.04it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  70%|███████   | 352/500 [01:43<00:48,  3.04it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 353/500 [01:43<00:47,  3.12it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 353/500 [01:43<00:47,  3.12it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 354/500 [01:43<00:46,  3.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 354/500 [01:43<00:46,  3.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 355/500 [01:43<00:49,  2.95it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 355/500 [01:44<00:49,  2.95it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 356/500 [01:44<00:47,  3.05it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████   | 356/500 [01:44<00:47,  3.05it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████▏  | 357/500 [01:44<00:45,  3.13it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  71%|███████▏  | 357/500 [01:44<00:45,  3.13it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 358/500 [01:44<00:44,  3.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 358/500 [01:45<00:44,  3.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 359/500 [01:45<00:47,  2.95it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 359/500 [01:45<00:47,  2.95it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 360/500 [01:45<00:45,  3.06it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 360/500 [01:45<00:45,  3.06it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 361/500 [01:45<00:44,  3.12it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 361/500 [01:46<00:44,  3.12it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 362/500 [01:46<00:47,  2.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  72%|███████▏  | 362/500 [01:46<00:47,  2.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 363/500 [01:46<00:45,  3.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 363/500 [01:46<00:45,  3.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 364/500 [01:46<00:47,  2.85it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 364/500 [01:47<00:47,  2.85it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 365/500 [01:47<00:45,  2.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 365/500 [01:47<00:45,  2.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 366/500 [01:47<00:43,  3.06it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 366/500 [01:47<00:43,  3.06it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 367/500 [01:47<00:42,  3.13it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  73%|███████▎  | 367/500 [01:48<00:42,  3.13it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▎  | 368/500 [01:48<00:41,  3.19it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▎  | 368/500 [01:48<00:41,  3.19it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 369/500 [01:48<00:44,  2.96it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 369/500 [01:48<00:44,  2.96it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 370/500 [01:48<00:46,  2.79it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 370/500 [01:49<00:46,  2.79it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 371/500 [01:49<00:47,  2.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 371/500 [01:49<00:47,  2.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 372/500 [01:49<00:48,  2.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  74%|███████▍  | 372/500 [01:50<00:48,  2.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▍  | 373/500 [01:50<00:45,  2.81it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▍  | 373/500 [01:50<00:45,  2.81it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▍  | 374/500 [01:50<00:50,  2.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▍  | 374/500 [01:50<00:50,  2.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▌  | 375/500 [01:50<00:53,  2.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▌  | 375/500 [01:51<00:53,  2.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▌  | 376/500 [01:51<00:55,  2.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▌  | 376/500 [01:52<00:55,  2.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▌  | 377/500 [01:52<01:26,  1.42it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  75%|███████▌  | 377/500 [01:53<01:26,  1.42it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 378/500 [01:53<01:15,  1.62it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 378/500 [01:53<01:15,  1.62it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 379/500 [01:53<01:09,  1.73it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 379/500 [01:54<01:09,  1.73it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 380/500 [01:54<00:59,  2.01it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 380/500 [01:54<00:59,  2.01it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 381/500 [01:54<00:52,  2.29it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▌  | 381/500 [01:54<00:52,  2.29it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▋  | 382/500 [01:54<00:46,  2.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  76%|███████▋  | 382/500 [01:54<00:46,  2.52it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 383/500 [01:54<00:42,  2.73it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 383/500 [01:55<00:42,  2.73it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 384/500 [01:55<00:40,  2.88it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 384/500 [01:55<00:40,  2.88it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 385/500 [01:55<00:38,  3.00it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 385/500 [01:55<00:38,  3.00it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 386/500 [01:55<00:43,  2.62it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 386/500 [01:56<00:43,  2.62it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 387/500 [01:56<00:37,  3.04it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  77%|███████▋  | 387/500 [01:56<00:37,  3.04it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 388/500 [01:56<00:38,  2.88it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 388/500 [01:56<00:38,  2.88it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 389/500 [01:56<00:37,  2.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 389/500 [01:57<00:37,  2.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 390/500 [01:57<00:35,  3.07it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 390/500 [01:58<00:35,  3.07it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 391/500 [01:58<00:57,  1.90it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 391/500 [01:58<00:57,  1.90it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 392/500 [01:58<00:55,  1.93it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  78%|███████▊  | 392/500 [01:59<00:55,  1.93it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▊  | 393/500 [01:59<00:54,  1.95it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▊  | 393/500 [01:59<00:54,  1.95it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 394/500 [01:59<00:57,  1.86it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 394/500 [02:00<00:57,  1.86it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 395/500 [02:00<00:49,  2.14it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 395/500 [02:00<00:49,  2.14it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 396/500 [02:00<00:43,  2.39it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 396/500 [02:00<00:43,  2.39it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 397/500 [02:00<00:39,  2.61it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  79%|███████▉  | 397/500 [02:01<00:39,  2.61it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  80%|███████▉  | 398/500 [02:01<00:36,  2.79it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  80%|███████▉  | 398/500 [02:01<00:36,  2.79it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  80%|███████▉  | 399/500 [02:01<00:34,  2.94it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  80%|███████▉  | 399/500 [02:01<00:34,  2.94it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.52e-04 :  80%|████████  | 400/500 [02:01<00:35,  2.81it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  80%|████████  | 400/500 [02:02<00:35,  2.81it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  80%|████████  | 401/500 [02:02<00:33,  2.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  80%|████████  | 401/500 [02:02<00:33,  2.92it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  80%|████████  | 402/500 [02:02<00:35,  2.78it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  80%|████████  | 402/500 [02:02<00:35,  2.78it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 403/500 [02:02<00:38,  2.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 403/500 [02:03<00:38,  2.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 404/500 [02:03<00:46,  2.05it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 404/500 [02:04<00:46,  2.05it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 405/500 [02:04<00:44,  2.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 405/500 [02:04<00:44,  2.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 406/500 [02:04<00:41,  2.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████  | 406/500 [02:04<00:41,  2.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████▏ | 407/500 [02:04<00:37,  2.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  81%|████████▏ | 407/500 [02:05<00:37,  2.49it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 408/500 [02:05<00:34,  2.69it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 408/500 [02:05<00:34,  2.69it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 409/500 [02:05<00:31,  2.87it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 409/500 [02:05<00:31,  2.87it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 410/500 [02:05<00:27,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 410/500 [02:05<00:27,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 411/500 [02:05<00:27,  3.30it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 411/500 [02:06<00:27,  3.30it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 412/500 [02:06<00:26,  3.30it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  82%|████████▏ | 412/500 [02:06<00:26,  3.30it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 413/500 [02:06<00:26,  3.31it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 413/500 [02:06<00:26,  3.31it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 414/500 [02:06<00:25,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 414/500 [02:07<00:25,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 415/500 [02:07<00:25,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 415/500 [02:07<00:25,  3.33it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 416/500 [02:07<00:25,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 416/500 [02:07<00:25,  3.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 417/500 [02:07<00:24,  3.35it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  83%|████████▎ | 417/500 [02:07<00:24,  3.35it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▎ | 418/500 [02:07<00:22,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▎ | 418/500 [02:08<00:22,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 419/500 [02:08<00:22,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 419/500 [02:08<00:22,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 420/500 [02:08<00:20,  3.91it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 420/500 [02:08<00:20,  3.91it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 421/500 [02:08<00:18,  4.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 421/500 [02:08<00:18,  4.17it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 422/500 [02:08<00:20,  3.90it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  84%|████████▍ | 422/500 [02:09<00:20,  3.90it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▍ | 423/500 [02:09<00:18,  4.15it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▍ | 423/500 [02:09<00:18,  4.15it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▍ | 424/500 [02:09<00:19,  3.89it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▍ | 424/500 [02:09<00:19,  3.89it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▌ | 425/500 [02:09<00:20,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▌ | 425/500 [02:09<00:20,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▌ | 426/500 [02:09<00:20,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▌ | 426/500 [02:10<00:20,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▌ | 427/500 [02:10<00:18,  3.90it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  85%|████████▌ | 427/500 [02:10<00:18,  3.90it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 428/500 [02:10<00:19,  3.72it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 428/500 [02:10<00:19,  3.72it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 429/500 [02:10<00:19,  3.59it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 429/500 [02:10<00:19,  3.59it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 430/500 [02:10<00:19,  3.51it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 430/500 [02:11<00:19,  3.51it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 431/500 [02:11<00:19,  3.46it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▌ | 431/500 [02:11<00:19,  3.46it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▋ | 432/500 [02:11<00:19,  3.42it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  86%|████████▋ | 432/500 [02:11<00:19,  3.42it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 433/500 [02:11<00:19,  3.40it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 433/500 [02:12<00:19,  3.40it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 434/500 [02:12<00:17,  3.74it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 434/500 [02:12<00:17,  3.74it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 435/500 [02:12<00:19,  3.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 435/500 [02:12<00:19,  3.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 436/500 [02:12<00:19,  3.29it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 436/500 [02:13<00:19,  3.29it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 437/500 [02:13<00:19,  3.30it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  87%|████████▋ | 437/500 [02:13<00:19,  3.30it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 438/500 [02:13<00:16,  3.66it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 438/500 [02:13<00:16,  3.66it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 439/500 [02:13<00:17,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 439/500 [02:13<00:17,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 440/500 [02:13<00:17,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 440/500 [02:14<00:17,  3.50it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 441/500 [02:14<00:17,  3.45it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 441/500 [02:14<00:17,  3.45it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 442/500 [02:14<00:17,  3.41it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  88%|████████▊ | 442/500 [02:14<00:17,  3.41it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▊ | 443/500 [02:14<00:18,  3.08it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▊ | 443/500 [02:15<00:18,  3.08it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 444/500 [02:15<00:17,  3.14it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 444/500 [02:15<00:17,  3.14it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 445/500 [02:15<00:17,  3.21it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 445/500 [02:15<00:17,  3.21it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 446/500 [02:15<00:16,  3.25it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 446/500 [02:16<00:16,  3.25it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 447/500 [02:16<00:16,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  89%|████████▉ | 447/500 [02:16<00:16,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  90%|████████▉ | 448/500 [02:16<00:14,  3.63it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  90%|████████▉ | 448/500 [02:16<00:14,  3.63it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  90%|████████▉ | 449/500 [02:16<00:15,  3.21it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  90%|████████▉ | 449/500 [02:17<00:15,  3.21it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 9.96e-05 :  90%|█████████ | 450/500 [02:17<00:15,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  90%|█████████ | 450/500 [02:17<00:15,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  90%|█████████ | 451/500 [02:17<00:15,  3.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  90%|█████████ | 451/500 [02:17<00:15,  3.26it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  90%|█████████ | 452/500 [02:17<00:16,  2.99it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  90%|█████████ | 452/500 [02:18<00:16,  2.99it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 453/500 [02:18<00:15,  3.07it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 453/500 [02:18<00:15,  3.07it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 454/500 [02:18<00:14,  3.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 454/500 [02:18<00:14,  3.16it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 455/500 [02:18<00:15,  2.94it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 455/500 [02:18<00:15,  2.94it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 456/500 [02:18<00:13,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████ | 456/500 [02:19<00:13,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████▏| 457/500 [02:19<00:12,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  91%|█████████▏| 457/500 [02:19<00:12,  3.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 458/500 [02:19<00:11,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 458/500 [02:19<00:11,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 459/500 [02:19<00:12,  3.24it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 459/500 [02:20<00:12,  3.24it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 460/500 [02:20<00:12,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 460/500 [02:20<00:12,  3.28it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 461/500 [02:20<00:10,  3.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 461/500 [02:20<00:10,  3.64it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 462/500 [02:20<00:09,  3.96it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  92%|█████████▏| 462/500 [02:20<00:09,  3.96it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 463/500 [02:20<00:09,  3.74it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 463/500 [02:21<00:09,  3.74it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 464/500 [02:21<00:09,  3.62it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 464/500 [02:21<00:09,  3.62it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 465/500 [02:21<00:09,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 465/500 [02:21<00:09,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 466/500 [02:21<00:09,  3.47it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 466/500 [02:21<00:09,  3.47it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 467/500 [02:21<00:08,  3.80it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  93%|█████████▎| 467/500 [02:22<00:08,  3.80it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▎| 468/500 [02:22<00:08,  3.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▎| 468/500 [02:22<00:08,  3.67it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 469/500 [02:22<00:07,  3.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 469/500 [02:22<00:07,  3.97it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 470/500 [02:22<00:07,  3.77it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 470/500 [02:22<00:07,  3.77it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 471/500 [02:22<00:07,  4.04it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 471/500 [02:23<00:07,  4.04it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 472/500 [02:23<00:08,  3.43it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  94%|█████████▍| 472/500 [02:23<00:08,  3.43it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▍| 473/500 [02:23<00:08,  3.09it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▍| 473/500 [02:23<00:08,  3.09it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▍| 474/500 [02:23<00:07,  3.46it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▍| 474/500 [02:24<00:07,  3.46it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▌| 475/500 [02:24<00:07,  3.13it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▌| 475/500 [02:24<00:07,  3.13it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▌| 476/500 [02:24<00:07,  3.19it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▌| 476/500 [02:24<00:07,  3.19it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▌| 477/500 [02:24<00:07,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  95%|█████████▌| 477/500 [02:25<00:07,  3.23it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 478/500 [02:25<00:06,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 478/500 [02:25<00:06,  3.60it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 479/500 [02:25<00:05,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 479/500 [02:25<00:05,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 480/500 [02:25<00:05,  3.86it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 480/500 [02:25<00:05,  3.86it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 481/500 [02:25<00:05,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▌| 481/500 [02:26<00:05,  3.70it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▋| 482/500 [02:26<00:03,  4.51it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  96%|█████████▋| 482/500 [02:26<00:03,  4.51it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 483/500 [02:26<00:04,  4.10it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 483/500 [02:26<00:04,  4.10it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 484/500 [02:26<00:03,  4.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 484/500 [02:26<00:03,  4.32it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 485/500 [02:26<00:03,  3.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 485/500 [02:27<00:03,  3.98it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 486/500 [02:27<00:03,  3.77it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 486/500 [02:27<00:03,  3.77it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 487/500 [02:27<00:03,  3.63it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  97%|█████████▋| 487/500 [02:27<00:03,  3.63it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 488/500 [02:27<00:03,  3.93it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 488/500 [02:27<00:03,  3.93it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 489/500 [02:27<00:02,  3.74it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 489/500 [02:28<00:02,  3.74it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 490/500 [02:28<00:02,  4.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 490/500 [02:28<00:02,  4.03it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 491/500 [02:28<00:02,  3.80it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 491/500 [02:28<00:02,  3.80it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 492/500 [02:28<00:02,  3.65it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  98%|█████████▊| 492/500 [02:29<00:02,  3.65it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▊| 493/500 [02:29<00:01,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▊| 493/500 [02:29<00:01,  3.55it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 494/500 [02:29<00:01,  3.48it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 494/500 [02:29<00:01,  3.48it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 495/500 [02:29<00:01,  3.44it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 495/500 [02:29<00:01,  3.44it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 496/500 [02:29<00:01,  3.41it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 496/500 [02:30<00:01,  3.41it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 497/500 [02:30<00:00,  3.39it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 :  99%|█████████▉| 497/500 [02:30<00:00,  3.39it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 : 100%|█████████▉| 498/500 [02:30<00:00,  3.73it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 : 100%|█████████▉| 498/500 [02:30<00:00,  3.73it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 : 100%|█████████▉| 499/500 [02:30<00:00,  3.61it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 : 100%|█████████▉| 499/500 [02:30<00:00,  3.61it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 : 100%|██████████| 500/500 [02:30<00:00,  3.53it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 1.80e-05 : 100%|██████████| 500/500 [02:30<00:00,  3.31it/s]

Load the SNARE-seq dataset (gene expression) with cell type labels#

def load_numpy_from_url(url, delimiter="\t"):
    """
    Load a numpy array from a URL.

    Parameters
    ----------
    url : str
        URL to load data from.
    delimiter : str, default="\t"
        Delimiter used in the data file.

    Returns
    -------
    numpy.ndarray
        Loaded data as a numpy array.
    """
    response = urllib.request.urlopen(url)
    data = response.read().decode("utf-8")
    data = data.split("\n")
    data = [row.split(delimiter) for row in data if row]
    numpy_array = np.array(data, dtype=float)
    return numpy_array


url_x = "https://rsinghlab.github.io/SCOT/data/snare_rna.txt"
snare_data = load_numpy_from_url(url_x) / 100

url_y = "https://rsinghlab.github.io/SCOT/data/SNAREseq_types.txt"
snare_labels = load_numpy_from_url(url_y)

Computing TSNE and COSNE on SNARE-seq data#

We can now proceed to computing the two DR methods and visualizing the results on the SNARE-seq dataset

tsne_model = TSNE(verbose=True, max_iter=500)
out_tsne = tsne_model.fit_transform(snare_data)

cosne_model = COSNE(lr=1e-1, verbose=True, gamma=0.5, lambda1=0.01, max_iter=500)
out_cosne = cosne_model.fit_transform(snare_data)


fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(16, 8))
axes[0].scatter(*out_tsne.T, c=snare_labels.squeeze(1), cmap=plt.get_cmap("rainbow"))
axes[0].set_xticks([])
axes[0].set_yticks([])
axes[0].set_title("T-SNE", fontsize=24)
plotGrid(axes[1])
axes[1].scatter(*out_cosne.T, c=snare_labels.squeeze(1), cmap=plt.get_cmap("rainbow"))
axes[1].axis("off")
axes[1].set_title("CO-SNE", fontsize=24)
plt.show()
T-SNE, CO-SNE
Random state is None
[TorchDR] Initializing DR model TSNE.
[TorchDR] Affinity : computing the Entropic Affinity matrix.
[TorchDR] Affinity : sparsity mode enabled, computing 90 nearest neighbors. If this step is too slow, consider reducing the dimensionality of the data or disabling sparsity.

  0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  3.40e-01 (std =  8.51e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  2.16e-01 (std =  7.86e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  1.31e-01 (std =  6.42e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  1.31e-01 (std =  6.42e-02) :   3%|▎         | 3/100 [00:00<00:03, 29.64it/s]
[TorchDR] Root search : mean abs value =  7.78e-02 (std =  4.84e-02) :   3%|▎         | 3/100 [00:00<00:03, 29.64it/s]
[TorchDR] Root search : mean abs value =  4.59e-02 (std =  3.47e-02) :   3%|▎         | 3/100 [00:00<00:03, 29.64it/s]
[TorchDR] Root search : mean abs value =  2.72e-02 (std =  2.42e-02) :   3%|▎         | 3/100 [00:00<00:03, 29.64it/s]
[TorchDR] Root search : mean abs value =  2.72e-02 (std =  2.42e-02) :   6%|▌         | 6/100 [00:00<00:03, 29.75it/s]
[TorchDR] Root search : mean abs value =  1.62e-02 (std =  1.66e-02) :   6%|▌         | 6/100 [00:00<00:03, 29.75it/s]
[TorchDR] Root search : mean abs value =  9.73e-03 (std =  1.13e-02) :   6%|▌         | 6/100 [00:00<00:03, 29.75it/s]
[TorchDR] Root search : mean abs value =  5.90e-03 (std =  7.69e-03) :   6%|▌         | 6/100 [00:00<00:03, 29.75it/s]
[TorchDR] Root search : mean abs value =  3.60e-03 (std =  5.23e-03) :   6%|▌         | 6/100 [00:00<00:03, 29.75it/s]
[TorchDR] Root search : mean abs value =  3.60e-03 (std =  5.23e-03) :  10%|█         | 10/100 [00:00<00:03, 26.10it/s]
[TorchDR] Root search : mean abs value =  2.22e-03 (std =  3.56e-03) :  10%|█         | 10/100 [00:00<00:03, 26.10it/s]
[TorchDR] Root search : mean abs value =  1.38e-03 (std =  2.43e-03) :  10%|█         | 10/100 [00:00<00:03, 26.10it/s]
[TorchDR] Root search : mean abs value =  8.67e-04 (std =  1.66e-03) :  10%|█         | 10/100 [00:00<00:03, 26.10it/s]
[TorchDR] Root search : mean abs value =  8.67e-04 (std =  1.66e-03) :  13%|█▎        | 13/100 [00:00<00:03, 24.96it/s]
[TorchDR] Root search : mean abs value =  5.47e-04 (std =  1.14e-03) :  13%|█▎        | 13/100 [00:00<00:03, 24.96it/s]
[TorchDR] Root search : mean abs value =  3.48e-04 (std =  7.87e-04) :  13%|█▎        | 13/100 [00:00<00:03, 24.96it/s]
[TorchDR] Root search : mean abs value =  2.23e-04 (std =  5.45e-04) :  13%|█▎        | 13/100 [00:00<00:03, 24.96it/s]
[TorchDR] Root search : mean abs value =  1.44e-04 (std =  3.79e-04) :  13%|█▎        | 13/100 [00:00<00:03, 24.96it/s]
[TorchDR] Root search : mean abs value =  9.31e-05 (std =  2.65e-04) :  13%|█▎        | 13/100 [00:00<00:03, 24.96it/s]
[TorchDR] Root search : mean abs value =  9.31e-05 (std =  2.65e-04) :  18%|█▊        | 18/100 [00:00<00:02, 32.42it/s]
[TorchDR] Root search : mean abs value =  6.07e-05 (std =  1.86e-04) :  18%|█▊        | 18/100 [00:00<00:02, 32.42it/s]
[TorchDR] Root search : mean abs value =  3.98e-05 (std =  1.31e-04) :  18%|█▊        | 18/100 [00:00<00:02, 32.42it/s]
[TorchDR] Root search : mean abs value =  2.62e-05 (std =  9.25e-05) :  18%|█▊        | 18/100 [00:00<00:02, 32.42it/s]
[TorchDR] Root search : mean abs value =  1.73e-05 (std =  6.56e-05) :  18%|█▊        | 18/100 [00:00<00:02, 32.42it/s]
[TorchDR] Root search : mean abs value =  1.73e-05 (std =  6.56e-05) :  22%|██▏       | 22/100 [00:00<00:02, 28.66it/s]
[TorchDR] Root search : mean abs value =  1.15e-05 (std =  4.67e-05) :  22%|██▏       | 22/100 [00:00<00:02, 28.66it/s]
[TorchDR] Root search : mean abs value =  1.15e-05 (std =  4.67e-05) :  23%|██▎       | 23/100 [00:00<00:02, 28.71it/s]
Random state is None

  0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   0%|          | 1/500 [00:00<01:54,  4.35it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   0%|          | 1/500 [00:00<01:54,  4.35it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   0%|          | 2/500 [00:00<01:46,  4.67it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   0%|          | 2/500 [00:00<01:46,  4.67it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 3/500 [00:00<02:21,  3.52it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 3/500 [00:01<02:21,  3.52it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 4/500 [00:01<02:10,  3.80it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 4/500 [00:01<02:10,  3.80it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 5/500 [00:01<02:11,  3.77it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 5/500 [00:01<02:11,  3.77it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 6/500 [00:01<02:05,  3.94it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|          | 6/500 [00:01<02:05,  3.94it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|▏         | 7/500 [00:01<02:23,  3.43it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   1%|▏         | 7/500 [00:02<02:23,  3.43it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   2%|▏         | 8/500 [00:02<02:29,  3.29it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   2%|▏         | 8/500 [00:02<02:29,  3.29it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   2%|▏         | 9/500 [00:02<02:28,  3.31it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   2%|▏         | 9/500 [00:02<02:28,  3.31it/s]
[TorchDR] DR Loss : 1.39e+01 | Grad norm : 2.30e-05 :   2%|▏         | 10/500 [00:02<02:13,  3.68it/s]
[TorchDR] DR Loss : 1.41e+01 | Grad norm : 2.30e-05 :   2%|▏         | 10/500 [00:03<02:13,  3.68it/s]
[TorchDR] DR Loss : 1.41e+01 | Grad norm : 2.30e-05 :   2%|▏         | 11/500 [00:03<02:16,  3.58it/s]
[TorchDR] DR Loss : 1.43e+01 | Grad norm : 2.30e-05 :   2%|▏         | 11/500 [00:03<02:16,  3.58it/s]
[TorchDR] DR Loss : 1.43e+01 | Grad norm : 2.30e-05 :   2%|▏         | 12/500 [00:03<02:04,  3.91it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 2.30e-05 :   2%|▏         | 12/500 [00:03<02:04,  3.91it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 2.30e-05 :   3%|▎         | 13/500 [00:03<01:56,  4.17it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 2.30e-05 :   3%|▎         | 13/500 [00:03<01:56,  4.17it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 2.30e-05 :   3%|▎         | 14/500 [00:03<01:50,  4.40it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   3%|▎         | 14/500 [00:03<01:50,  4.40it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   3%|▎         | 15/500 [00:03<02:00,  4.03it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   3%|▎         | 15/500 [00:04<02:00,  4.03it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   3%|▎         | 16/500 [00:04<01:53,  4.27it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   3%|▎         | 16/500 [00:04<01:53,  4.27it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   3%|▎         | 17/500 [00:04<01:48,  4.47it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   3%|▎         | 17/500 [00:04<01:48,  4.47it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   4%|▎         | 18/500 [00:04<01:44,  4.61it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▎         | 18/500 [00:04<01:44,  4.61it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 19/500 [00:04<01:41,  4.72it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 19/500 [00:04<01:41,  4.72it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 20/500 [00:05<01:49,  4.37it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 20/500 [00:05<01:49,  4.37it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 21/500 [00:05<01:49,  4.36it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 21/500 [00:05<01:49,  4.36it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 22/500 [00:05<01:59,  3.99it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   4%|▍         | 22/500 [00:05<01:59,  3.99it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▍         | 23/500 [00:05<01:52,  4.23it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▍         | 23/500 [00:05<01:52,  4.23it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▍         | 24/500 [00:05<01:47,  4.44it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▍         | 24/500 [00:06<01:47,  4.44it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▌         | 25/500 [00:06<02:11,  3.61it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▌         | 25/500 [00:06<02:11,  3.61it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▌         | 26/500 [00:06<02:00,  3.93it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▌         | 26/500 [00:06<02:00,  3.93it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▌         | 27/500 [00:06<02:06,  3.73it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   5%|▌         | 27/500 [00:07<02:06,  3.73it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 28/500 [00:07<02:10,  3.61it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 28/500 [00:07<02:10,  3.61it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 29/500 [00:07<02:09,  3.64it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 29/500 [00:07<02:09,  3.64it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 30/500 [00:07<02:02,  3.83it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 30/500 [00:07<02:02,  3.83it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 31/500 [00:07<01:54,  4.10it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▌         | 31/500 [00:08<01:54,  4.10it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▋         | 32/500 [00:08<02:01,  3.84it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   6%|▋         | 32/500 [00:08<02:01,  3.84it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   7%|▋         | 33/500 [00:08<02:02,  3.80it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   7%|▋         | 33/500 [00:08<02:02,  3.80it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   7%|▋         | 34/500 [00:08<01:57,  3.96it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   7%|▋         | 34/500 [00:08<01:57,  3.96it/s]
[TorchDR] DR Loss : 1.50e+01 | Grad norm : 2.30e-05 :   7%|▋         | 35/500 [00:08<01:50,  4.20it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   7%|▋         | 35/500 [00:09<01:50,  4.20it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   7%|▋         | 36/500 [00:09<02:13,  3.49it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   7%|▋         | 36/500 [00:09<02:13,  3.49it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   7%|▋         | 37/500 [00:09<02:14,  3.45it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   7%|▋         | 37/500 [00:09<02:14,  3.45it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   8%|▊         | 38/500 [00:09<02:01,  3.80it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   8%|▊         | 38/500 [00:10<02:01,  3.80it/s]
[TorchDR] DR Loss : 1.49e+01 | Grad norm : 2.30e-05 :   8%|▊         | 39/500 [00:10<02:30,  3.07it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   8%|▊         | 39/500 [00:10<02:30,  3.07it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   8%|▊         | 40/500 [00:10<02:26,  3.14it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   8%|▊         | 40/500 [00:10<02:26,  3.14it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   8%|▊         | 41/500 [00:10<02:37,  2.92it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   8%|▊         | 41/500 [00:11<02:37,  2.92it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   8%|▊         | 42/500 [00:11<02:34,  2.96it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   8%|▊         | 42/500 [00:11<02:34,  2.96it/s]
[TorchDR] DR Loss : 1.48e+01 | Grad norm : 2.30e-05 :   9%|▊         | 43/500 [00:11<02:29,  3.05it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▊         | 43/500 [00:11<02:29,  3.05it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 44/500 [00:11<02:21,  3.22it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 44/500 [00:12<02:21,  3.22it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 45/500 [00:12<02:10,  3.50it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 45/500 [00:12<02:10,  3.50it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 46/500 [00:12<02:21,  3.22it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 46/500 [00:12<02:21,  3.22it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 47/500 [00:12<02:23,  3.16it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :   9%|▉         | 47/500 [00:13<02:23,  3.16it/s]
[TorchDR] DR Loss : 1.47e+01 | Grad norm : 2.30e-05 :  10%|▉         | 48/500 [00:13<02:20,  3.22it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 2.30e-05 :  10%|▉         | 48/500 [00:13<02:20,  3.22it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 2.30e-05 :  10%|▉         | 49/500 [00:13<02:05,  3.59it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 2.30e-05 :  10%|▉         | 49/500 [00:13<02:05,  3.59it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 2.30e-05 :  10%|█         | 50/500 [00:13<02:48,  2.67it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  10%|█         | 50/500 [00:14<02:48,  2.67it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  10%|█         | 51/500 [00:14<02:38,  2.84it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  10%|█         | 51/500 [00:14<02:38,  2.84it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  10%|█         | 52/500 [00:14<02:30,  2.97it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  10%|█         | 52/500 [00:14<02:30,  2.97it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 53/500 [00:14<02:12,  3.38it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 53/500 [00:14<02:12,  3.38it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 54/500 [00:14<01:59,  3.75it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 54/500 [00:15<01:59,  3.75it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 55/500 [00:15<01:50,  4.04it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 55/500 [00:15<01:50,  4.04it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 56/500 [00:15<01:56,  3.81it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█         | 56/500 [00:15<01:56,  3.81it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█▏        | 57/500 [00:15<02:01,  3.65it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  11%|█▏        | 57/500 [00:15<02:01,  3.65it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 58/500 [00:15<02:00,  3.66it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 58/500 [00:16<02:00,  3.66it/s]
[TorchDR] DR Loss : 1.46e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 59/500 [00:16<02:07,  3.46it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 59/500 [00:16<02:07,  3.46it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 60/500 [00:16<02:04,  3.53it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 60/500 [00:16<02:04,  3.53it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 61/500 [00:16<01:57,  3.73it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 61/500 [00:17<01:57,  3.73it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 62/500 [00:17<02:01,  3.61it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  12%|█▏        | 62/500 [00:17<02:01,  3.61it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 63/500 [00:17<01:38,  4.44it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 63/500 [00:17<01:38,  4.44it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 64/500 [00:17<01:43,  4.21it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 64/500 [00:17<01:43,  4.21it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 65/500 [00:17<01:38,  4.41it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 65/500 [00:17<01:38,  4.41it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 66/500 [00:17<01:38,  4.41it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 66/500 [00:18<01:38,  4.41it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 67/500 [00:18<01:34,  4.57it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  13%|█▎        | 67/500 [00:18<01:34,  4.57it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▎        | 68/500 [00:18<01:32,  4.69it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▎        | 68/500 [00:18<01:32,  4.69it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 69/500 [00:18<01:30,  4.75it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 69/500 [00:18<01:30,  4.75it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 70/500 [00:18<01:29,  4.83it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 70/500 [00:18<01:29,  4.83it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 71/500 [00:18<01:40,  4.27it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 71/500 [00:19<01:40,  4.27it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 72/500 [00:19<01:35,  4.47it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  14%|█▍        | 72/500 [00:19<01:35,  4.47it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▍        | 73/500 [00:19<01:32,  4.62it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▍        | 73/500 [00:19<01:32,  4.62it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▍        | 74/500 [00:19<01:30,  4.73it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▍        | 74/500 [00:19<01:30,  4.73it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▌        | 75/500 [00:19<01:28,  4.80it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▌        | 75/500 [00:19<01:28,  4.80it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▌        | 76/500 [00:19<01:27,  4.85it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▌        | 76/500 [00:20<01:27,  4.85it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▌        | 77/500 [00:20<01:26,  4.88it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  15%|█▌        | 77/500 [00:20<01:26,  4.88it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 78/500 [00:20<01:37,  4.31it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 78/500 [00:20<01:37,  4.31it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 79/500 [00:20<01:42,  4.10it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 79/500 [00:20<01:42,  4.10it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 80/500 [00:20<01:37,  4.33it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 80/500 [00:21<01:37,  4.33it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 81/500 [00:21<01:36,  4.33it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▌        | 81/500 [00:21<01:36,  4.33it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▋        | 82/500 [00:21<01:45,  3.98it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  16%|█▋        | 82/500 [00:21<01:45,  3.98it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 83/500 [00:21<01:50,  3.76it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 83/500 [00:22<01:50,  3.76it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 84/500 [00:22<01:54,  3.62it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 84/500 [00:22<01:54,  3.62it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 85/500 [00:22<01:57,  3.53it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 85/500 [00:22<01:57,  3.53it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 86/500 [00:22<01:59,  3.47it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 86/500 [00:22<01:59,  3.47it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 87/500 [00:22<02:00,  3.44it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  17%|█▋        | 87/500 [00:23<02:00,  3.44it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 88/500 [00:23<01:57,  3.51it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 88/500 [00:23<01:57,  3.51it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 89/500 [00:23<01:59,  3.45it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 89/500 [00:23<01:59,  3.45it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 90/500 [00:23<02:00,  3.41it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 90/500 [00:24<02:00,  3.41it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 91/500 [00:24<02:04,  3.29it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 91/500 [00:24<02:04,  3.29it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 92/500 [00:24<02:00,  3.40it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  18%|█▊        | 92/500 [00:24<02:00,  3.40it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▊        | 93/500 [00:24<02:03,  3.29it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▊        | 93/500 [00:24<02:03,  3.29it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 94/500 [00:24<01:51,  3.65it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 94/500 [00:25<01:51,  3.65it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 95/500 [00:25<01:50,  3.68it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 95/500 [00:25<01:50,  3.68it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 96/500 [00:25<01:53,  3.57it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 96/500 [00:25<01:53,  3.57it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 97/500 [00:25<01:46,  3.77it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  19%|█▉        | 97/500 [00:26<01:46,  3.77it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  20%|█▉        | 98/500 [00:26<01:47,  3.75it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  20%|█▉        | 98/500 [00:26<01:47,  3.75it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  20%|█▉        | 99/500 [00:26<01:42,  3.92it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  20%|█▉        | 99/500 [00:26<01:42,  3.92it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.57e-01 :  20%|██        | 100/500 [00:26<01:35,  4.18it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  20%|██        | 100/500 [00:26<01:35,  4.18it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  20%|██        | 101/500 [00:26<01:51,  3.58it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  20%|██        | 101/500 [00:27<01:51,  3.58it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  20%|██        | 102/500 [00:27<02:29,  2.66it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  20%|██        | 102/500 [00:27<02:29,  2.66it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 103/500 [00:27<02:47,  2.37it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 103/500 [00:28<02:47,  2.37it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 104/500 [00:28<02:56,  2.25it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 104/500 [00:28<02:56,  2.25it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 105/500 [00:28<02:58,  2.21it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 105/500 [00:29<02:58,  2.21it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 106/500 [00:29<03:07,  2.10it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██        | 106/500 [00:30<03:07,  2.10it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██▏       | 107/500 [00:30<03:21,  1.95it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  21%|██▏       | 107/500 [00:30<03:21,  1.95it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 108/500 [00:30<03:31,  1.85it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 108/500 [00:31<03:31,  1.85it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 109/500 [00:31<03:22,  1.93it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 109/500 [00:31<03:22,  1.93it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 110/500 [00:31<03:00,  2.17it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 110/500 [00:31<03:00,  2.17it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 111/500 [00:31<02:48,  2.31it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 111/500 [00:32<02:48,  2.31it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 112/500 [00:32<02:56,  2.20it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  22%|██▏       | 112/500 [00:32<02:56,  2.20it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 113/500 [00:32<02:53,  2.23it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 113/500 [00:33<02:53,  2.23it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 114/500 [00:33<03:06,  2.07it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 114/500 [00:34<03:06,  2.07it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 115/500 [00:34<03:31,  1.82it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 115/500 [00:34<03:31,  1.82it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 116/500 [00:34<03:28,  1.84it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 116/500 [00:35<03:28,  1.84it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 117/500 [00:35<03:23,  1.89it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  23%|██▎       | 117/500 [00:35<03:23,  1.89it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▎       | 118/500 [00:35<03:15,  1.96it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▎       | 118/500 [00:35<03:15,  1.96it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 119/500 [00:35<02:54,  2.19it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 119/500 [00:36<02:54,  2.19it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 120/500 [00:36<03:09,  2.00it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 120/500 [00:37<03:09,  2.00it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 121/500 [00:37<03:20,  1.89it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 121/500 [00:37<03:20,  1.89it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 122/500 [00:37<02:54,  2.17it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  24%|██▍       | 122/500 [00:37<02:54,  2.17it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▍       | 123/500 [00:37<02:47,  2.25it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▍       | 123/500 [00:38<02:47,  2.25it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▍       | 124/500 [00:38<02:52,  2.18it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▍       | 124/500 [00:38<02:52,  2.18it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▌       | 125/500 [00:38<02:42,  2.31it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▌       | 125/500 [00:39<02:42,  2.31it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▌       | 126/500 [00:39<02:49,  2.21it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▌       | 126/500 [00:39<02:49,  2.21it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▌       | 127/500 [00:39<02:46,  2.24it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  25%|██▌       | 127/500 [00:40<02:46,  2.24it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 128/500 [00:40<03:03,  2.03it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 128/500 [00:40<03:03,  2.03it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 129/500 [00:40<02:52,  2.15it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 129/500 [00:41<02:52,  2.15it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 130/500 [00:41<03:14,  1.90it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 130/500 [00:41<03:14,  1.90it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 131/500 [00:41<03:14,  1.89it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▌       | 131/500 [00:42<03:14,  1.89it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  26%|██▋       | 132/500 [00:42<03:29,  1.75it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  26%|██▋       | 132/500 [00:42<03:29,  1.75it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 133/500 [00:42<02:51,  2.14it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 133/500 [00:43<02:51,  2.14it/s]
[TorchDR] DR Loss : 1.45e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 134/500 [00:43<02:43,  2.23it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 134/500 [00:43<02:43,  2.23it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 135/500 [00:43<03:00,  2.03it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 135/500 [00:44<03:00,  2.03it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 136/500 [00:44<03:00,  2.02it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 136/500 [00:44<03:00,  2.02it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 137/500 [00:44<02:38,  2.29it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  27%|██▋       | 137/500 [00:44<02:38,  2.29it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 138/500 [00:44<02:44,  2.20it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 138/500 [00:45<02:44,  2.20it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 139/500 [00:45<02:27,  2.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 139/500 [00:45<02:27,  2.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 140/500 [00:45<02:22,  2.52it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 140/500 [00:45<02:22,  2.52it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 141/500 [00:45<02:04,  2.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 141/500 [00:46<02:04,  2.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 142/500 [00:46<02:09,  2.76it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  28%|██▊       | 142/500 [00:46<02:09,  2.76it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▊       | 143/500 [00:46<01:52,  3.18it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▊       | 143/500 [00:46<01:52,  3.18it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 144/500 [00:46<01:39,  3.56it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 144/500 [00:46<01:39,  3.56it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 145/500 [00:46<01:41,  3.50it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 145/500 [00:47<01:41,  3.50it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 146/500 [00:47<01:42,  3.45it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 146/500 [00:47<01:42,  3.45it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 147/500 [00:47<01:43,  3.43it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  29%|██▉       | 147/500 [00:47<01:43,  3.43it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  30%|██▉       | 148/500 [00:47<01:33,  3.77it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  30%|██▉       | 148/500 [00:48<01:33,  3.77it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  30%|██▉       | 149/500 [00:48<01:36,  3.63it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  30%|██▉       | 149/500 [00:48<01:36,  3.63it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.50e-01 :  30%|███       | 150/500 [00:48<01:35,  3.65it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  30%|███       | 150/500 [00:48<01:35,  3.65it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  30%|███       | 151/500 [00:48<01:41,  3.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  30%|███       | 151/500 [00:49<01:41,  3.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  30%|███       | 152/500 [00:49<02:03,  2.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  30%|███       | 152/500 [00:49<02:03,  2.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 153/500 [00:49<01:57,  2.96it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 153/500 [00:49<01:57,  2.96it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 154/500 [00:49<02:03,  2.81it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 154/500 [00:50<02:03,  2.81it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 155/500 [00:50<02:07,  2.71it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 155/500 [00:50<02:07,  2.71it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 156/500 [00:50<02:10,  2.64it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███       | 156/500 [00:50<02:10,  2.64it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███▏      | 157/500 [00:50<02:01,  2.82it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  31%|███▏      | 157/500 [00:51<02:01,  2.82it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 158/500 [00:51<01:52,  3.04it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 158/500 [00:51<01:52,  3.04it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 159/500 [00:51<01:49,  3.12it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 159/500 [00:51<01:49,  3.12it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 160/500 [00:51<02:00,  2.82it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 160/500 [00:52<02:00,  2.82it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 161/500 [00:52<02:04,  2.72it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 161/500 [00:52<02:04,  2.72it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 162/500 [00:52<01:54,  2.96it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  32%|███▏      | 162/500 [00:52<01:54,  2.96it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 163/500 [00:52<01:42,  3.28it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 163/500 [00:53<01:42,  3.28it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 164/500 [00:53<01:32,  3.65it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 164/500 [00:53<01:32,  3.65it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 165/500 [00:53<01:31,  3.67it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 165/500 [00:53<01:31,  3.67it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 166/500 [00:53<01:26,  3.85it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 166/500 [00:53<01:26,  3.85it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 167/500 [00:53<01:20,  4.13it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  33%|███▎      | 167/500 [00:54<01:20,  4.13it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▎      | 168/500 [00:54<01:25,  3.86it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▎      | 168/500 [00:54<01:25,  3.86it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 169/500 [00:54<01:19,  4.14it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 169/500 [00:54<01:19,  4.14it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 170/500 [00:54<01:15,  4.36it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 170/500 [00:54<01:15,  4.36it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 171/500 [00:54<01:19,  4.15it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 171/500 [00:54<01:19,  4.15it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 172/500 [00:54<01:08,  4.79it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  34%|███▍      | 172/500 [00:55<01:08,  4.79it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▍      | 173/500 [00:55<01:16,  4.26it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▍      | 173/500 [00:55<01:16,  4.26it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▍      | 174/500 [00:55<01:13,  4.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▍      | 174/500 [00:55<01:13,  4.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▌      | 175/500 [00:55<01:10,  4.59it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▌      | 175/500 [00:55<01:10,  4.59it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▌      | 176/500 [00:55<01:15,  4.29it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▌      | 176/500 [00:56<01:15,  4.29it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▌      | 177/500 [00:56<01:12,  4.47it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  35%|███▌      | 177/500 [00:56<01:12,  4.47it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 178/500 [00:56<01:12,  4.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 178/500 [00:56<01:12,  4.44it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 179/500 [00:56<01:19,  4.03it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 179/500 [00:56<01:19,  4.03it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 180/500 [00:56<01:30,  3.52it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 180/500 [00:57<01:30,  3.52it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 181/500 [00:57<01:25,  3.74it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▌      | 181/500 [00:57<01:25,  3.74it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▋      | 182/500 [00:57<01:18,  4.05it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  36%|███▋      | 182/500 [00:57<01:18,  4.05it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 183/500 [00:57<01:13,  4.29it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 183/500 [00:57<01:13,  4.29it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 184/500 [00:57<01:10,  4.48it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 184/500 [00:57<01:10,  4.48it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 185/500 [00:57<01:08,  4.62it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 185/500 [00:58<01:08,  4.62it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 186/500 [00:58<01:06,  4.72it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 186/500 [00:58<01:06,  4.72it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 187/500 [00:58<01:05,  4.80it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  37%|███▋      | 187/500 [00:58<01:05,  4.80it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 188/500 [00:58<01:10,  4.42it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 188/500 [00:58<01:10,  4.42it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 189/500 [00:58<01:10,  4.41it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 189/500 [00:59<01:10,  4.41it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 190/500 [00:59<01:08,  4.55it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 190/500 [00:59<01:08,  4.55it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 191/500 [00:59<01:12,  4.27it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 191/500 [00:59<01:12,  4.27it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 192/500 [00:59<01:09,  4.46it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  38%|███▊      | 192/500 [00:59<01:09,  4.46it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▊      | 193/500 [00:59<01:09,  4.43it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▊      | 193/500 [00:59<01:09,  4.43it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 194/500 [00:59<01:06,  4.58it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 194/500 [01:00<01:06,  4.58it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 195/500 [01:00<01:04,  4.71it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 195/500 [01:00<01:04,  4.71it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 196/500 [01:00<01:09,  4.35it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 196/500 [01:00<01:09,  4.35it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 197/500 [01:00<01:00,  4.98it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  39%|███▉      | 197/500 [01:00<01:00,  4.98it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|███▉      | 198/500 [01:00<01:06,  4.53it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|███▉      | 198/500 [01:01<01:06,  4.53it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|███▉      | 199/500 [01:01<01:04,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|███▉      | 199/500 [01:01<01:04,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|████      | 200/500 [01:01<01:05,  4.55it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|████      | 200/500 [01:01<01:05,  4.55it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|████      | 201/500 [01:01<01:04,  4.67it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|████      | 201/500 [01:01<01:04,  4.67it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|████      | 202/500 [01:01<01:11,  4.19it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  40%|████      | 202/500 [01:01<01:11,  4.19it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 203/500 [01:01<01:07,  4.40it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 203/500 [01:02<01:07,  4.40it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 204/500 [01:02<01:02,  4.74it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 204/500 [01:02<01:02,  4.74it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 205/500 [01:02<01:01,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 205/500 [01:02<01:01,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 206/500 [01:02<01:00,  4.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████      | 206/500 [01:02<01:00,  4.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████▏     | 207/500 [01:02<00:53,  5.45it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  41%|████▏     | 207/500 [01:02<00:53,  5.45it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 208/500 [01:02<01:00,  4.79it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 208/500 [01:03<01:00,  4.79it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 209/500 [01:03<01:02,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 209/500 [01:03<01:02,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 210/500 [01:03<01:00,  4.76it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 210/500 [01:03<01:00,  4.76it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 211/500 [01:03<00:59,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 211/500 [01:03<00:59,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 212/500 [01:03<00:59,  4.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  42%|████▏     | 212/500 [01:03<00:59,  4.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 213/500 [01:03<00:55,  5.13it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 213/500 [01:04<00:55,  5.13it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 214/500 [01:04<00:56,  5.09it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 214/500 [01:04<00:56,  5.09it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 215/500 [01:04<00:56,  5.06it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 215/500 [01:04<00:56,  5.06it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 216/500 [01:04<00:58,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 216/500 [01:04<00:58,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 217/500 [01:04<00:57,  4.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  43%|████▎     | 217/500 [01:04<00:57,  4.88it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▎     | 218/500 [01:04<00:57,  4.91it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▎     | 218/500 [01:05<00:57,  4.91it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 219/500 [01:05<00:57,  4.92it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 219/500 [01:05<00:57,  4.92it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 220/500 [01:05<01:02,  4.49it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 220/500 [01:05<01:02,  4.49it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 221/500 [01:05<01:02,  4.46it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 221/500 [01:05<01:02,  4.46it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 222/500 [01:05<01:00,  4.60it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  44%|████▍     | 222/500 [01:06<01:00,  4.60it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▍     | 223/500 [01:06<01:04,  4.28it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▍     | 223/500 [01:06<01:04,  4.28it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▍     | 224/500 [01:06<01:03,  4.31it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▍     | 224/500 [01:06<01:03,  4.31it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▌     | 225/500 [01:06<01:01,  4.50it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▌     | 225/500 [01:06<01:01,  4.50it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▌     | 226/500 [01:06<00:59,  4.63it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▌     | 226/500 [01:06<00:59,  4.63it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▌     | 227/500 [01:06<00:57,  4.73it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  45%|████▌     | 227/500 [01:07<00:57,  4.73it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 228/500 [01:07<00:56,  4.80it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 228/500 [01:07<00:56,  4.80it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 229/500 [01:07<00:55,  4.85it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 229/500 [01:07<00:55,  4.85it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 230/500 [01:07<01:00,  4.45it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 230/500 [01:07<01:00,  4.45it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 231/500 [01:07<00:58,  4.61it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▌     | 231/500 [01:08<00:58,  4.61it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▋     | 232/500 [01:08<00:59,  4.54it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  46%|████▋     | 232/500 [01:08<00:59,  4.54it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 233/500 [01:08<00:57,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 233/500 [01:08<00:57,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 234/500 [01:08<00:53,  4.95it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 234/500 [01:08<00:53,  4.95it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 235/500 [01:08<00:53,  4.96it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 235/500 [01:08<00:53,  4.96it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 236/500 [01:08<00:55,  4.78it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 236/500 [01:09<00:55,  4.78it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 237/500 [01:09<00:54,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  47%|████▋     | 237/500 [01:09<00:54,  4.83it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 238/500 [01:09<00:59,  4.43it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 238/500 [01:09<00:59,  4.43it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 239/500 [01:09<00:51,  5.04it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 239/500 [01:09<00:51,  5.04it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 240/500 [01:09<00:59,  4.41it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 240/500 [01:10<00:59,  4.41it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 241/500 [01:10<01:02,  4.16it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 241/500 [01:10<01:02,  4.16it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 242/500 [01:10<01:01,  4.21it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  48%|████▊     | 242/500 [01:10<01:01,  4.21it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▊     | 243/500 [01:10<01:03,  4.05it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▊     | 243/500 [01:10<01:03,  4.05it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 244/500 [01:10<01:01,  4.13it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 244/500 [01:11<01:01,  4.13it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 245/500 [01:11<01:03,  4.00it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 245/500 [01:11<01:03,  4.00it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 246/500 [01:11<00:54,  4.64it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 246/500 [01:11<00:54,  4.64it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 247/500 [01:11<00:58,  4.34it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  49%|████▉     | 247/500 [01:11<00:58,  4.34it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  50%|████▉     | 248/500 [01:11<00:55,  4.51it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  50%|████▉     | 248/500 [01:11<00:55,  4.51it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  50%|████▉     | 249/500 [01:11<00:53,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  50%|████▉     | 249/500 [01:12<00:53,  4.66it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  50%|█████     | 250/500 [01:12<00:55,  4.54it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  50%|█████     | 250/500 [01:12<00:55,  4.54it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 3.48e-01 :  50%|█████     | 251/500 [01:12<00:53,  4.67it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.48e-01 :  50%|█████     | 251/500 [01:12<00:53,  4.67it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.48e-01 :  50%|█████     | 252/500 [01:12<00:59,  4.19it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.48e-01 :  50%|█████     | 252/500 [01:12<00:59,  4.19it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.48e-01 :  51%|█████     | 253/500 [01:12<00:54,  4.56it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.48e-01 :  51%|█████     | 253/500 [01:12<00:54,  4.56it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.48e-01 :  51%|█████     | 254/500 [01:12<00:54,  4.52it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  51%|█████     | 254/500 [01:13<00:54,  4.52it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  51%|█████     | 255/500 [01:13<01:00,  4.07it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  51%|█████     | 255/500 [01:13<01:00,  4.07it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  51%|█████     | 256/500 [01:13<01:01,  3.95it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  51%|█████     | 256/500 [01:13<01:01,  3.95it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  51%|█████▏    | 257/500 [01:13<00:59,  4.08it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  51%|█████▏    | 257/500 [01:13<00:59,  4.08it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 258/500 [01:13<00:56,  4.30it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 258/500 [01:14<00:56,  4.30it/s]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 259/500 [01:14<00:58,  4.11it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 259/500 [01:14<00:58,  4.11it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 260/500 [01:14<00:55,  4.34it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 260/500 [01:14<00:55,  4.34it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 261/500 [01:14<00:48,  4.97it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 261/500 [01:14<00:48,  4.97it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 262/500 [01:14<00:52,  4.53it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  52%|█████▏    | 262/500 [01:15<00:52,  4.53it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 263/500 [01:15<00:53,  4.44it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 263/500 [01:15<00:53,  4.44it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 264/500 [01:15<00:51,  4.61it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 264/500 [01:15<00:51,  4.61it/s]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 265/500 [01:15<01:01,  3.81it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 265/500 [01:15<01:01,  3.81it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 266/500 [01:15<00:59,  3.95it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 266/500 [01:16<00:59,  3.95it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 267/500 [01:16<01:02,  3.75it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  53%|█████▎    | 267/500 [01:16<01:02,  3.75it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▎    | 268/500 [01:16<01:04,  3.62it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▎    | 268/500 [01:16<01:04,  3.62it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 269/500 [01:16<00:58,  3.94it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 269/500 [01:16<00:58,  3.94it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 270/500 [01:16<00:54,  4.20it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 270/500 [01:17<00:54,  4.20it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 271/500 [01:17<00:56,  4.05it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 271/500 [01:17<00:56,  4.05it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 272/500 [01:17<00:55,  4.14it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  54%|█████▍    | 272/500 [01:17<00:55,  4.14it/s]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.48e-01 :  55%|█████▍    | 273/500 [01:17<00:52,  4.35it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▍    | 273/500 [01:17<00:52,  4.35it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▍    | 274/500 [01:17<00:54,  4.14it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▍    | 274/500 [01:18<00:54,  4.14it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▌    | 275/500 [01:18<00:53,  4.20it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▌    | 275/500 [01:18<00:53,  4.20it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▌    | 276/500 [01:18<00:51,  4.39it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▌    | 276/500 [01:18<00:51,  4.39it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▌    | 277/500 [01:18<00:55,  4.03it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  55%|█████▌    | 277/500 [01:18<00:55,  4.03it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 278/500 [01:18<00:52,  4.27it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 278/500 [01:19<00:52,  4.27it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 279/500 [01:19<00:54,  4.09it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 279/500 [01:19<00:54,  4.09it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 280/500 [01:19<00:52,  4.17it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 280/500 [01:19<00:52,  4.17it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 281/500 [01:19<00:48,  4.55it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▌    | 281/500 [01:19<00:48,  4.55it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▋    | 282/500 [01:19<00:46,  4.67it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  56%|█████▋    | 282/500 [01:19<00:46,  4.67it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 283/500 [01:19<00:47,  4.59it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 283/500 [01:20<00:47,  4.59it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 284/500 [01:20<00:45,  4.70it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 284/500 [01:20<00:45,  4.70it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 285/500 [01:20<00:49,  4.35it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 285/500 [01:20<00:49,  4.35it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 286/500 [01:20<00:47,  4.53it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 286/500 [01:20<00:47,  4.53it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 287/500 [01:20<00:47,  4.48it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  57%|█████▋    | 287/500 [01:20<00:47,  4.48it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 288/500 [01:20<00:44,  4.80it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 288/500 [01:21<00:44,  4.80it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 289/500 [01:21<00:39,  5.39it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 289/500 [01:21<00:39,  5.39it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 290/500 [01:21<00:44,  4.77it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 290/500 [01:21<00:44,  4.77it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 291/500 [01:21<00:43,  4.82it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 291/500 [01:21<00:43,  4.82it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 292/500 [01:21<00:42,  4.87it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  58%|█████▊    | 292/500 [01:22<00:42,  4.87it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▊    | 293/500 [01:22<00:48,  4.29it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▊    | 293/500 [01:22<00:48,  4.29it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 294/500 [01:22<00:47,  4.30it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 294/500 [01:22<00:47,  4.30it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 295/500 [01:22<00:49,  4.11it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 295/500 [01:22<00:49,  4.11it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 296/500 [01:22<00:48,  4.18it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 296/500 [01:23<00:48,  4.18it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 297/500 [01:23<00:52,  3.89it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  59%|█████▉    | 297/500 [01:23<00:52,  3.89it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.48e-01 :  60%|█████▉    | 298/500 [01:23<00:48,  4.16it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.48e-01 :  60%|█████▉    | 298/500 [01:23<00:48,  4.16it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.48e-01 :  60%|█████▉    | 299/500 [01:23<00:50,  4.01it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.48e-01 :  60%|█████▉    | 299/500 [01:23<00:50,  4.01it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.48e-01 :  60%|██████    | 300/500 [01:23<00:46,  4.26it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  60%|██████    | 300/500 [01:23<00:46,  4.26it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  60%|██████    | 301/500 [01:23<00:44,  4.46it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  60%|██████    | 301/500 [01:24<00:44,  4.46it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  60%|██████    | 302/500 [01:24<00:44,  4.43it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  60%|██████    | 302/500 [01:24<00:44,  4.43it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 303/500 [01:24<00:43,  4.57it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 303/500 [01:24<00:43,  4.57it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 304/500 [01:24<00:45,  4.29it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 304/500 [01:24<00:45,  4.29it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 305/500 [01:24<00:43,  4.46it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 305/500 [01:25<00:43,  4.46it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 306/500 [01:25<00:43,  4.44it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████    | 306/500 [01:25<00:43,  4.44it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████▏   | 307/500 [01:25<00:46,  4.19it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  61%|██████▏   | 307/500 [01:25<00:46,  4.19it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 308/500 [01:25<00:45,  4.24it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 308/500 [01:25<00:45,  4.24it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 309/500 [01:25<00:43,  4.44it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 309/500 [01:25<00:43,  4.44it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 310/500 [01:26<00:45,  4.19it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 310/500 [01:26<00:45,  4.19it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 311/500 [01:26<00:44,  4.24it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 311/500 [01:26<00:44,  4.24it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 312/500 [01:26<00:42,  4.44it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  62%|██████▏   | 312/500 [01:26<00:42,  4.44it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 313/500 [01:26<00:40,  4.58it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 313/500 [01:26<00:40,  4.58it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 314/500 [01:26<00:39,  4.70it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 314/500 [01:27<00:39,  4.70it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 315/500 [01:27<00:38,  4.79it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 315/500 [01:27<00:38,  4.79it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 316/500 [01:27<00:37,  4.85it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 316/500 [01:27<00:37,  4.85it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 317/500 [01:27<00:42,  4.28it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  63%|██████▎   | 317/500 [01:27<00:42,  4.28it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  64%|██████▎   | 318/500 [01:27<00:46,  3.94it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  64%|██████▎   | 318/500 [01:28<00:46,  3.94it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 319/500 [01:28<00:46,  3.86it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 319/500 [01:28<00:46,  3.86it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 320/500 [01:28<00:43,  4.14it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 320/500 [01:28<00:43,  4.14it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 321/500 [01:28<00:40,  4.37it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 321/500 [01:28<00:40,  4.37it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 322/500 [01:28<00:39,  4.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  64%|██████▍   | 322/500 [01:28<00:39,  4.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▍   | 323/500 [01:28<00:39,  4.48it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▍   | 323/500 [01:29<00:39,  4.48it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▍   | 324/500 [01:29<00:38,  4.61it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▍   | 324/500 [01:29<00:38,  4.61it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▌   | 325/500 [01:29<00:40,  4.31it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▌   | 325/500 [01:29<00:40,  4.31it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▌   | 326/500 [01:29<00:40,  4.32it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▌   | 326/500 [01:29<00:40,  4.32it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▌   | 327/500 [01:29<00:42,  4.11it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  65%|██████▌   | 327/500 [01:30<00:42,  4.11it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 328/500 [01:30<00:41,  4.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 328/500 [01:30<00:41,  4.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 329/500 [01:30<00:42,  4.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 329/500 [01:30<00:42,  4.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 330/500 [01:30<00:44,  3.79it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 330/500 [01:30<00:44,  3.79it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 331/500 [01:30<00:42,  3.94it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▌   | 331/500 [01:31<00:42,  3.94it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▋   | 332/500 [01:31<00:44,  3.74it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  66%|██████▋   | 332/500 [01:31<00:44,  3.74it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 333/500 [01:31<00:41,  4.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 333/500 [01:31<00:41,  4.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 334/500 [01:31<00:43,  3.81it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 334/500 [01:31<00:43,  3.81it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 335/500 [01:31<00:40,  4.10it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 335/500 [01:32<00:40,  4.10it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 336/500 [01:32<00:37,  4.32it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 336/500 [01:32<00:37,  4.32it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 337/500 [01:32<00:36,  4.49it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  67%|██████▋   | 337/500 [01:32<00:36,  4.49it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 338/500 [01:32<00:39,  4.08it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 338/500 [01:32<00:39,  4.08it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 339/500 [01:32<00:42,  3.83it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 339/500 [01:33<00:42,  3.83it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 340/500 [01:33<00:43,  3.67it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 340/500 [01:33<00:43,  3.67it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 341/500 [01:33<00:39,  3.98it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 341/500 [01:33<00:39,  3.98it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 342/500 [01:33<00:40,  3.89it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  68%|██████▊   | 342/500 [01:33<00:40,  3.89it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▊   | 343/500 [01:33<00:37,  4.17it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▊   | 343/500 [01:34<00:37,  4.17it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 344/500 [01:34<00:36,  4.23it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 344/500 [01:34<00:36,  4.23it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 345/500 [01:34<00:34,  4.43it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 345/500 [01:34<00:34,  4.43it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 346/500 [01:34<00:33,  4.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 346/500 [01:34<00:33,  4.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 347/500 [01:34<00:32,  4.70it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  69%|██████▉   | 347/500 [01:34<00:32,  4.70it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  70%|██████▉   | 348/500 [01:34<00:31,  4.77it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  70%|██████▉   | 348/500 [01:35<00:31,  4.77it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  70%|██████▉   | 349/500 [01:35<00:31,  4.84it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  70%|██████▉   | 349/500 [01:35<00:31,  4.84it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 7.45e-03 :  70%|███████   | 350/500 [01:35<00:35,  4.26it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  70%|███████   | 350/500 [01:35<00:35,  4.26it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  70%|███████   | 351/500 [01:35<00:37,  3.94it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  70%|███████   | 351/500 [01:35<00:37,  3.94it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  70%|███████   | 352/500 [01:35<00:35,  4.20it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  70%|███████   | 352/500 [01:36<00:35,  4.20it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 353/500 [01:36<00:37,  3.90it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 353/500 [01:36<00:37,  3.90it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 354/500 [01:36<00:34,  4.17it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 354/500 [01:36<00:34,  4.17it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 355/500 [01:36<00:37,  3.89it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 355/500 [01:36<00:37,  3.89it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 356/500 [01:36<00:34,  4.16it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████   | 356/500 [01:37<00:34,  4.16it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████▏  | 357/500 [01:37<00:32,  4.38it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  71%|███████▏  | 357/500 [01:37<00:32,  4.38it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 358/500 [01:37<00:31,  4.54it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 358/500 [01:37<00:31,  4.54it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 359/500 [01:37<00:33,  4.26it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 359/500 [01:37<00:33,  4.26it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 360/500 [01:37<00:32,  4.30it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 360/500 [01:38<00:32,  4.30it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 361/500 [01:38<00:29,  4.66it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 361/500 [01:38<00:29,  4.66it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 362/500 [01:38<00:30,  4.56it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  72%|███████▏  | 362/500 [01:38<00:30,  4.56it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 363/500 [01:38<00:29,  4.68it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 363/500 [01:38<00:29,  4.68it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 364/500 [01:38<00:28,  4.77it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 364/500 [01:38<00:28,  4.77it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 365/500 [01:38<00:30,  4.40it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 365/500 [01:39<00:30,  4.40it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 366/500 [01:39<00:29,  4.56it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 366/500 [01:39<00:29,  4.56it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 367/500 [01:39<00:29,  4.51it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  73%|███████▎  | 367/500 [01:39<00:29,  4.51it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▎  | 368/500 [01:39<00:28,  4.64it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▎  | 368/500 [01:39<00:28,  4.64it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 369/500 [01:39<00:27,  4.73it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 369/500 [01:39<00:27,  4.73it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 370/500 [01:39<00:27,  4.80it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 370/500 [01:40<00:27,  4.80it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 371/500 [01:40<00:26,  4.86it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 371/500 [01:40<00:26,  4.86it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 372/500 [01:40<00:28,  4.45it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  74%|███████▍  | 372/500 [01:40<00:28,  4.45it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▍  | 373/500 [01:40<00:25,  5.07it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▍  | 373/500 [01:40<00:25,  5.07it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▍  | 374/500 [01:40<00:27,  4.58it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▍  | 374/500 [01:40<00:27,  4.58it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▌  | 375/500 [01:40<00:24,  5.19it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▌  | 375/500 [01:41<00:24,  5.19it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▌  | 376/500 [01:41<00:24,  5.14it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▌  | 376/500 [01:41<00:24,  5.14it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▌  | 377/500 [01:41<00:26,  4.62it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  75%|███████▌  | 377/500 [01:41<00:26,  4.62it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 378/500 [01:41<00:26,  4.53it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 378/500 [01:41<00:26,  4.53it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 379/500 [01:41<00:29,  4.09it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 379/500 [01:42<00:29,  4.09it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 380/500 [01:42<00:30,  3.97it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 380/500 [01:42<00:30,  3.97it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 381/500 [01:42<00:28,  4.22it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▌  | 381/500 [01:42<00:28,  4.22it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▋  | 382/500 [01:42<00:26,  4.44it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  76%|███████▋  | 382/500 [01:42<00:26,  4.44it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 383/500 [01:42<00:25,  4.60it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 383/500 [01:43<00:25,  4.60it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 384/500 [01:43<00:24,  4.71it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 384/500 [01:43<00:24,  4.71it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 385/500 [01:43<00:25,  4.59it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 385/500 [01:43<00:25,  4.59it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 386/500 [01:43<00:24,  4.70it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 386/500 [01:43<00:24,  4.70it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 387/500 [01:43<00:23,  4.79it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  77%|███████▋  | 387/500 [01:43<00:23,  4.79it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 388/500 [01:43<00:23,  4.84it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 388/500 [01:44<00:23,  4.84it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 389/500 [01:44<00:22,  4.88it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 389/500 [01:44<00:22,  4.88it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 390/500 [01:44<00:24,  4.47it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 390/500 [01:44<00:24,  4.47it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 391/500 [01:44<00:21,  5.08it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 391/500 [01:44<00:21,  5.08it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 392/500 [01:44<00:24,  4.41it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  78%|███████▊  | 392/500 [01:45<00:24,  4.41it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▊  | 393/500 [01:45<00:25,  4.18it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▊  | 393/500 [01:45<00:25,  4.18it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 394/500 [01:45<00:25,  4.22it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 394/500 [01:45<00:25,  4.22it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 395/500 [01:45<00:26,  3.91it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 395/500 [01:45<00:26,  3.91it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 396/500 [01:45<00:24,  4.19it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 396/500 [01:45<00:24,  4.19it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 397/500 [01:45<00:23,  4.39it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  79%|███████▉  | 397/500 [01:46<00:23,  4.39it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  80%|███████▉  | 398/500 [01:46<00:24,  4.17it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  80%|███████▉  | 398/500 [01:46<00:24,  4.17it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  80%|███████▉  | 399/500 [01:46<00:23,  4.23it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  80%|███████▉  | 399/500 [01:46<00:23,  4.23it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 4.99e-03 :  80%|████████  | 400/500 [01:46<00:22,  4.44it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 3.89e-03 :  80%|████████  | 400/500 [01:46<00:22,  4.44it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 3.89e-03 :  80%|████████  | 401/500 [01:46<00:20,  4.77it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 3.89e-03 :  80%|████████  | 401/500 [01:47<00:20,  4.77it/s]
[TorchDR] DR Loss : 1.21e+01 | Grad norm : 3.89e-03 :  80%|████████  | 402/500 [01:47<00:21,  4.63it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  80%|████████  | 402/500 [01:47<00:21,  4.63it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 403/500 [01:47<00:23,  4.13it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 403/500 [01:47<00:23,  4.13it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 404/500 [01:47<00:21,  4.37it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 404/500 [01:47<00:21,  4.37it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 405/500 [01:47<00:23,  4.02it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 405/500 [01:48<00:23,  4.02it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 406/500 [01:48<00:22,  4.25it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████  | 406/500 [01:48<00:22,  4.25it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████▏ | 407/500 [01:48<00:20,  4.43it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  81%|████████▏ | 407/500 [01:48<00:20,  4.43it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 408/500 [01:48<00:22,  4.06it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 408/500 [01:48<00:22,  4.06it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 409/500 [01:48<00:23,  3.81it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 409/500 [01:49<00:23,  3.81it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 410/500 [01:49<00:35,  2.54it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 410/500 [01:50<00:35,  2.54it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 411/500 [01:50<00:40,  2.19it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 411/500 [01:50<00:40,  2.19it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 412/500 [01:50<00:46,  1.89it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  82%|████████▏ | 412/500 [01:51<00:46,  1.89it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 413/500 [01:51<00:45,  1.92it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 413/500 [01:52<00:45,  1.92it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 414/500 [01:52<00:52,  1.65it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 414/500 [01:52<00:52,  1.65it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 415/500 [01:52<00:53,  1.58it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 415/500 [01:53<00:53,  1.58it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 416/500 [01:53<00:54,  1.53it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 416/500 [01:54<00:54,  1.53it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 417/500 [01:54<00:52,  1.57it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  83%|████████▎ | 417/500 [01:54<00:52,  1.57it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▎ | 418/500 [01:54<00:48,  1.71it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▎ | 418/500 [01:55<00:48,  1.71it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 419/500 [01:55<00:50,  1.61it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 419/500 [01:55<00:50,  1.61it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 420/500 [01:55<00:49,  1.60it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 420/500 [01:56<00:49,  1.60it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 421/500 [01:56<00:50,  1.57it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 421/500 [01:57<00:50,  1.57it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 422/500 [01:57<00:51,  1.52it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  84%|████████▍ | 422/500 [01:58<00:51,  1.52it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▍ | 423/500 [01:58<00:54,  1.41it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▍ | 423/500 [01:59<00:54,  1.41it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▍ | 424/500 [01:59<01:02,  1.21it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▍ | 424/500 [01:59<01:02,  1.21it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▌ | 425/500 [01:59<00:56,  1.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▌ | 425/500 [02:00<00:56,  1.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▌ | 426/500 [02:00<00:52,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▌ | 426/500 [02:01<00:52,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▌ | 427/500 [02:01<00:51,  1.41it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  85%|████████▌ | 427/500 [02:01<00:51,  1.41it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 428/500 [02:01<00:52,  1.37it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 428/500 [02:02<00:52,  1.37it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 429/500 [02:02<00:53,  1.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 429/500 [02:03<00:53,  1.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 430/500 [02:03<00:52,  1.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 430/500 [02:04<00:52,  1.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 431/500 [02:04<00:50,  1.37it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▌ | 431/500 [02:04<00:50,  1.37it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▋ | 432/500 [02:04<00:49,  1.39it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  86%|████████▋ | 432/500 [02:05<00:49,  1.39it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 433/500 [02:05<00:43,  1.52it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 433/500 [02:05<00:43,  1.52it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 434/500 [02:05<00:40,  1.64it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 434/500 [02:06<00:40,  1.64it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 435/500 [02:06<00:41,  1.57it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 435/500 [02:06<00:41,  1.57it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 436/500 [02:06<00:36,  1.77it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 436/500 [02:07<00:36,  1.77it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 437/500 [02:07<00:34,  1.84it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  87%|████████▋ | 437/500 [02:07<00:34,  1.84it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 438/500 [02:07<00:31,  1.99it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 438/500 [02:08<00:31,  1.99it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 439/500 [02:08<00:28,  2.16it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 439/500 [02:08<00:28,  2.16it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 440/500 [02:08<00:32,  1.84it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 440/500 [02:09<00:32,  1.84it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 441/500 [02:09<00:36,  1.61it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 441/500 [02:10<00:36,  1.61it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 442/500 [02:10<00:36,  1.58it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  88%|████████▊ | 442/500 [02:11<00:36,  1.58it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▊ | 443/500 [02:11<00:35,  1.60it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▊ | 443/500 [02:11<00:35,  1.60it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 444/500 [02:11<00:40,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 444/500 [02:12<00:40,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 445/500 [02:12<00:35,  1.54it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 445/500 [02:12<00:35,  1.54it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 446/500 [02:12<00:31,  1.73it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 446/500 [02:13<00:31,  1.73it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 447/500 [02:13<00:32,  1.66it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  89%|████████▉ | 447/500 [02:14<00:32,  1.66it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  90%|████████▉ | 448/500 [02:14<00:33,  1.56it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  90%|████████▉ | 448/500 [02:14<00:33,  1.56it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  90%|████████▉ | 449/500 [02:14<00:32,  1.59it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  90%|████████▉ | 449/500 [02:15<00:32,  1.59it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.89e-03 :  90%|█████████ | 450/500 [02:15<00:31,  1.61it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  90%|█████████ | 450/500 [02:16<00:31,  1.61it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  90%|█████████ | 451/500 [02:16<00:29,  1.66it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  90%|█████████ | 451/500 [02:16<00:29,  1.66it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  90%|█████████ | 452/500 [02:16<00:33,  1.44it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  90%|█████████ | 452/500 [02:17<00:33,  1.44it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 453/500 [02:17<00:31,  1.50it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 453/500 [02:18<00:31,  1.50it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 454/500 [02:18<00:32,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 454/500 [02:19<00:32,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 455/500 [02:19<00:33,  1.35it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 455/500 [02:19<00:33,  1.35it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 456/500 [02:19<00:30,  1.43it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████ | 456/500 [02:20<00:30,  1.43it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████▏| 457/500 [02:20<00:30,  1.39it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  91%|█████████▏| 457/500 [02:21<00:30,  1.39it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 458/500 [02:21<00:34,  1.23it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 458/500 [02:22<00:34,  1.23it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 459/500 [02:22<00:30,  1.35it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 459/500 [02:22<00:30,  1.35it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 460/500 [02:22<00:30,  1.31it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 460/500 [02:24<00:30,  1.31it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 461/500 [02:24<00:36,  1.08it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 461/500 [02:25<00:36,  1.08it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 462/500 [02:25<00:34,  1.10it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  92%|█████████▏| 462/500 [02:25<00:34,  1.10it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 463/500 [02:25<00:30,  1.21it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 463/500 [02:26<00:30,  1.21it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 464/500 [02:26<00:27,  1.32it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 464/500 [02:27<00:27,  1.32it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 465/500 [02:27<00:27,  1.26it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 465/500 [02:27<00:27,  1.26it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 466/500 [02:27<00:24,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 466/500 [02:28<00:24,  1.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 467/500 [02:28<00:22,  1.47it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  93%|█████████▎| 467/500 [02:28<00:22,  1.47it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▎| 468/500 [02:28<00:18,  1.77it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▎| 468/500 [02:28<00:18,  1.77it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 469/500 [02:28<00:15,  2.06it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 469/500 [02:29<00:15,  2.06it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 470/500 [02:29<00:12,  2.50it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 470/500 [02:29<00:12,  2.50it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 471/500 [02:29<00:10,  2.70it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 471/500 [02:29<00:10,  2.70it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 472/500 [02:29<00:09,  2.86it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  94%|█████████▍| 472/500 [02:30<00:09,  2.86it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▍| 473/500 [02:30<00:09,  3.00it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▍| 473/500 [02:30<00:09,  3.00it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▍| 474/500 [02:30<00:07,  3.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▍| 474/500 [02:30<00:07,  3.40it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▌| 475/500 [02:30<00:07,  3.39it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▌| 475/500 [02:30<00:07,  3.39it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▌| 476/500 [02:30<00:06,  3.74it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▌| 476/500 [02:30<00:06,  3.74it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▌| 477/500 [02:30<00:05,  4.04it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  95%|█████████▌| 477/500 [02:31<00:05,  4.04it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 478/500 [02:31<00:05,  3.95it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 478/500 [02:31<00:05,  3.95it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 479/500 [02:31<00:05,  4.06it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 479/500 [02:31<00:05,  4.06it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 480/500 [02:31<00:04,  4.46it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 480/500 [02:31<00:04,  4.46it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 481/500 [02:31<00:04,  4.62it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▌| 481/500 [02:32<00:04,  4.62it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▋| 482/500 [02:32<00:04,  3.98it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  96%|█████████▋| 482/500 [02:32<00:04,  3.98it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 483/500 [02:32<00:04,  3.90it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 483/500 [02:32<00:04,  3.90it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 484/500 [02:32<00:03,  4.02it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 484/500 [02:32<00:03,  4.02it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 485/500 [02:32<00:03,  3.78it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 485/500 [02:33<00:03,  3.78it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 486/500 [02:33<00:03,  3.76it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 486/500 [02:33<00:03,  3.76it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 487/500 [02:33<00:03,  3.91it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  97%|█████████▋| 487/500 [02:33<00:03,  3.91it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 488/500 [02:33<00:03,  3.35it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 488/500 [02:34<00:03,  3.35it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 489/500 [02:34<00:03,  3.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 489/500 [02:34<00:03,  3.34it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 490/500 [02:34<00:02,  3.44it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 490/500 [02:34<00:02,  3.44it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 491/500 [02:34<00:02,  3.31it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 491/500 [02:35<00:02,  3.31it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 492/500 [02:35<00:02,  3.32it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  98%|█████████▊| 492/500 [02:35<00:02,  3.32it/s]
[TorchDR] DR Loss : 1.20e+01 | Grad norm : 3.23e-03 :  99%|█████████▊| 493/500 [02:35<00:01,  3.69it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▊| 493/500 [02:35<00:01,  3.69it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 494/500 [02:35<00:01,  3.57it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 494/500 [02:35<00:01,  3.57it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 495/500 [02:35<00:01,  3.50it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 495/500 [02:36<00:01,  3.50it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 496/500 [02:36<00:01,  3.46it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 496/500 [02:36<00:01,  3.46it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 497/500 [02:36<00:00,  3.79it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 :  99%|█████████▉| 497/500 [02:36<00:00,  3.79it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 : 100%|█████████▉| 498/500 [02:36<00:00,  3.77it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 : 100%|█████████▉| 498/500 [02:36<00:00,  3.77it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 : 100%|█████████▉| 499/500 [02:36<00:00,  4.06it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 : 100%|█████████▉| 499/500 [02:37<00:00,  4.06it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 : 100%|██████████| 500/500 [02:37<00:00,  4.32it/s]
[TorchDR] DR Loss : 1.19e+01 | Grad norm : 3.23e-03 : 100%|██████████| 500/500 [02:37<00:00,  3.18it/s]
Random state is None
[TorchDR] Initializing DR model COSNE.
[TorchDR] Affinity : computing the Entropic Affinity matrix.
[TorchDR] Affinity : sparsity mode enabled, computing 90 nearest neighbors. If this step is too slow, consider reducing the dimensionality of the data or disabling sparsity.

  0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  3.40e-01 (std =  8.51e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  2.16e-01 (std =  7.86e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  1.31e-01 (std =  6.42e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  7.78e-02 (std =  4.84e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  4.59e-02 (std =  3.47e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  2.72e-02 (std =  2.42e-02) :   0%|          | 0/100 [00:00<?, ?it/s]
[TorchDR] Root search : mean abs value =  2.72e-02 (std =  2.42e-02) :   6%|▌         | 6/100 [00:00<00:02, 36.22it/s]
[TorchDR] Root search : mean abs value =  1.62e-02 (std =  1.66e-02) :   6%|▌         | 6/100 [00:00<00:02, 36.22it/s]
[TorchDR] Root search : mean abs value =  9.73e-03 (std =  1.13e-02) :   6%|▌         | 6/100 [00:00<00:02, 36.22it/s]
[TorchDR] Root search : mean abs value =  5.90e-03 (std =  7.69e-03) :   6%|▌         | 6/100 [00:00<00:02, 36.22it/s]
[TorchDR] Root search : mean abs value =  3.60e-03 (std =  5.23e-03) :   6%|▌         | 6/100 [00:00<00:02, 36.22it/s]
[TorchDR] Root search : mean abs value =  3.60e-03 (std =  5.23e-03) :  10%|█         | 10/100 [00:00<00:02, 33.54it/s]
[TorchDR] Root search : mean abs value =  2.22e-03 (std =  3.56e-03) :  10%|█         | 10/100 [00:00<00:02, 33.54it/s]
[TorchDR] Root search : mean abs value =  1.38e-03 (std =  2.43e-03) :  10%|█         | 10/100 [00:00<00:02, 33.54it/s]
[TorchDR] Root search : mean abs value =  8.67e-04 (std =  1.66e-03) :  10%|█         | 10/100 [00:00<00:02, 33.54it/s]
[TorchDR] Root search : mean abs value =  5.47e-04 (std =  1.14e-03) :  10%|█         | 10/100 [00:00<00:02, 33.54it/s]
[TorchDR] Root search : mean abs value =  3.48e-04 (std =  7.87e-04) :  10%|█         | 10/100 [00:00<00:02, 33.54it/s]
[TorchDR] Root search : mean abs value =  2.23e-04 (std =  5.45e-04) :  10%|█         | 10/100 [00:00<00:02, 33.54it/s]
[TorchDR] Root search : mean abs value =  2.23e-04 (std =  5.45e-04) :  16%|█▌        | 16/100 [00:00<00:01, 43.13it/s]
[TorchDR] Root search : mean abs value =  1.44e-04 (std =  3.79e-04) :  16%|█▌        | 16/100 [00:00<00:01, 43.13it/s]
[TorchDR] Root search : mean abs value =  9.31e-05 (std =  2.65e-04) :  16%|█▌        | 16/100 [00:00<00:01, 43.13it/s]
[TorchDR] Root search : mean abs value =  6.07e-05 (std =  1.86e-04) :  16%|█▌        | 16/100 [00:00<00:01, 43.13it/s]
[TorchDR] Root search : mean abs value =  3.98e-05 (std =  1.31e-04) :  16%|█▌        | 16/100 [00:00<00:01, 43.13it/s]
[TorchDR] Root search : mean abs value =  2.62e-05 (std =  9.25e-05) :  16%|█▌        | 16/100 [00:00<00:01, 43.13it/s]
[TorchDR] Root search : mean abs value =  2.62e-05 (std =  9.25e-05) :  21%|██        | 21/100 [00:00<00:01, 45.49it/s]
[TorchDR] Root search : mean abs value =  1.73e-05 (std =  6.56e-05) :  21%|██        | 21/100 [00:00<00:01, 45.49it/s]
[TorchDR] Root search : mean abs value =  1.15e-05 (std =  4.67e-05) :  21%|██        | 21/100 [00:00<00:01, 45.49it/s]
[TorchDR] Root search : mean abs value =  1.15e-05 (std =  4.67e-05) :  23%|██▎       | 23/100 [00:00<00:01, 46.24it/s]

  0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 3.45e+01 | Grad norm : 2.15e+00 :   0%|          | 0/500 [00:00<?, ?it/s]
[TorchDR] DR Loss : 3.45e+01 | Grad norm : 2.15e+00 :   0%|          | 1/500 [00:00<04:22,  1.90it/s]
[TorchDR] DR Loss : 3.29e+01 | Grad norm : 2.15e+00 :   0%|          | 1/500 [00:01<04:22,  1.90it/s]
[TorchDR] DR Loss : 3.29e+01 | Grad norm : 2.15e+00 :   0%|          | 2/500 [00:01<05:13,  1.59it/s]
[TorchDR] DR Loss : 3.12e+01 | Grad norm : 2.15e+00 :   0%|          | 2/500 [00:01<05:13,  1.59it/s]
[TorchDR] DR Loss : 3.12e+01 | Grad norm : 2.15e+00 :   1%|          | 3/500 [00:01<04:37,  1.79it/s]
[TorchDR] DR Loss : 2.96e+01 | Grad norm : 2.15e+00 :   1%|          | 3/500 [00:02<04:37,  1.79it/s]
[TorchDR] DR Loss : 2.96e+01 | Grad norm : 2.15e+00 :   1%|          | 4/500 [00:02<05:10,  1.60it/s]
[TorchDR] DR Loss : 2.79e+01 | Grad norm : 2.15e+00 :   1%|          | 4/500 [00:03<05:10,  1.60it/s]
[TorchDR] DR Loss : 2.79e+01 | Grad norm : 2.15e+00 :   1%|          | 5/500 [00:03<07:09,  1.15it/s]
[TorchDR] DR Loss : 2.62e+01 | Grad norm : 2.15e+00 :   1%|          | 5/500 [00:04<07:09,  1.15it/s]
[TorchDR] DR Loss : 2.62e+01 | Grad norm : 2.15e+00 :   1%|          | 6/500 [00:04<07:26,  1.11it/s]
[TorchDR] DR Loss : 2.46e+01 | Grad norm : 2.15e+00 :   1%|          | 6/500 [00:05<07:26,  1.11it/s]
[TorchDR] DR Loss : 2.46e+01 | Grad norm : 2.15e+00 :   1%|▏         | 7/500 [00:05<06:52,  1.20it/s]
[TorchDR] DR Loss : 2.29e+01 | Grad norm : 2.15e+00 :   1%|▏         | 7/500 [00:06<06:52,  1.20it/s]
[TorchDR] DR Loss : 2.29e+01 | Grad norm : 2.15e+00 :   2%|▏         | 8/500 [00:06<07:01,  1.17it/s]
[TorchDR] DR Loss : 2.13e+01 | Grad norm : 2.15e+00 :   2%|▏         | 8/500 [00:07<07:01,  1.17it/s]
[TorchDR] DR Loss : 2.13e+01 | Grad norm : 2.15e+00 :   2%|▏         | 9/500 [00:07<07:26,  1.10it/s]
[TorchDR] DR Loss : 1.98e+01 | Grad norm : 2.15e+00 :   2%|▏         | 9/500 [00:08<07:26,  1.10it/s]
[TorchDR] DR Loss : 1.98e+01 | Grad norm : 2.15e+00 :   2%|▏         | 10/500 [00:08<07:24,  1.10it/s]
[TorchDR] DR Loss : 1.83e+01 | Grad norm : 2.15e+00 :   2%|▏         | 10/500 [00:08<07:24,  1.10it/s]
[TorchDR] DR Loss : 1.83e+01 | Grad norm : 2.15e+00 :   2%|▏         | 11/500 [00:09<07:07,  1.14it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 2.15e+00 :   2%|▏         | 11/500 [00:09<07:07,  1.14it/s]
[TorchDR] DR Loss : 1.69e+01 | Grad norm : 2.15e+00 :   2%|▏         | 12/500 [00:09<06:40,  1.22it/s]
[TorchDR] DR Loss : 1.56e+01 | Grad norm : 2.15e+00 :   2%|▏         | 12/500 [00:10<06:40,  1.22it/s]
[TorchDR] DR Loss : 1.56e+01 | Grad norm : 2.15e+00 :   3%|▎         | 13/500 [00:10<06:36,  1.23it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 2.15e+00 :   3%|▎         | 13/500 [00:11<06:36,  1.23it/s]
[TorchDR] DR Loss : 1.44e+01 | Grad norm : 2.15e+00 :   3%|▎         | 14/500 [00:11<06:33,  1.23it/s]
[TorchDR] DR Loss : 1.33e+01 | Grad norm : 2.15e+00 :   3%|▎         | 14/500 [00:12<06:33,  1.23it/s]
[TorchDR] DR Loss : 1.33e+01 | Grad norm : 2.15e+00 :   3%|▎         | 15/500 [00:12<06:17,  1.29it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 2.15e+00 :   3%|▎         | 15/500 [00:12<06:17,  1.29it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 2.15e+00 :   3%|▎         | 16/500 [00:12<06:00,  1.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.15e+00 :   3%|▎         | 16/500 [00:13<06:00,  1.34it/s]
[TorchDR] DR Loss : 1.14e+01 | Grad norm : 2.15e+00 :   3%|▎         | 17/500 [00:13<05:53,  1.37it/s]
[TorchDR] DR Loss : 1.07e+01 | Grad norm : 2.15e+00 :   3%|▎         | 17/500 [00:14<05:53,  1.37it/s]
[TorchDR] DR Loss : 1.07e+01 | Grad norm : 2.15e+00 :   4%|▎         | 18/500 [00:14<06:02,  1.33it/s]
[TorchDR] DR Loss : 1.01e+01 | Grad norm : 2.15e+00 :   4%|▎         | 18/500 [00:15<06:02,  1.33it/s]
[TorchDR] DR Loss : 1.01e+01 | Grad norm : 2.15e+00 :   4%|▍         | 19/500 [00:15<06:56,  1.16it/s]
[TorchDR] DR Loss : 9.63e+00 | Grad norm : 2.15e+00 :   4%|▍         | 19/500 [00:15<06:56,  1.16it/s]
[TorchDR] DR Loss : 9.63e+00 | Grad norm : 2.15e+00 :   4%|▍         | 20/500 [00:15<06:17,  1.27it/s]
[TorchDR] DR Loss : 9.22e+00 | Grad norm : 2.15e+00 :   4%|▍         | 20/500 [00:16<06:17,  1.27it/s]
[TorchDR] DR Loss : 9.22e+00 | Grad norm : 2.15e+00 :   4%|▍         | 21/500 [00:16<06:03,  1.32it/s]
[TorchDR] DR Loss : 8.89e+00 | Grad norm : 2.15e+00 :   4%|▍         | 21/500 [00:17<06:03,  1.32it/s]
[TorchDR] DR Loss : 8.89e+00 | Grad norm : 2.15e+00 :   4%|▍         | 22/500 [00:17<06:08,  1.30it/s]
[TorchDR] DR Loss : 8.62e+00 | Grad norm : 2.15e+00 :   4%|▍         | 22/500 [00:18<06:08,  1.30it/s]
[TorchDR] DR Loss : 8.62e+00 | Grad norm : 2.15e+00 :   5%|▍         | 23/500 [00:18<06:12,  1.28it/s]
[TorchDR] DR Loss : 8.40e+00 | Grad norm : 2.15e+00 :   5%|▍         | 23/500 [00:19<06:12,  1.28it/s]
[TorchDR] DR Loss : 8.40e+00 | Grad norm : 2.15e+00 :   5%|▍         | 24/500 [00:19<06:14,  1.27it/s]
[TorchDR] DR Loss : 8.23e+00 | Grad norm : 2.15e+00 :   5%|▍         | 24/500 [00:19<06:14,  1.27it/s]
[TorchDR] DR Loss : 8.23e+00 | Grad norm : 2.15e+00 :   5%|▌         | 25/500 [00:19<06:01,  1.32it/s]
[TorchDR] DR Loss : 8.09e+00 | Grad norm : 2.15e+00 :   5%|▌         | 25/500 [00:20<06:01,  1.32it/s]
[TorchDR] DR Loss : 8.09e+00 | Grad norm : 2.15e+00 :   5%|▌         | 26/500 [00:20<05:51,  1.35it/s]
[TorchDR] DR Loss : 7.97e+00 | Grad norm : 2.15e+00 :   5%|▌         | 26/500 [00:20<05:51,  1.35it/s]
[TorchDR] DR Loss : 7.97e+00 | Grad norm : 2.15e+00 :   5%|▌         | 27/500 [00:20<05:12,  1.51it/s]
[TorchDR] DR Loss : 7.85e+00 | Grad norm : 2.15e+00 :   5%|▌         | 27/500 [00:21<05:12,  1.51it/s]
[TorchDR] DR Loss : 7.85e+00 | Grad norm : 2.15e+00 :   6%|▌         | 28/500 [00:21<04:53,  1.61it/s]
[TorchDR] DR Loss : 7.71e+00 | Grad norm : 2.15e+00 :   6%|▌         | 28/500 [00:22<04:53,  1.61it/s]
[TorchDR] DR Loss : 7.71e+00 | Grad norm : 2.15e+00 :   6%|▌         | 29/500 [00:22<05:03,  1.55it/s]
[TorchDR] DR Loss : 7.57e+00 | Grad norm : 2.15e+00 :   6%|▌         | 29/500 [00:23<05:03,  1.55it/s]
[TorchDR] DR Loss : 7.57e+00 | Grad norm : 2.15e+00 :   6%|▌         | 30/500 [00:23<05:39,  1.39it/s]
[TorchDR] DR Loss : 7.42e+00 | Grad norm : 2.15e+00 :   6%|▌         | 30/500 [00:23<05:39,  1.39it/s]
[TorchDR] DR Loss : 7.42e+00 | Grad norm : 2.15e+00 :   6%|▌         | 31/500 [00:23<05:49,  1.34it/s]
[TorchDR] DR Loss : 7.28e+00 | Grad norm : 2.15e+00 :   6%|▌         | 31/500 [00:24<05:49,  1.34it/s]
[TorchDR] DR Loss : 7.28e+00 | Grad norm : 2.15e+00 :   6%|▋         | 32/500 [00:24<05:56,  1.31it/s]
[TorchDR] DR Loss : 7.16e+00 | Grad norm : 2.15e+00 :   6%|▋         | 32/500 [00:25<05:56,  1.31it/s]
[TorchDR] DR Loss : 7.16e+00 | Grad norm : 2.15e+00 :   7%|▋         | 33/500 [00:25<05:46,  1.35it/s]
[TorchDR] DR Loss : 7.06e+00 | Grad norm : 2.15e+00 :   7%|▋         | 33/500 [00:25<05:46,  1.35it/s]
[TorchDR] DR Loss : 7.06e+00 | Grad norm : 2.15e+00 :   7%|▋         | 34/500 [00:25<05:26,  1.43it/s]
[TorchDR] DR Loss : 6.97e+00 | Grad norm : 2.15e+00 :   7%|▋         | 34/500 [00:26<05:26,  1.43it/s]
[TorchDR] DR Loss : 6.97e+00 | Grad norm : 2.15e+00 :   7%|▋         | 35/500 [00:26<05:39,  1.37it/s]
[TorchDR] DR Loss : 6.90e+00 | Grad norm : 2.15e+00 :   7%|▋         | 35/500 [00:27<05:39,  1.37it/s]
[TorchDR] DR Loss : 6.90e+00 | Grad norm : 2.15e+00 :   7%|▋         | 36/500 [00:27<05:21,  1.45it/s]
[TorchDR] DR Loss : 6.84e+00 | Grad norm : 2.15e+00 :   7%|▋         | 36/500 [00:27<05:21,  1.45it/s]
[TorchDR] DR Loss : 6.84e+00 | Grad norm : 2.15e+00 :   7%|▋         | 37/500 [00:27<05:06,  1.51it/s]
[TorchDR] DR Loss : 6.78e+00 | Grad norm : 2.15e+00 :   7%|▋         | 37/500 [00:28<05:06,  1.51it/s]
[TorchDR] DR Loss : 6.78e+00 | Grad norm : 2.15e+00 :   8%|▊         | 38/500 [00:28<04:58,  1.55it/s]
[TorchDR] DR Loss : 6.72e+00 | Grad norm : 2.15e+00 :   8%|▊         | 38/500 [00:29<04:58,  1.55it/s]
[TorchDR] DR Loss : 6.72e+00 | Grad norm : 2.15e+00 :   8%|▊         | 39/500 [00:29<05:18,  1.45it/s]
[TorchDR] DR Loss : 6.67e+00 | Grad norm : 2.15e+00 :   8%|▊         | 39/500 [00:29<05:18,  1.45it/s]
[TorchDR] DR Loss : 6.67e+00 | Grad norm : 2.15e+00 :   8%|▊         | 40/500 [00:29<05:05,  1.50it/s]
[TorchDR] DR Loss : 6.61e+00 | Grad norm : 2.15e+00 :   8%|▊         | 40/500 [00:30<05:05,  1.50it/s]
[TorchDR] DR Loss : 6.61e+00 | Grad norm : 2.15e+00 :   8%|▊         | 41/500 [00:30<04:55,  1.55it/s]
[TorchDR] DR Loss : 6.56e+00 | Grad norm : 2.15e+00 :   8%|▊         | 41/500 [00:31<04:55,  1.55it/s]
[TorchDR] DR Loss : 6.56e+00 | Grad norm : 2.15e+00 :   8%|▊         | 42/500 [00:31<04:48,  1.59it/s]
[TorchDR] DR Loss : 6.51e+00 | Grad norm : 2.15e+00 :   8%|▊         | 42/500 [00:31<04:48,  1.59it/s]
[TorchDR] DR Loss : 6.51e+00 | Grad norm : 2.15e+00 :   9%|▊         | 43/500 [00:31<04:40,  1.63it/s]
[TorchDR] DR Loss : 6.46e+00 | Grad norm : 2.15e+00 :   9%|▊         | 43/500 [00:32<04:40,  1.63it/s]
[TorchDR] DR Loss : 6.46e+00 | Grad norm : 2.15e+00 :   9%|▉         | 44/500 [00:32<04:51,  1.57it/s]
[TorchDR] DR Loss : 6.43e+00 | Grad norm : 2.15e+00 :   9%|▉         | 44/500 [00:33<04:51,  1.57it/s]
[TorchDR] DR Loss : 6.43e+00 | Grad norm : 2.15e+00 :   9%|▉         | 45/500 [00:33<05:16,  1.44it/s]
[TorchDR] DR Loss : 6.40e+00 | Grad norm : 2.15e+00 :   9%|▉         | 45/500 [00:33<05:16,  1.44it/s]
[TorchDR] DR Loss : 6.40e+00 | Grad norm : 2.15e+00 :   9%|▉         | 46/500 [00:33<05:16,  1.43it/s]
[TorchDR] DR Loss : 6.37e+00 | Grad norm : 2.15e+00 :   9%|▉         | 46/500 [00:34<05:16,  1.43it/s]
[TorchDR] DR Loss : 6.37e+00 | Grad norm : 2.15e+00 :   9%|▉         | 47/500 [00:34<05:16,  1.43it/s]
[TorchDR] DR Loss : 6.35e+00 | Grad norm : 2.15e+00 :   9%|▉         | 47/500 [00:35<05:16,  1.43it/s]
[TorchDR] DR Loss : 6.35e+00 | Grad norm : 2.15e+00 :  10%|▉         | 48/500 [00:35<05:25,  1.39it/s]
[TorchDR] DR Loss : 6.32e+00 | Grad norm : 2.15e+00 :  10%|▉         | 48/500 [00:36<05:25,  1.39it/s]
[TorchDR] DR Loss : 6.32e+00 | Grad norm : 2.15e+00 :  10%|▉         | 49/500 [00:36<05:52,  1.28it/s]
[TorchDR] DR Loss : 6.29e+00 | Grad norm : 2.15e+00 :  10%|▉         | 49/500 [00:36<05:52,  1.28it/s]
[TorchDR] DR Loss : 6.29e+00 | Grad norm : 2.15e+00 :  10%|█         | 50/500 [00:36<05:27,  1.37it/s]
[TorchDR] DR Loss : 6.27e+00 | Grad norm : 4.75e-01 :  10%|█         | 50/500 [00:37<05:27,  1.37it/s]
[TorchDR] DR Loss : 6.27e+00 | Grad norm : 4.75e-01 :  10%|█         | 51/500 [00:37<05:10,  1.45it/s]
[TorchDR] DR Loss : 6.24e+00 | Grad norm : 4.75e-01 :  10%|█         | 51/500 [00:38<05:10,  1.45it/s]
[TorchDR] DR Loss : 6.24e+00 | Grad norm : 4.75e-01 :  10%|█         | 52/500 [00:38<05:19,  1.40it/s]
[TorchDR] DR Loss : 6.21e+00 | Grad norm : 4.75e-01 :  10%|█         | 52/500 [00:38<05:19,  1.40it/s]
[TorchDR] DR Loss : 6.21e+00 | Grad norm : 4.75e-01 :  11%|█         | 53/500 [00:38<05:08,  1.45it/s]
[TorchDR] DR Loss : 6.19e+00 | Grad norm : 4.75e-01 :  11%|█         | 53/500 [00:39<05:08,  1.45it/s]
[TorchDR] DR Loss : 6.19e+00 | Grad norm : 4.75e-01 :  11%|█         | 54/500 [00:39<04:55,  1.51it/s]
[TorchDR] DR Loss : 6.16e+00 | Grad norm : 4.75e-01 :  11%|█         | 54/500 [00:40<04:55,  1.51it/s]
[TorchDR] DR Loss : 6.16e+00 | Grad norm : 4.75e-01 :  11%|█         | 55/500 [00:40<04:59,  1.49it/s]
[TorchDR] DR Loss : 6.14e+00 | Grad norm : 4.75e-01 :  11%|█         | 55/500 [00:40<04:59,  1.49it/s]
[TorchDR] DR Loss : 6.14e+00 | Grad norm : 4.75e-01 :  11%|█         | 56/500 [00:40<05:02,  1.47it/s]
[TorchDR] DR Loss : 6.12e+00 | Grad norm : 4.75e-01 :  11%|█         | 56/500 [00:41<05:02,  1.47it/s]
[TorchDR] DR Loss : 6.12e+00 | Grad norm : 4.75e-01 :  11%|█▏        | 57/500 [00:41<04:50,  1.52it/s]
[TorchDR] DR Loss : 6.11e+00 | Grad norm : 4.75e-01 :  11%|█▏        | 57/500 [00:42<04:50,  1.52it/s]
[TorchDR] DR Loss : 6.11e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 58/500 [00:42<04:52,  1.51it/s]
[TorchDR] DR Loss : 6.09e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 58/500 [00:42<04:52,  1.51it/s]
[TorchDR] DR Loss : 6.09e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 59/500 [00:42<04:47,  1.53it/s]
[TorchDR] DR Loss : 6.07e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 59/500 [00:43<04:47,  1.53it/s]
[TorchDR] DR Loss : 6.07e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 60/500 [00:43<05:19,  1.38it/s]
[TorchDR] DR Loss : 6.05e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 60/500 [00:44<05:19,  1.38it/s]
[TorchDR] DR Loss : 6.05e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 61/500 [00:44<05:02,  1.45it/s]
[TorchDR] DR Loss : 6.04e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 61/500 [00:45<05:02,  1.45it/s]
[TorchDR] DR Loss : 6.04e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 62/500 [00:45<05:11,  1.40it/s]
[TorchDR] DR Loss : 6.02e+00 | Grad norm : 4.75e-01 :  12%|█▏        | 62/500 [00:45<05:11,  1.40it/s]
[TorchDR] DR Loss : 6.02e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 63/500 [00:45<04:47,  1.52it/s]
[TorchDR] DR Loss : 6.00e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 63/500 [00:46<04:47,  1.52it/s]
[TorchDR] DR Loss : 6.00e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 64/500 [00:46<05:05,  1.43it/s]
[TorchDR] DR Loss : 5.99e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 64/500 [00:47<05:05,  1.43it/s]
[TorchDR] DR Loss : 5.99e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 65/500 [00:47<05:56,  1.22it/s]
[TorchDR] DR Loss : 5.97e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 65/500 [00:48<05:56,  1.22it/s]
[TorchDR] DR Loss : 5.97e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 66/500 [00:48<05:53,  1.23it/s]
[TorchDR] DR Loss : 5.96e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 66/500 [00:48<05:53,  1.23it/s]
[TorchDR] DR Loss : 5.96e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 67/500 [00:48<05:24,  1.33it/s]
[TorchDR] DR Loss : 5.94e+00 | Grad norm : 4.75e-01 :  13%|█▎        | 67/500 [00:49<05:24,  1.33it/s]
[TorchDR] DR Loss : 5.94e+00 | Grad norm : 4.75e-01 :  14%|█▎        | 68/500 [00:49<04:51,  1.48it/s]
[TorchDR] DR Loss : 5.93e+00 | Grad norm : 4.75e-01 :  14%|█▎        | 68/500 [00:49<04:51,  1.48it/s]
[TorchDR] DR Loss : 5.93e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 69/500 [00:49<04:28,  1.60it/s]
[TorchDR] DR Loss : 5.92e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 69/500 [00:50<04:28,  1.60it/s]
[TorchDR] DR Loss : 5.92e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 70/500 [00:50<04:20,  1.65it/s]
[TorchDR] DR Loss : 5.91e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 70/500 [00:51<04:20,  1.65it/s]
[TorchDR] DR Loss : 5.91e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 71/500 [00:51<04:19,  1.65it/s]
[TorchDR] DR Loss : 5.89e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 71/500 [00:51<04:19,  1.65it/s]
[TorchDR] DR Loss : 5.89e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 72/500 [00:51<04:09,  1.71it/s]
[TorchDR] DR Loss : 5.88e+00 | Grad norm : 4.75e-01 :  14%|█▍        | 72/500 [00:52<04:09,  1.71it/s]
[TorchDR] DR Loss : 5.88e+00 | Grad norm : 4.75e-01 :  15%|█▍        | 73/500 [00:52<04:32,  1.57it/s]
[TorchDR] DR Loss : 5.87e+00 | Grad norm : 4.75e-01 :  15%|█▍        | 73/500 [00:53<04:32,  1.57it/s]
[TorchDR] DR Loss : 5.87e+00 | Grad norm : 4.75e-01 :  15%|█▍        | 74/500 [00:53<04:56,  1.44it/s]
[TorchDR] DR Loss : 5.86e+00 | Grad norm : 4.75e-01 :  15%|█▍        | 74/500 [00:53<04:56,  1.44it/s]
[TorchDR] DR Loss : 5.86e+00 | Grad norm : 4.75e-01 :  15%|█▌        | 75/500 [00:53<04:43,  1.50it/s]
[TorchDR] DR Loss : 5.85e+00 | Grad norm : 4.75e-01 :  15%|█▌        | 75/500 [00:54<04:43,  1.50it/s]
[TorchDR] DR Loss : 5.85e+00 | Grad norm : 4.75e-01 :  15%|█▌        | 76/500 [00:54<04:47,  1.48it/s]
[TorchDR] DR Loss : 5.84e+00 | Grad norm : 4.75e-01 :  15%|█▌        | 76/500 [00:55<04:47,  1.48it/s]
[TorchDR] DR Loss : 5.84e+00 | Grad norm : 4.75e-01 :  15%|█▌        | 77/500 [00:55<05:01,  1.40it/s]
[TorchDR] DR Loss : 5.83e+00 | Grad norm : 4.75e-01 :  15%|█▌        | 77/500 [00:55<05:01,  1.40it/s]
[TorchDR] DR Loss : 5.83e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 78/500 [00:55<04:46,  1.47it/s]
[TorchDR] DR Loss : 5.83e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 78/500 [00:56<04:46,  1.47it/s]
[TorchDR] DR Loss : 5.83e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 79/500 [00:56<04:19,  1.62it/s]
[TorchDR] DR Loss : 5.82e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 79/500 [00:56<04:19,  1.62it/s]
[TorchDR] DR Loss : 5.82e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 80/500 [00:57<04:20,  1.61it/s]
[TorchDR] DR Loss : 5.81e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 80/500 [00:57<04:20,  1.61it/s]
[TorchDR] DR Loss : 5.81e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 81/500 [00:57<04:05,  1.71it/s]
[TorchDR] DR Loss : 5.80e+00 | Grad norm : 4.75e-01 :  16%|█▌        | 81/500 [00:58<04:05,  1.71it/s]
[TorchDR] DR Loss : 5.80e+00 | Grad norm : 4.75e-01 :  16%|█▋        | 82/500 [00:58<04:31,  1.54it/s]
[TorchDR] DR Loss : 5.80e+00 | Grad norm : 4.75e-01 :  16%|█▋        | 82/500 [00:59<04:31,  1.54it/s]
[TorchDR] DR Loss : 5.80e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 83/500 [00:59<04:45,  1.46it/s]
[TorchDR] DR Loss : 5.79e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 83/500 [00:59<04:45,  1.46it/s]
[TorchDR] DR Loss : 5.79e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 84/500 [00:59<04:25,  1.57it/s]
[TorchDR] DR Loss : 5.78e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 84/500 [01:00<04:25,  1.57it/s]
[TorchDR] DR Loss : 5.78e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 85/500 [01:00<04:04,  1.70it/s]
[TorchDR] DR Loss : 5.78e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 85/500 [01:00<04:04,  1.70it/s]
[TorchDR] DR Loss : 5.78e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 86/500 [01:00<04:17,  1.61it/s]
[TorchDR] DR Loss : 5.77e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 86/500 [01:01<04:17,  1.61it/s]
[TorchDR] DR Loss : 5.77e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 87/500 [01:01<04:05,  1.68it/s]
[TorchDR] DR Loss : 5.77e+00 | Grad norm : 4.75e-01 :  17%|█▋        | 87/500 [01:02<04:05,  1.68it/s]
[TorchDR] DR Loss : 5.77e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 88/500 [01:02<04:30,  1.53it/s]
[TorchDR] DR Loss : 5.76e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 88/500 [01:02<04:30,  1.53it/s]
[TorchDR] DR Loss : 5.76e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 89/500 [01:02<04:22,  1.57it/s]
[TorchDR] DR Loss : 5.76e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 89/500 [01:03<04:22,  1.57it/s]
[TorchDR] DR Loss : 5.76e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 90/500 [01:03<04:01,  1.70it/s]
[TorchDR] DR Loss : 5.75e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 90/500 [01:03<04:01,  1.70it/s]
[TorchDR] DR Loss : 5.75e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 91/500 [01:03<04:17,  1.59it/s]
[TorchDR] DR Loss : 5.75e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 91/500 [01:04<04:17,  1.59it/s]
[TorchDR] DR Loss : 5.75e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 92/500 [01:04<04:13,  1.61it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 4.75e-01 :  18%|█▊        | 92/500 [01:05<04:13,  1.61it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 4.75e-01 :  19%|█▊        | 93/500 [01:05<03:58,  1.71it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 4.75e-01 :  19%|█▊        | 93/500 [01:05<03:58,  1.71it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 94/500 [01:05<04:07,  1.64it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 94/500 [01:06<04:07,  1.64it/s]
[TorchDR] DR Loss : 5.74e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 95/500 [01:06<03:45,  1.80it/s]
[TorchDR] DR Loss : 5.73e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 95/500 [01:06<03:45,  1.80it/s]
[TorchDR] DR Loss : 5.73e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 96/500 [01:06<03:38,  1.85it/s]
[TorchDR] DR Loss : 5.73e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 96/500 [01:07<03:38,  1.85it/s]
[TorchDR] DR Loss : 5.73e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 97/500 [01:07<03:44,  1.79it/s]
[TorchDR] DR Loss : 5.73e+00 | Grad norm : 4.75e-01 :  19%|█▉        | 97/500 [01:07<03:44,  1.79it/s]
[TorchDR] DR Loss : 5.73e+00 | Grad norm : 4.75e-01 :  20%|█▉        | 98/500 [01:07<04:01,  1.67it/s]
[TorchDR] DR Loss : 5.72e+00 | Grad norm : 4.75e-01 :  20%|█▉        | 98/500 [01:08<04:01,  1.67it/s]
[TorchDR] DR Loss : 5.72e+00 | Grad norm : 4.75e-01 :  20%|█▉        | 99/500 [01:08<04:24,  1.52it/s]
[TorchDR] DR Loss : 5.72e+00 | Grad norm : 4.75e-01 :  20%|█▉        | 99/500 [01:09<04:24,  1.52it/s]
[TorchDR] DR Loss : 5.72e+00 | Grad norm : 4.75e-01 :  20%|██        | 100/500 [01:09<04:40,  1.42it/s]
[TorchDR] DR Loss : 5.72e+00 | Grad norm : 5.40e-02 :  20%|██        | 100/500 [01:10<04:40,  1.42it/s]
[TorchDR] DR Loss : 5.72e+00 | Grad norm : 5.40e-02 :  20%|██        | 101/500 [01:10<04:27,  1.49it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  20%|██        | 101/500 [01:10<04:27,  1.49it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  20%|██        | 102/500 [01:10<04:19,  1.54it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  20%|██        | 102/500 [01:11<04:19,  1.54it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  21%|██        | 103/500 [01:11<04:11,  1.58it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  21%|██        | 103/500 [01:11<04:11,  1.58it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  21%|██        | 104/500 [01:11<04:03,  1.62it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  21%|██        | 104/500 [01:12<04:03,  1.62it/s]
[TorchDR] DR Loss : 5.71e+00 | Grad norm : 5.40e-02 :  21%|██        | 105/500 [01:12<04:01,  1.64it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  21%|██        | 105/500 [01:13<04:01,  1.64it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  21%|██        | 106/500 [01:13<03:59,  1.65it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  21%|██        | 106/500 [01:13<03:59,  1.65it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  21%|██▏       | 107/500 [01:13<03:58,  1.65it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  21%|██▏       | 107/500 [01:14<03:58,  1.65it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 108/500 [01:14<03:56,  1.66it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 108/500 [01:14<03:56,  1.66it/s]
[TorchDR] DR Loss : 5.70e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 109/500 [01:14<03:58,  1.64it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 109/500 [01:15<03:58,  1.64it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 110/500 [01:15<03:57,  1.64it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 110/500 [01:16<03:57,  1.64it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 111/500 [01:16<03:43,  1.74it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 111/500 [01:16<03:43,  1.74it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 112/500 [01:16<03:34,  1.81it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  22%|██▏       | 112/500 [01:17<03:34,  1.81it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 113/500 [01:17<03:27,  1.86it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 113/500 [01:17<03:27,  1.86it/s]
[TorchDR] DR Loss : 5.69e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 114/500 [01:17<03:57,  1.62it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 114/500 [01:18<03:57,  1.62it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 115/500 [01:18<03:55,  1.64it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 115/500 [01:19<03:55,  1.64it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 116/500 [01:19<03:53,  1.65it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 116/500 [01:19<03:53,  1.65it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 117/500 [01:19<03:51,  1.65it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  23%|██▎       | 117/500 [01:20<03:51,  1.65it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  24%|██▎       | 118/500 [01:20<03:39,  1.74it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  24%|██▎       | 118/500 [01:20<03:39,  1.74it/s]
[TorchDR] DR Loss : 5.68e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 119/500 [01:20<03:41,  1.72it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 119/500 [01:21<03:41,  1.72it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 120/500 [01:21<03:31,  1.79it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 120/500 [01:21<03:31,  1.79it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 121/500 [01:21<03:24,  1.85it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 121/500 [01:22<03:24,  1.85it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 122/500 [01:22<03:19,  1.89it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  24%|██▍       | 122/500 [01:22<03:19,  1.89it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  25%|██▍       | 123/500 [01:22<03:23,  1.85it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  25%|██▍       | 123/500 [01:23<03:23,  1.85it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  25%|██▍       | 124/500 [01:23<03:22,  1.86it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  25%|██▍       | 124/500 [01:23<03:22,  1.86it/s]
[TorchDR] DR Loss : 5.67e+00 | Grad norm : 5.40e-02 :  25%|██▌       | 125/500 [01:23<03:28,  1.80it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  25%|██▌       | 125/500 [01:24<03:28,  1.80it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  25%|██▌       | 126/500 [01:24<03:21,  1.85it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  25%|██▌       | 126/500 [01:24<03:21,  1.85it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  25%|██▌       | 127/500 [01:24<03:16,  1.90it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  25%|██▌       | 127/500 [01:25<03:16,  1.90it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 128/500 [01:25<03:13,  1.93it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 128/500 [01:25<03:13,  1.93it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 129/500 [01:25<03:07,  1.98it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 129/500 [01:26<03:07,  1.98it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 130/500 [01:26<03:09,  1.95it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 130/500 [01:27<03:09,  1.95it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 131/500 [01:27<03:29,  1.76it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▌       | 131/500 [01:27<03:29,  1.76it/s]
[TorchDR] DR Loss : 5.66e+00 | Grad norm : 5.40e-02 :  26%|██▋       | 132/500 [01:27<03:10,  1.93it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  26%|██▋       | 132/500 [01:28<03:10,  1.93it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 133/500 [01:28<03:18,  1.84it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 133/500 [01:28<03:18,  1.84it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 134/500 [01:28<03:14,  1.89it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 134/500 [01:29<03:14,  1.89it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 135/500 [01:29<03:20,  1.82it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 135/500 [01:29<03:20,  1.82it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 136/500 [01:29<03:14,  1.87it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 136/500 [01:30<03:14,  1.87it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 137/500 [01:30<03:10,  1.91it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  27%|██▋       | 137/500 [01:30<03:10,  1.91it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 138/500 [01:30<03:07,  1.93it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 138/500 [01:31<03:07,  1.93it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 139/500 [01:31<03:16,  1.84it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 139/500 [01:32<03:16,  1.84it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 140/500 [01:32<03:50,  1.56it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 140/500 [01:32<03:50,  1.56it/s]
[TorchDR] DR Loss : 5.65e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 141/500 [01:32<03:48,  1.57it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 141/500 [01:33<03:48,  1.57it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 142/500 [01:33<03:30,  1.70it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  28%|██▊       | 142/500 [01:33<03:30,  1.70it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▊       | 143/500 [01:33<03:31,  1.69it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▊       | 143/500 [01:34<03:31,  1.69it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 144/500 [01:34<03:13,  1.84it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 144/500 [01:35<03:13,  1.84it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 145/500 [01:35<03:29,  1.69it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 145/500 [01:35<03:29,  1.69it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 146/500 [01:35<03:29,  1.69it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 146/500 [01:36<03:29,  1.69it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 147/500 [01:36<03:30,  1.68it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  29%|██▉       | 147/500 [01:36<03:30,  1.68it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  30%|██▉       | 148/500 [01:36<03:19,  1.76it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  30%|██▉       | 148/500 [01:37<03:19,  1.76it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  30%|██▉       | 149/500 [01:37<03:11,  1.83it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  30%|██▉       | 149/500 [01:37<03:11,  1.83it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 5.40e-02 :  30%|███       | 150/500 [01:37<03:13,  1.81it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  30%|███       | 150/500 [01:38<03:13,  1.81it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  30%|███       | 151/500 [01:38<03:20,  1.74it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  30%|███       | 151/500 [01:38<03:20,  1.74it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  30%|███       | 152/500 [01:38<03:12,  1.81it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  30%|███       | 152/500 [01:39<03:12,  1.81it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 153/500 [01:39<03:13,  1.79it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 153/500 [01:39<03:13,  1.79it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 154/500 [01:39<03:07,  1.85it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 154/500 [01:40<03:07,  1.85it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 155/500 [01:40<03:26,  1.67it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 155/500 [01:41<03:26,  1.67it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 156/500 [01:41<03:33,  1.61it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███       | 156/500 [01:41<03:33,  1.61it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███▏      | 157/500 [01:41<03:30,  1.63it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  31%|███▏      | 157/500 [01:42<03:30,  1.63it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 158/500 [01:42<03:31,  1.62it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 158/500 [01:43<03:31,  1.62it/s]
[TorchDR] DR Loss : 5.64e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 159/500 [01:43<03:18,  1.72it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 159/500 [01:43<03:18,  1.72it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 160/500 [01:43<03:30,  1.62it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 160/500 [01:44<03:30,  1.62it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 161/500 [01:44<03:58,  1.42it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 161/500 [01:45<03:58,  1.42it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 162/500 [01:45<04:27,  1.26it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  32%|███▏      | 162/500 [01:46<04:27,  1.26it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 163/500 [01:46<04:17,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 163/500 [01:47<04:17,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 164/500 [01:47<04:17,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 164/500 [01:48<04:17,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 165/500 [01:48<04:30,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 165/500 [01:49<04:30,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 166/500 [01:49<04:41,  1.19it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 166/500 [01:49<04:41,  1.19it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 167/500 [01:49<04:36,  1.20it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  33%|███▎      | 167/500 [01:51<04:36,  1.20it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▎      | 168/500 [01:51<05:12,  1.06it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▎      | 168/500 [01:52<05:12,  1.06it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 169/500 [01:52<05:56,  1.08s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 169/500 [01:53<05:56,  1.08s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 170/500 [01:53<06:24,  1.17s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 170/500 [01:54<06:24,  1.17s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 171/500 [01:54<06:10,  1.13s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 171/500 [01:55<06:10,  1.13s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 172/500 [01:55<05:53,  1.08s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  34%|███▍      | 172/500 [01:56<05:53,  1.08s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▍      | 173/500 [01:56<05:47,  1.06s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▍      | 173/500 [01:57<05:47,  1.06s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▍      | 174/500 [01:57<05:47,  1.07s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▍      | 174/500 [01:58<05:47,  1.07s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▌      | 175/500 [01:58<05:42,  1.05s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▌      | 175/500 [01:59<05:42,  1.05s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▌      | 176/500 [01:59<05:26,  1.01s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▌      | 176/500 [02:01<05:26,  1.01s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▌      | 177/500 [02:01<05:50,  1.09s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  35%|███▌      | 177/500 [02:02<05:50,  1.09s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 178/500 [02:02<05:34,  1.04s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 178/500 [02:03<05:34,  1.04s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 179/500 [02:03<05:49,  1.09s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 179/500 [02:03<05:49,  1.09s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 180/500 [02:03<05:10,  1.03it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 180/500 [02:04<05:10,  1.03it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 181/500 [02:04<05:09,  1.03it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▌      | 181/500 [02:05<05:09,  1.03it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▋      | 182/500 [02:05<05:21,  1.01s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  36%|███▋      | 182/500 [02:07<05:21,  1.01s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 183/500 [02:07<05:47,  1.10s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 183/500 [02:08<05:47,  1.10s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 184/500 [02:08<05:46,  1.10s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 184/500 [02:09<05:46,  1.10s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 185/500 [02:09<05:39,  1.08s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 185/500 [02:10<05:39,  1.08s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 186/500 [02:10<05:50,  1.11s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 186/500 [02:11<05:50,  1.11s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 187/500 [02:11<05:25,  1.04s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  37%|███▋      | 187/500 [02:12<05:25,  1.04s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 188/500 [02:12<04:46,  1.09it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 188/500 [02:12<04:46,  1.09it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 189/500 [02:12<04:24,  1.17it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 189/500 [02:13<04:24,  1.17it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 190/500 [02:13<04:07,  1.25it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 190/500 [02:14<04:07,  1.25it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 191/500 [02:14<03:50,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 191/500 [02:14<03:50,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 192/500 [02:14<03:33,  1.44it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  38%|███▊      | 192/500 [02:15<03:33,  1.44it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▊      | 193/500 [02:15<03:45,  1.36it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▊      | 193/500 [02:16<03:45,  1.36it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 194/500 [02:16<03:41,  1.38it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 194/500 [02:17<03:41,  1.38it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 195/500 [02:17<03:47,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 195/500 [02:17<03:47,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 196/500 [02:17<03:33,  1.42it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 196/500 [02:18<03:33,  1.42it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 197/500 [02:18<03:30,  1.44it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  39%|███▉      | 197/500 [02:18<03:30,  1.44it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  40%|███▉      | 198/500 [02:18<03:23,  1.48it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  40%|███▉      | 198/500 [02:19<03:23,  1.48it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  40%|███▉      | 199/500 [02:19<03:25,  1.47it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  40%|███▉      | 199/500 [02:20<03:25,  1.47it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 1.60e-02 :  40%|████      | 200/500 [02:20<03:17,  1.52it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  40%|████      | 200/500 [02:20<03:17,  1.52it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  40%|████      | 201/500 [02:20<03:20,  1.49it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  40%|████      | 201/500 [02:21<03:20,  1.49it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  40%|████      | 202/500 [02:21<03:22,  1.47it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  40%|████      | 202/500 [02:22<03:22,  1.47it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 203/500 [02:22<03:32,  1.40it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 203/500 [02:23<03:32,  1.40it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 204/500 [02:23<03:39,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 204/500 [02:23<03:39,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 205/500 [02:23<03:26,  1.43it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 205/500 [02:24<03:26,  1.43it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 206/500 [02:24<03:34,  1.37it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████      | 206/500 [02:25<03:34,  1.37it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████▏     | 207/500 [02:25<03:48,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  41%|████▏     | 207/500 [02:26<03:48,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 208/500 [02:26<03:40,  1.32it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 208/500 [02:27<03:40,  1.32it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 209/500 [02:27<03:43,  1.30it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 209/500 [02:27<03:43,  1.30it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 210/500 [02:27<03:37,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 210/500 [02:28<03:37,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 211/500 [02:28<03:32,  1.36it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 211/500 [02:29<03:32,  1.36it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 212/500 [02:29<03:51,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  42%|████▏     | 212/500 [02:29<03:51,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 213/500 [02:29<03:33,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 213/500 [02:30<03:33,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 214/500 [02:30<03:31,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 214/500 [02:31<03:31,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 215/500 [02:31<03:52,  1.22it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 215/500 [02:32<03:52,  1.22it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 216/500 [02:32<03:33,  1.33it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 216/500 [02:32<03:33,  1.33it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 217/500 [02:32<03:25,  1.38it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  43%|████▎     | 217/500 [02:33<03:25,  1.38it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▎     | 218/500 [02:33<03:17,  1.43it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▎     | 218/500 [02:34<03:17,  1.43it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 219/500 [02:34<03:30,  1.33it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 219/500 [02:35<03:30,  1.33it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 220/500 [02:35<03:45,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 220/500 [02:35<03:45,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 221/500 [02:36<03:26,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 221/500 [02:36<03:26,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 222/500 [02:36<03:37,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  44%|████▍     | 222/500 [02:37<03:37,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▍     | 223/500 [02:37<03:31,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▍     | 223/500 [02:38<03:31,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▍     | 224/500 [02:38<03:17,  1.39it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▍     | 224/500 [02:39<03:17,  1.39it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▌     | 225/500 [02:39<03:23,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▌     | 225/500 [02:39<03:23,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▌     | 226/500 [02:39<03:19,  1.37it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▌     | 226/500 [02:40<03:19,  1.37it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▌     | 227/500 [02:40<03:22,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  45%|████▌     | 227/500 [02:41<03:22,  1.35it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 228/500 [02:41<03:28,  1.30it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 228/500 [02:42<03:28,  1.30it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 229/500 [02:42<03:22,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 229/500 [02:42<03:22,  1.34it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 230/500 [02:42<03:17,  1.36it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 230/500 [02:43<03:17,  1.36it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 231/500 [02:43<03:38,  1.23it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▌     | 231/500 [02:44<03:38,  1.23it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▋     | 232/500 [02:44<03:28,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  46%|████▋     | 232/500 [02:45<03:28,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 233/500 [02:45<03:37,  1.23it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 233/500 [02:46<03:37,  1.23it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 234/500 [02:46<03:35,  1.23it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 234/500 [02:46<03:35,  1.23it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 235/500 [02:46<03:34,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 235/500 [02:47<03:34,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 236/500 [02:47<03:32,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 236/500 [02:48<03:32,  1.24it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 237/500 [02:48<03:21,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  47%|████▋     | 237/500 [02:49<03:21,  1.31it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 238/500 [02:49<03:25,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 238/500 [02:50<03:25,  1.28it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 239/500 [02:50<04:02,  1.07it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 239/500 [02:51<04:02,  1.07it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 240/500 [02:51<04:01,  1.08it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 240/500 [02:52<04:01,  1.08it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 241/500 [02:52<04:03,  1.06it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 241/500 [02:53<04:03,  1.06it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 242/500 [02:53<03:54,  1.10it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  48%|████▊     | 242/500 [02:54<03:54,  1.10it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▊     | 243/500 [02:54<03:45,  1.14it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▊     | 243/500 [02:55<03:45,  1.14it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 244/500 [02:55<04:14,  1.01it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 244/500 [02:56<04:14,  1.01it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 245/500 [02:56<04:21,  1.03s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 245/500 [02:57<04:21,  1.03s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 246/500 [02:57<04:28,  1.06s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 246/500 [02:58<04:28,  1.06s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 247/500 [02:58<04:22,  1.04s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  49%|████▉     | 247/500 [02:59<04:22,  1.04s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  50%|████▉     | 248/500 [02:59<04:11,  1.00it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  50%|████▉     | 248/500 [03:00<04:11,  1.00it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  50%|████▉     | 249/500 [03:00<04:18,  1.03s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  50%|████▉     | 249/500 [03:01<04:18,  1.03s/it]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 6.06e-03 :  50%|█████     | 250/500 [03:01<04:00,  1.04it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 3.81e-03 :  50%|█████     | 250/500 [03:02<04:00,  1.04it/s]
[TorchDR] DR Loss : 5.63e+00 | Grad norm : 3.81e-03 :  50%|█████     | 251/500 [03:02<03:46,  1.10it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.81e-03 :  50%|█████     | 251/500 [03:03<03:46,  1.10it/s]
[TorchDR] DR Loss : 1.29e+01 | Grad norm : 3.81e-03 :  50%|█████     | 252/500 [03:03<04:15,  1.03s/it]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.81e-03 :  50%|█████     | 252/500 [03:04<04:15,  1.03s/it]
[TorchDR] DR Loss : 1.28e+01 | Grad norm : 3.81e-03 :  51%|█████     | 253/500 [03:04<04:39,  1.13s/it]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.81e-03 :  51%|█████     | 253/500 [03:05<04:39,  1.13s/it]
[TorchDR] DR Loss : 1.27e+01 | Grad norm : 3.81e-03 :  51%|█████     | 254/500 [03:05<04:08,  1.01s/it]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.81e-03 :  51%|█████     | 254/500 [03:06<04:08,  1.01s/it]
[TorchDR] DR Loss : 1.26e+01 | Grad norm : 3.81e-03 :  51%|█████     | 255/500 [03:06<03:44,  1.09it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.81e-03 :  51%|█████     | 255/500 [03:06<03:44,  1.09it/s]
[TorchDR] DR Loss : 1.25e+01 | Grad norm : 3.81e-03 :  51%|█████     | 256/500 [03:06<03:20,  1.22it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  51%|█████     | 256/500 [03:07<03:20,  1.22it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  51%|█████▏    | 257/500 [03:07<03:03,  1.32it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  51%|█████▏    | 257/500 [03:08<03:03,  1.32it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 258/500 [03:08<02:51,  1.41it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 258/500 [03:08<02:51,  1.41it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 259/500 [03:08<02:57,  1.36it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 259/500 [03:09<02:57,  1.36it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 260/500 [03:09<03:13,  1.24it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 260/500 [03:10<03:13,  1.24it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 261/500 [03:10<02:59,  1.33it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 261/500 [03:11<02:59,  1.33it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 262/500 [03:11<03:09,  1.26it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  52%|█████▏    | 262/500 [03:12<03:09,  1.26it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 263/500 [03:12<03:16,  1.21it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 263/500 [03:13<03:16,  1.21it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 264/500 [03:13<03:27,  1.14it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 264/500 [03:14<03:27,  1.14it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 265/500 [03:14<03:42,  1.06it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 265/500 [03:15<03:42,  1.06it/s]
[TorchDR] DR Loss : 1.24e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 266/500 [03:15<03:59,  1.02s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 266/500 [03:16<03:59,  1.02s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 267/500 [03:16<03:49,  1.01it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  53%|█████▎    | 267/500 [03:17<03:49,  1.01it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▎    | 268/500 [03:17<03:42,  1.04it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▎    | 268/500 [03:18<03:42,  1.04it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 269/500 [03:18<03:37,  1.06it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 269/500 [03:19<03:37,  1.06it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 270/500 [03:19<03:54,  1.02s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 270/500 [03:20<03:54,  1.02s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 271/500 [03:20<03:52,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 271/500 [03:21<03:52,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 272/500 [03:21<04:01,  1.06s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  54%|█████▍    | 272/500 [03:22<04:01,  1.06s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▍    | 273/500 [03:22<03:49,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▍    | 273/500 [03:23<03:49,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▍    | 274/500 [03:23<03:47,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▍    | 274/500 [03:24<03:47,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▌    | 275/500 [03:24<03:48,  1.02s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▌    | 275/500 [03:25<03:48,  1.02s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▌    | 276/500 [03:25<03:59,  1.07s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▌    | 276/500 [03:26<03:59,  1.07s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▌    | 277/500 [03:26<03:58,  1.07s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  55%|█████▌    | 277/500 [03:27<03:58,  1.07s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 278/500 [03:27<03:39,  1.01it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 278/500 [03:28<03:39,  1.01it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 279/500 [03:28<03:34,  1.03it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 279/500 [03:29<03:34,  1.03it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 280/500 [03:29<03:22,  1.09it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 280/500 [03:30<03:22,  1.09it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 281/500 [03:30<03:31,  1.04it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▌    | 281/500 [03:31<03:31,  1.04it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▋    | 282/500 [03:31<03:14,  1.12it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  56%|█████▋    | 282/500 [03:32<03:14,  1.12it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 283/500 [03:32<03:38,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 283/500 [03:33<03:38,  1.01s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 284/500 [03:33<03:32,  1.02it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 284/500 [03:34<03:32,  1.02it/s]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 285/500 [03:34<03:51,  1.08s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 285/500 [03:35<03:51,  1.08s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 286/500 [03:35<03:44,  1.05s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 286/500 [03:36<03:44,  1.05s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 287/500 [03:36<03:48,  1.07s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  57%|█████▋    | 287/500 [03:38<03:48,  1.07s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 288/500 [03:38<03:59,  1.13s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 288/500 [03:38<03:59,  1.13s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 289/500 [03:38<03:39,  1.04s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 289/500 [03:39<03:39,  1.04s/it]
[TorchDR] DR Loss : 1.23e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 290/500 [03:39<03:34,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 290/500 [03:40<03:34,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 291/500 [03:40<03:33,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 291/500 [03:41<03:33,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 292/500 [03:41<03:31,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  58%|█████▊    | 292/500 [03:43<03:31,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▊    | 293/500 [03:43<03:41,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▊    | 293/500 [03:44<03:41,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 294/500 [03:44<03:54,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 294/500 [03:45<03:54,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 295/500 [03:45<03:57,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 295/500 [03:47<03:57,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 296/500 [03:47<04:17,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 296/500 [03:48<04:17,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 297/500 [03:48<04:18,  1.27s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  59%|█████▉    | 297/500 [03:49<04:18,  1.27s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  60%|█████▉    | 298/500 [03:49<03:58,  1.18s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  60%|█████▉    | 298/500 [03:50<03:58,  1.18s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  60%|█████▉    | 299/500 [03:50<03:42,  1.11s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  60%|█████▉    | 299/500 [03:51<03:42,  1.11s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.81e-03 :  60%|██████    | 300/500 [03:51<03:28,  1.04s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  60%|██████    | 300/500 [03:52<03:28,  1.04s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  60%|██████    | 301/500 [03:52<04:06,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  60%|██████    | 301/500 [03:54<04:06,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  60%|██████    | 302/500 [03:54<04:03,  1.23s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  60%|██████    | 302/500 [03:55<04:03,  1.23s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 303/500 [03:55<04:00,  1.22s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 303/500 [03:56<04:00,  1.22s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 304/500 [03:56<04:03,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 304/500 [03:57<04:03,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 305/500 [03:57<04:11,  1.29s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 305/500 [03:59<04:11,  1.29s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 306/500 [03:59<04:05,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████    | 306/500 [04:00<04:05,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████▏   | 307/500 [04:00<03:54,  1.21s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  61%|██████▏   | 307/500 [04:01<03:54,  1.21s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 308/500 [04:01<03:40,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 308/500 [04:02<03:40,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 309/500 [04:02<04:05,  1.29s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 309/500 [04:03<04:05,  1.29s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 310/500 [04:03<03:36,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 310/500 [04:04<03:36,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 311/500 [04:04<03:33,  1.13s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 311/500 [04:06<03:33,  1.13s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 312/500 [04:06<03:51,  1.23s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  62%|██████▏   | 312/500 [04:07<03:51,  1.23s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 313/500 [04:07<03:44,  1.20s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 313/500 [04:08<03:44,  1.20s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 314/500 [04:08<03:47,  1.22s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 314/500 [04:09<03:47,  1.22s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 315/500 [04:09<03:33,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 315/500 [04:10<03:33,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 316/500 [04:10<03:25,  1.12s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 316/500 [04:12<03:25,  1.12s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 317/500 [04:12<03:45,  1.23s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  63%|██████▎   | 317/500 [04:13<03:45,  1.23s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▎   | 318/500 [04:13<03:31,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▎   | 318/500 [04:14<03:31,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 319/500 [04:14<03:52,  1.29s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 319/500 [04:16<03:52,  1.29s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 320/500 [04:16<03:57,  1.32s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 320/500 [04:17<03:57,  1.32s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 321/500 [04:17<03:39,  1.22s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 321/500 [04:18<03:39,  1.22s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 322/500 [04:18<03:32,  1.20s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  64%|██████▍   | 322/500 [04:19<03:32,  1.20s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▍   | 323/500 [04:19<03:42,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▍   | 323/500 [04:20<03:42,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▍   | 324/500 [04:20<03:41,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▍   | 324/500 [04:22<03:41,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▌   | 325/500 [04:22<03:38,  1.25s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▌   | 325/500 [04:23<03:38,  1.25s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▌   | 326/500 [04:23<03:35,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▌   | 326/500 [04:24<03:35,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▌   | 327/500 [04:24<03:51,  1.34s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  65%|██████▌   | 327/500 [04:25<03:51,  1.34s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 328/500 [04:25<03:23,  1.18s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 328/500 [04:27<03:23,  1.18s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 329/500 [04:27<03:37,  1.27s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 329/500 [04:27<03:37,  1.27s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 330/500 [04:27<03:08,  1.11s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 330/500 [04:28<03:08,  1.11s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 331/500 [04:28<02:46,  1.01it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▌   | 331/500 [04:29<02:46,  1.01it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▋   | 332/500 [04:29<02:31,  1.11it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  66%|██████▋   | 332/500 [04:30<02:31,  1.11it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 333/500 [04:30<02:23,  1.16it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 333/500 [04:31<02:23,  1.16it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 334/500 [04:31<02:34,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 334/500 [04:32<02:34,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 335/500 [04:32<02:33,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 335/500 [04:33<02:33,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 336/500 [04:33<02:31,  1.08it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 336/500 [04:34<02:31,  1.08it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 337/500 [04:34<02:34,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  67%|██████▋   | 337/500 [04:35<02:34,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 338/500 [04:35<02:35,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 338/500 [04:36<02:35,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 339/500 [04:36<02:36,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 339/500 [04:36<02:36,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 340/500 [04:36<02:30,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 340/500 [04:38<02:30,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 341/500 [04:38<02:43,  1.03s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 341/500 [04:39<02:43,  1.03s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 342/500 [04:39<02:49,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  68%|██████▊   | 342/500 [04:40<02:49,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▊   | 343/500 [04:40<02:40,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▊   | 343/500 [04:41<02:40,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 344/500 [04:41<02:47,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 344/500 [04:42<02:47,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 345/500 [04:42<02:38,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 345/500 [04:43<02:38,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 346/500 [04:43<02:28,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 346/500 [04:43<02:28,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 347/500 [04:43<02:15,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  69%|██████▉   | 347/500 [04:44<02:15,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  70%|██████▉   | 348/500 [04:44<02:09,  1.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  70%|██████▉   | 348/500 [04:45<02:09,  1.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  70%|██████▉   | 349/500 [04:45<02:19,  1.08it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  70%|██████▉   | 349/500 [04:46<02:19,  1.08it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.62e-02 :  70%|███████   | 350/500 [04:46<02:08,  1.17it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  70%|███████   | 350/500 [04:47<02:08,  1.17it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  70%|███████   | 351/500 [04:47<02:11,  1.14it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  70%|███████   | 351/500 [04:48<02:11,  1.14it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  70%|███████   | 352/500 [04:48<02:19,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  70%|███████   | 352/500 [04:49<02:19,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 353/500 [04:49<02:21,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 353/500 [04:50<02:21,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 354/500 [04:50<02:13,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 354/500 [04:51<02:13,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 355/500 [04:51<02:32,  1.05s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 355/500 [04:52<02:32,  1.05s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 356/500 [04:52<02:38,  1.10s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████   | 356/500 [04:53<02:38,  1.10s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████▏  | 357/500 [04:53<02:36,  1.09s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  71%|███████▏  | 357/500 [04:54<02:36,  1.09s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 358/500 [04:54<02:31,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 358/500 [04:55<02:31,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 359/500 [04:55<02:28,  1.05s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 359/500 [04:56<02:28,  1.05s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 360/500 [04:56<02:24,  1.03s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 360/500 [04:57<02:24,  1.03s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 361/500 [04:57<02:18,  1.00it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 361/500 [04:58<02:18,  1.00it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 362/500 [04:58<02:20,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  72%|███████▏  | 362/500 [04:59<02:20,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 363/500 [04:59<02:20,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 363/500 [05:00<02:20,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 364/500 [05:00<02:17,  1.01s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 364/500 [05:01<02:17,  1.01s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 365/500 [05:01<02:07,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 365/500 [05:02<02:07,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 366/500 [05:02<02:06,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 366/500 [05:03<02:06,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 367/500 [05:03<02:06,  1.05it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  73%|███████▎  | 367/500 [05:04<02:06,  1.05it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▎  | 368/500 [05:04<02:00,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▎  | 368/500 [05:05<02:00,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 369/500 [05:05<01:51,  1.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 369/500 [05:06<01:51,  1.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 370/500 [05:06<02:02,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 370/500 [05:07<02:02,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 371/500 [05:07<02:00,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 371/500 [05:08<02:00,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 372/500 [05:08<02:05,  1.02it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  74%|███████▍  | 372/500 [05:09<02:05,  1.02it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▍  | 373/500 [05:09<02:06,  1.00it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▍  | 373/500 [05:10<02:06,  1.00it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▍  | 374/500 [05:10<02:01,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▍  | 374/500 [05:11<02:01,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▌  | 375/500 [05:11<02:05,  1.01s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▌  | 375/500 [05:12<02:05,  1.01s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▌  | 376/500 [05:12<01:53,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▌  | 376/500 [05:12<01:53,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▌  | 377/500 [05:12<01:40,  1.22it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  75%|███████▌  | 377/500 [05:13<01:40,  1.22it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 378/500 [05:13<01:39,  1.23it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 378/500 [05:14<01:39,  1.23it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 379/500 [05:14<01:45,  1.15it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 379/500 [05:15<01:45,  1.15it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 380/500 [05:15<01:34,  1.27it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 380/500 [05:15<01:34,  1.27it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 381/500 [05:15<01:34,  1.26it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▌  | 381/500 [05:16<01:34,  1.26it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▋  | 382/500 [05:16<01:44,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  76%|███████▋  | 382/500 [05:17<01:44,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 383/500 [05:17<01:47,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 383/500 [05:18<01:47,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 384/500 [05:18<01:42,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 384/500 [05:19<01:42,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 385/500 [05:19<01:38,  1.17it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 385/500 [05:20<01:38,  1.17it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 386/500 [05:20<01:46,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 386/500 [05:21<01:46,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 387/500 [05:21<01:39,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  77%|███████▋  | 387/500 [05:21<01:39,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 388/500 [05:21<01:26,  1.30it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 388/500 [05:22<01:26,  1.30it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 389/500 [05:22<01:20,  1.38it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 389/500 [05:23<01:20,  1.38it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 390/500 [05:23<01:19,  1.39it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 390/500 [05:23<01:19,  1.39it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 391/500 [05:24<01:20,  1.35it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 391/500 [05:25<01:20,  1.35it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 392/500 [05:25<01:31,  1.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  78%|███████▊  | 392/500 [05:26<01:31,  1.18it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▊  | 393/500 [05:26<01:42,  1.05it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▊  | 393/500 [05:27<01:42,  1.05it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 394/500 [05:27<02:01,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 394/500 [05:29<02:01,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 395/500 [05:29<02:21,  1.34s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 395/500 [05:30<02:21,  1.34s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 396/500 [05:30<02:09,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 396/500 [05:31<02:09,  1.24s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 397/500 [05:32<02:09,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  79%|███████▉  | 397/500 [05:33<02:09,  1.26s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  80%|███████▉  | 398/500 [05:33<02:03,  1.21s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  80%|███████▉  | 398/500 [05:34<02:03,  1.21s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  80%|███████▉  | 399/500 [05:34<01:55,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  80%|███████▉  | 399/500 [05:35<01:55,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 3.92e-03 :  80%|████████  | 400/500 [05:35<01:55,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  80%|████████  | 400/500 [05:36<01:55,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  80%|████████  | 401/500 [05:36<01:52,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  80%|████████  | 401/500 [05:37<01:52,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  80%|████████  | 402/500 [05:37<01:53,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  80%|████████  | 402/500 [05:38<01:53,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 403/500 [05:38<01:48,  1.12s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 403/500 [05:39<01:48,  1.12s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 404/500 [05:39<01:35,  1.01it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 404/500 [05:40<01:35,  1.01it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 405/500 [05:40<01:36,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 405/500 [05:41<01:36,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 406/500 [05:41<01:35,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████  | 406/500 [05:42<01:35,  1.02s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████▏ | 407/500 [05:42<01:42,  1.10s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  81%|████████▏ | 407/500 [05:43<01:42,  1.10s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 408/500 [05:43<01:44,  1.13s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 408/500 [05:45<01:44,  1.13s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 409/500 [05:45<01:44,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 409/500 [05:46<01:44,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 410/500 [05:46<01:44,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 410/500 [05:47<01:44,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 411/500 [05:47<01:42,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 411/500 [05:48<01:42,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 412/500 [05:48<01:41,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  82%|████████▏ | 412/500 [05:49<01:41,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 413/500 [05:49<01:39,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 413/500 [05:50<01:39,  1.15s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 414/500 [05:50<01:40,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 414/500 [05:52<01:40,  1.16s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 415/500 [05:52<01:39,  1.17s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 415/500 [05:53<01:39,  1.17s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 416/500 [05:53<01:36,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 416/500 [05:54<01:36,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 417/500 [05:54<01:34,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  83%|████████▎ | 417/500 [05:55<01:34,  1.14s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▎ | 418/500 [05:55<01:32,  1.13s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▎ | 418/500 [05:56<01:32,  1.13s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 419/500 [05:56<01:37,  1.21s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 419/500 [05:57<01:37,  1.21s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 420/500 [05:57<01:33,  1.17s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 420/500 [05:58<01:33,  1.17s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 421/500 [05:58<01:24,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 421/500 [05:59<01:24,  1.07s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 422/500 [05:59<01:21,  1.05s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  84%|████████▍ | 422/500 [06:00<01:21,  1.05s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▍ | 423/500 [06:00<01:14,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▍ | 423/500 [06:01<01:14,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▍ | 424/500 [06:01<01:14,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▍ | 424/500 [06:02<01:14,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▌ | 425/500 [06:02<01:09,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▌ | 425/500 [06:03<01:09,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▌ | 426/500 [06:03<01:12,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▌ | 426/500 [06:04<01:12,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▌ | 427/500 [06:04<01:09,  1.05it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  85%|████████▌ | 427/500 [06:05<01:09,  1.05it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 428/500 [06:05<01:07,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 428/500 [06:05<01:07,  1.06it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 429/500 [06:05<01:03,  1.12it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 429/500 [06:07<01:03,  1.12it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 430/500 [06:07<01:07,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 430/500 [06:07<01:07,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 431/500 [06:07<01:01,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▌ | 431/500 [06:08<01:01,  1.13it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▋ | 432/500 [06:08<00:58,  1.16it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  86%|████████▋ | 432/500 [06:09<00:58,  1.16it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 433/500 [06:09<01:00,  1.11it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 433/500 [06:10<01:00,  1.11it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 434/500 [06:10<01:03,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 434/500 [06:11<01:03,  1.04it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 435/500 [06:11<01:03,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 435/500 [06:12<01:03,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 436/500 [06:12<01:06,  1.03s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 436/500 [06:13<01:06,  1.03s/it]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 437/500 [06:13<01:01,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  87%|████████▋ | 437/500 [06:14<01:01,  1.03it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 438/500 [06:14<00:57,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 438/500 [06:15<00:57,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 439/500 [06:15<00:55,  1.10it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 439/500 [06:16<00:55,  1.10it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 440/500 [06:16<00:56,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 440/500 [06:17<00:56,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 441/500 [06:17<00:54,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 441/500 [06:18<00:54,  1.07it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 442/500 [06:18<00:57,  1.02it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  88%|████████▊ | 442/500 [06:19<00:57,  1.02it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▊ | 443/500 [06:19<00:52,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▊ | 443/500 [06:19<00:52,  1.09it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 444/500 [06:19<00:44,  1.25it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 444/500 [06:20<00:44,  1.25it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 445/500 [06:20<00:39,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 445/500 [06:20<00:39,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 446/500 [06:20<00:34,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 446/500 [06:21<00:34,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 447/500 [06:21<00:37,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  89%|████████▉ | 447/500 [06:22<00:37,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  90%|████████▉ | 448/500 [06:22<00:38,  1.34it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  90%|████████▉ | 448/500 [06:23<00:38,  1.34it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  90%|████████▉ | 449/500 [06:23<00:38,  1.31it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  90%|████████▉ | 449/500 [06:23<00:38,  1.31it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.24e-03 :  90%|█████████ | 450/500 [06:24<00:38,  1.29it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  90%|█████████ | 450/500 [06:24<00:38,  1.29it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  90%|█████████ | 451/500 [06:24<00:34,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  90%|█████████ | 451/500 [06:25<00:34,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  90%|█████████ | 452/500 [06:25<00:35,  1.33it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  90%|█████████ | 452/500 [06:26<00:35,  1.33it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 453/500 [06:26<00:33,  1.42it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 453/500 [06:26<00:33,  1.42it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 454/500 [06:26<00:32,  1.42it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 454/500 [06:27<00:32,  1.42it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 455/500 [06:27<00:30,  1.49it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 455/500 [06:27<00:30,  1.49it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 456/500 [06:27<00:28,  1.56it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████ | 456/500 [06:28<00:28,  1.56it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████▏| 457/500 [06:28<00:27,  1.57it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  91%|█████████▏| 457/500 [06:29<00:27,  1.57it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 458/500 [06:29<00:26,  1.60it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 458/500 [06:29<00:26,  1.60it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 459/500 [06:29<00:28,  1.43it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 459/500 [06:30<00:28,  1.43it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 460/500 [06:30<00:27,  1.47it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 460/500 [06:31<00:27,  1.47it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 461/500 [06:31<00:25,  1.55it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 461/500 [06:31<00:25,  1.55it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 462/500 [06:31<00:24,  1.56it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  92%|█████████▏| 462/500 [06:32<00:24,  1.56it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 463/500 [06:32<00:25,  1.45it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 463/500 [06:33<00:25,  1.45it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 464/500 [06:33<00:25,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 464/500 [06:33<00:25,  1.40it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 465/500 [06:33<00:23,  1.52it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 465/500 [06:34<00:23,  1.52it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 466/500 [06:34<00:20,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 466/500 [06:35<00:20,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 467/500 [06:35<00:20,  1.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  93%|█████████▎| 467/500 [06:35<00:20,  1.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▎| 468/500 [06:35<00:20,  1.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▎| 468/500 [06:36<00:20,  1.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 469/500 [06:36<00:20,  1.49it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 469/500 [06:37<00:20,  1.49it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 470/500 [06:37<00:19,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 470/500 [06:37<00:19,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 471/500 [06:37<00:17,  1.63it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 471/500 [06:38<00:17,  1.63it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 472/500 [06:38<00:17,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  94%|█████████▍| 472/500 [06:38<00:17,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▍| 473/500 [06:38<00:16,  1.65it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▍| 473/500 [06:39<00:16,  1.65it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▍| 474/500 [06:39<00:14,  1.74it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▍| 474/500 [06:39<00:14,  1.74it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▌| 475/500 [06:39<00:14,  1.72it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▌| 475/500 [06:40<00:14,  1.72it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▌| 476/500 [06:40<00:13,  1.73it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▌| 476/500 [06:41<00:13,  1.73it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▌| 477/500 [06:41<00:14,  1.60it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  95%|█████████▌| 477/500 [06:41<00:14,  1.60it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 478/500 [06:41<00:13,  1.62it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 478/500 [06:42<00:13,  1.62it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 479/500 [06:42<00:12,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 479/500 [06:42<00:12,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 480/500 [06:42<00:11,  1.73it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 480/500 [06:43<00:11,  1.73it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 481/500 [06:43<00:10,  1.80it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▌| 481/500 [06:44<00:10,  1.80it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▋| 482/500 [06:44<00:10,  1.70it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  96%|█████████▋| 482/500 [06:44<00:10,  1.70it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 483/500 [06:44<00:09,  1.75it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 483/500 [06:45<00:09,  1.75it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 484/500 [06:45<00:09,  1.75it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 484/500 [06:46<00:09,  1.75it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 485/500 [06:46<00:09,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 485/500 [06:46<00:09,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 486/500 [06:46<00:08,  1.60it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 486/500 [06:47<00:08,  1.60it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 487/500 [06:47<00:07,  1.67it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  97%|█████████▋| 487/500 [06:47<00:07,  1.67it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 488/500 [06:47<00:07,  1.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 488/500 [06:48<00:07,  1.59it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 489/500 [06:48<00:07,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 489/500 [06:49<00:07,  1.54it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 490/500 [06:49<00:06,  1.44it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 490/500 [06:49<00:06,  1.44it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 491/500 [06:49<00:05,  1.52it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 491/500 [06:50<00:05,  1.52it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 492/500 [06:50<00:04,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  98%|█████████▊| 492/500 [06:51<00:04,  1.64it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▊| 493/500 [06:51<00:04,  1.48it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▊| 493/500 [06:51<00:04,  1.48it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 494/500 [06:51<00:04,  1.48it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 494/500 [06:52<00:04,  1.48it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 495/500 [06:52<00:03,  1.45it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 495/500 [06:53<00:03,  1.45it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 496/500 [06:53<00:02,  1.51it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 496/500 [06:53<00:02,  1.51it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 497/500 [06:53<00:01,  1.63it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 :  99%|█████████▉| 497/500 [06:54<00:01,  1.63it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 : 100%|█████████▉| 498/500 [06:54<00:01,  1.56it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 : 100%|█████████▉| 498/500 [06:54<00:01,  1.56it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 : 100%|█████████▉| 499/500 [06:54<00:00,  1.67it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 : 100%|█████████▉| 499/500 [06:55<00:00,  1.67it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 : 100%|██████████| 500/500 [06:55<00:00,  1.70it/s]
[TorchDR] DR Loss : 1.22e+01 | Grad norm : 1.56e-04 : 100%|██████████| 500/500 [06:55<00:00,  1.20it/s]

Total running time of the script: (12 minutes 54.890 seconds)

Gallery generated by Sphinx-Gallery