Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------

The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2026-01-05 00:02:45.668229: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:07.442093: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:05:07.456776: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-01-05 00:05:07.533149: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:05:07.533200: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:08.093201: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:05:08.093269: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:05:08.467120: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:05:09.037090: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:05:09.596215: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:05:09.818763: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:05:09.986672: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:05:09.987073: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:05:09.987505: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-01-05 00:05:09.989142: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:05:09.989402: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:05:09.989421: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:09.989435: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:05:09.989444: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:05:09.989453: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:05:09.989462: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:05:09.989471: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:05:09.989480: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:05:09.989500: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:05:09.989786: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:05:09.989807: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:12.373659: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-01-05 00:05:12.373745: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-01-05 00:05:12.373757: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-01-05 00:05:12.374429: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-01-05 00:05:24.537240: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-01-05 00:05:24.537808: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-01-05 00:05:25.660621: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:05:27.426252: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:05:27.433095: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:05:41.569438: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-01-05 00:05:41.641999: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-01-05 00:06:07.039974: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2026-01-05 00:16:03.312291: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2026-01-05 00:16:07.024736: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:40.070217: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:16:40.071058: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-01-05 00:16:40.100842: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:16:40.100919: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:40.107053: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:16:40.107132: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:16:40.110024: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:16:40.113206: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:16:40.117908: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:16:40.120824: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:16:40.122321: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:16:40.122692: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:16:40.122994: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-01-05 00:16:40.124687: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:16:40.124924: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:16:40.124945: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:40.124959: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:16:40.124969: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:16:40.124979: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:16:40.124989: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:16:40.124999: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:16:40.125009: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:16:40.125019: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:16:40.125296: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:16:40.125317: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:40.539314: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-01-05 00:16:40.539400: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-01-05 00:16:40.539411: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-01-05 00:16:40.540112: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/128 [00:00<?, ?it/s]
2026-01-05 00:16:42.114824: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-01-05 00:16:42.115417: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-01-05 00:16:42.308117: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:16:42.796206: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:16:42.798332: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:16:44.281364: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-01-05 00:16:44.350201: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch: 100%|██████████| 128/128 [00:53<00:00,  2.41it/s]
100%|██████████| 8160/8160 [00:07<00:00, 1100.78it/s]
100%|██████████| 8160/8160 [00:07<00:00, 1107.80it/s]
2026-01-05 00:18:06.937854: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:18:15.335133: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:18:15.336256: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-01-05 00:18:15.363742: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:18:15.363850: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:18:15.368138: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:18:15.368245: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:18:15.370165: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:18:15.371448: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:18:15.374614: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:18:15.376328: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:18:15.377408: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:18:15.377834: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:18:15.378487: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-01-05 00:18:15.380151: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:18:15.380443: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:18:15.380477: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:18:15.380494: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:18:15.380507: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:18:15.380520: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:18:15.380533: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:18:15.380546: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:18:15.380558: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:18:15.380576: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:18:15.380869: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:18:15.380896: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:18:15.786207: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-01-05 00:18:15.786298: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-01-05 00:18:15.786309: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-01-05 00:18:15.787086: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-01-05 00:18:15.829476: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2026-01-05 00:18:15.844204: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-01-05 00:18:21.903837: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:18:22.389099: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:18:22.393289: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:18:23.891062: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-01-05 00:18:23.973980: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-01-05 00:18:40.237650: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
