Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------




The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2026-05-28 11:41:12.211380: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:27.678287: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:43:27.679508: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:43:27.697350: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:ca:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:43:27.697428: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:27.725843: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:43:27.725991: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:43:27.737829: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:43:27.749325: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:43:27.762449: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:43:27.850254: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:43:28.155035: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:43:28.155523: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:43:28.155893: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:43:28.156042: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:43:28.156261: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:ca:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:43:28.156279: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:28.156294: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:43:28.156304: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:43:28.156313: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:43:28.156322: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:43:28.156331: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:43:28.156340: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:43:28.156366: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:43:28.156640: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:43:28.156659: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:30.063883: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:43:30.063974: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:43:30.063985: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:43:30.064624: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:ca:00.0, compute capability: 8.9)
2026-05-28 11:43:31.942508: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 11:43:31.943045: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:43:36.517845: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:43:37.728874: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:43:37.733775: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:43:50.336995: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:43:50.413226: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 11:44:14.515763: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2026-05-28 11:53:14.491745: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2026-05-28 11:53:23.016437: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:53:45.709244: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:53:45.714478: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:53:45.742067: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:ca:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:53:45.742174: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:53:45.748141: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:53:45.748252: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:53:45.751691: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:53:45.753511: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:53:45.757203: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:53:45.759553: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:53:45.761659: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:53:45.762107: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:53:45.762400: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:53:45.762564: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:53:45.762913: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:ca:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:53:45.762942: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:53:45.762958: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:53:45.762968: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:53:45.762979: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:53:45.762989: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:53:45.762998: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:53:45.763008: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:53:45.763018: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:53:45.763307: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:53:45.763330: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:53:46.197855: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:53:46.197951: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:53:46.197960: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:53:46.198626: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:ca:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/80 [00:00<?, ?it/s]2026-05-28 11:53:47.789762: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 11:53:47.790296: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:53:48.004477: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:53:48.542685: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:53:48.544373: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:53:50.160185: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:53:50.250775: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch:   1%|▏         | 1/80 [00:22<29:32, 22.43s/it]batch:   2%|▎         | 2/80 [00:22<12:11,  9.38s/it]batch:   4%|▍         | 3/80 [00:22<06:40,  5.20s/it]batch:   5%|▌         | 4/80 [00:23<04:06,  3.24s/it]batch:   6%|▋         | 5/80 [00:23<02:41,  2.16s/it]batch:   8%|▊         | 6/80 [00:23<01:51,  1.50s/it]batch:   9%|▉         | 7/80 [00:23<01:19,  1.09s/it]batch:  10%|█         | 8/80 [00:24<00:58,  1.23it/s]batch:  11%|█▏        | 9/80 [00:24<00:44,  1.58it/s]batch:  12%|█▎        | 10/80 [00:24<00:35,  1.97it/s]batch:  14%|█▍        | 11/80 [00:24<00:29,  2.36it/s]batch:  15%|█▌        | 12/80 [00:24<00:24,  2.74it/s]batch:  16%|█▋        | 13/80 [00:25<00:21,  3.08it/s]batch:  18%|█▊        | 14/80 [00:25<00:19,  3.37it/s]batch:  19%|█▉        | 15/80 [00:25<00:18,  3.60it/s]batch:  20%|██        | 16/80 [00:25<00:16,  3.79it/s]batch:  21%|██▏       | 17/80 [00:26<00:16,  3.92it/s]batch:  22%|██▎       | 18/80 [00:26<00:15,  4.03it/s]batch:  24%|██▍       | 19/80 [00:26<00:14,  4.10it/s]batch:  25%|██▌       | 20/80 [00:26<00:14,  4.15it/s]batch:  26%|██▋       | 21/80 [00:27<00:14,  4.20it/s]batch:  28%|██▊       | 22/80 [00:27<00:13,  4.23it/s]batch:  29%|██▉       | 23/80 [00:27<00:13,  4.25it/s]batch:  30%|███       | 24/80 [00:27<00:13,  4.27it/s]batch:  31%|███▏      | 25/80 [00:28<00:12,  4.26it/s]batch:  32%|███▎      | 26/80 [00:28<00:12,  4.28it/s]batch:  34%|███▍      | 27/80 [00:28<00:12,  4.28it/s]batch:  35%|███▌      | 28/80 [00:28<00:12,  4.28it/s]batch:  36%|███▋      | 29/80 [00:28<00:11,  4.27it/s]batch:  38%|███▊      | 30/80 [00:29<00:11,  4.28it/s]batch:  39%|███▉      | 31/80 [00:29<00:11,  4.29it/s]batch:  40%|████      | 32/80 [00:29<00:11,  4.28it/s]batch:  41%|████▏     | 33/80 [00:29<00:10,  4.29it/s]batch:  42%|████▎     | 34/80 [00:30<00:10,  4.28it/s]batch:  44%|████▍     | 35/80 [00:30<00:10,  4.27it/s]batch:  45%|████▌     | 36/80 [00:30<00:10,  4.27it/s]batch:  46%|████▋     | 37/80 [00:30<00:10,  4.26it/s]batch:  48%|████▊     | 38/80 [00:31<00:09,  4.28it/s]batch:  49%|████▉     | 39/80 [00:31<00:09,  4.25it/s]batch:  50%|█████     | 40/80 [00:31<00:09,  4.27it/s]batch:  51%|█████▏    | 41/80 [00:31<00:09,  4.28it/s]batch:  52%|█████▎    | 42/80 [00:31<00:08,  4.29it/s]batch:  54%|█████▍    | 43/80 [00:32<00:08,  4.30it/s]batch:  55%|█████▌    | 44/80 [00:32<00:08,  4.28it/s]batch:  56%|█████▋    | 45/80 [00:32<00:08,  4.29it/s]batch:  57%|█████▊    | 46/80 [00:32<00:07,  4.30it/s]batch:  59%|█████▉    | 47/80 [00:33<00:07,  4.30it/s]batch:  60%|██████    | 48/80 [00:33<00:07,  4.31it/s]batch:  61%|██████▏   | 49/80 [00:33<00:07,  4.29it/s]batch:  62%|██████▎   | 50/80 [00:33<00:06,  4.29it/s]batch:  64%|██████▍   | 51/80 [00:34<00:06,  4.28it/s]batch:  65%|██████▌   | 52/80 [00:34<00:06,  4.29it/s]batch:  66%|██████▋   | 53/80 [00:34<00:06,  4.30it/s]batch:  68%|██████▊   | 54/80 [00:34<00:06,  4.28it/s]batch:  69%|██████▉   | 55/80 [00:35<00:05,  4.29it/s]batch:  70%|███████   | 56/80 [00:35<00:05,  4.30it/s]batch:  71%|███████▏  | 57/80 [00:35<00:05,  4.31it/s]batch:  72%|███████▎  | 58/80 [00:35<00:05,  4.30it/s]batch:  74%|███████▍  | 59/80 [00:35<00:04,  4.29it/s]batch:  75%|███████▌  | 60/80 [00:36<00:04,  4.30it/s]batch:  76%|███████▋  | 61/80 [00:36<00:04,  4.28it/s]batch:  78%|███████▊  | 62/80 [00:36<00:04,  4.30it/s]batch:  79%|███████▉  | 63/80 [00:36<00:03,  4.28it/s]batch:  80%|████████  | 64/80 [00:37<00:03,  4.30it/s]batch:  81%|████████▏ | 65/80 [00:37<00:03,  4.29it/s]batch:  82%|████████▎ | 66/80 [00:37<00:03,  3.69it/s]batch:  84%|████████▍ | 67/80 [00:37<00:03,  3.84it/s]batch:  85%|████████▌ | 68/80 [00:38<00:03,  3.96it/s]batch:  86%|████████▋ | 69/80 [00:38<00:02,  4.05it/s]batch:  88%|████████▊ | 70/80 [00:38<00:02,  4.11it/s]batch:  89%|████████▉ | 71/80 [00:38<00:02,  4.16it/s]batch:  90%|█████████ | 72/80 [00:39<00:01,  4.20it/s]batch:  91%|█████████▏| 73/80 [00:39<00:01,  4.22it/s]batch:  92%|█████████▎| 74/80 [00:39<00:01,  4.25it/s]batch:  94%|█████████▍| 75/80 [00:39<00:01,  4.26it/s]batch:  95%|█████████▌| 76/80 [00:40<00:00,  4.25it/s]batch:  96%|█████████▋| 77/80 [00:40<00:00,  4.26it/s]batch:  98%|█████████▊| 78/80 [00:40<00:00,  4.27it/s]batch:  99%|█████████▉| 79/80 [00:40<00:00,  4.28it/s]batch: 100%|██████████| 80/80 [00:40<00:00,  4.83it/s]batch: 100%|██████████| 80/80 [00:40<00:00,  1.96it/s]
  0%|          | 0/5091 [00:00<?, ?it/s]  2%|▏         | 119/5091 [00:00<00:04, 1188.91it/s]  5%|▍         | 238/5091 [00:00<00:04, 1180.04it/s]  7%|▋         | 358/5091 [00:00<00:04, 1181.06it/s]  9%|▉         | 479/5091 [00:00<00:03, 1185.79it/s] 12%|█▏        | 598/5091 [00:00<00:03, 1186.63it/s] 14%|█▍        | 717/5091 [00:00<00:03, 1178.35it/s] 16%|█▋        | 836/5091 [00:00<00:03, 1179.82it/s] 19%|█▉        | 955/5091 [00:00<00:03, 1180.13it/s] 21%|██        | 1074/5091 [00:00<00:03, 1175.01it/s] 23%|██▎       | 1192/5091 [00:01<00:03, 1172.28it/s] 26%|██▌       | 1310/5091 [00:01<00:03, 1168.87it/s] 28%|██▊       | 1427/5091 [00:01<00:03, 1166.00it/s] 30%|███       | 1544/5091 [00:01<00:03, 1164.54it/s] 33%|███▎      | 1661/5091 [00:01<00:02, 1159.28it/s] 35%|███▍      | 1777/5091 [00:01<00:02, 1155.14it/s] 37%|███▋      | 1893/5091 [00:01<00:02, 1150.23it/s] 39%|███▉      | 2009/5091 [00:01<00:02, 1146.95it/s] 42%|████▏     | 2124/5091 [00:01<00:02, 1142.33it/s] 44%|████▍     | 2239/5091 [00:01<00:02, 1143.30it/s] 46%|████▌     | 2354/5091 [00:02<00:02, 1139.61it/s] 48%|████▊     | 2468/5091 [00:02<00:02, 1136.72it/s] 51%|█████     | 2582/5091 [00:02<00:02, 1131.82it/s] 53%|█████▎    | 2696/5091 [00:02<00:02, 1133.54it/s] 55%|█████▌    | 2810/5091 [00:02<00:02, 1128.22it/s] 57%|█████▋    | 2924/5091 [00:02<00:01, 1125.49it/s] 60%|█████▉    | 3037/5091 [00:02<00:01, 1123.53it/s] 62%|██████▏   | 3150/5091 [00:02<00:01, 1123.93it/s] 64%|██████▍   | 3263/5091 [00:02<00:01, 1117.75it/s] 66%|██████▋   | 3375/5091 [00:02<00:01, 1116.79it/s] 68%|██████▊   | 3487/5091 [00:03<00:01, 1111.29it/s] 71%|███████   | 3599/5091 [00:03<00:01, 1108.11it/s] 73%|███████▎  | 3710/5091 [00:03<00:01, 1103.13it/s] 75%|███████▌  | 3821/5091 [00:03<00:01, 1101.67it/s] 77%|███████▋  | 3932/5091 [00:03<00:01, 1099.60it/s] 79%|███████▉  | 4042/5091 [00:03<00:00, 1098.05it/s] 82%|████████▏ | 4152/5091 [00:03<00:00, 1097.25it/s] 84%|████████▎ | 4262/5091 [00:03<00:00, 1088.11it/s] 86%|████████▌ | 4371/5091 [00:03<00:00, 1087.71it/s] 88%|████████▊ | 4481/5091 [00:03<00:00, 1086.10it/s] 90%|█████████ | 4590/5091 [00:04<00:00, 1082.83it/s] 92%|█████████▏| 4699/5091 [00:04<00:00, 1083.66it/s] 94%|█████████▍| 4808/5091 [00:04<00:00, 1077.77it/s] 97%|█████████▋| 4916/5091 [00:04<00:00, 1078.06it/s] 99%|█████████▊| 5024/5091 [00:04<00:00, 1074.34it/s]100%|██████████| 5091/5091 [00:04<00:00, 1126.27it/s]
  0%|          | 0/5091 [00:00<?, ?it/s]  2%|▏         | 119/5091 [00:00<00:04, 1189.62it/s]  5%|▍         | 238/5091 [00:00<00:04, 1180.06it/s]  7%|▋         | 358/5091 [00:00<00:04, 1181.44it/s]  9%|▉         | 479/5091 [00:00<00:03, 1185.34it/s] 12%|█▏        | 598/5091 [00:00<00:03, 1186.67it/s] 14%|█▍        | 717/5091 [00:00<00:03, 1178.13it/s] 16%|█▋        | 836/5091 [00:00<00:03, 1179.54it/s] 19%|█▉        | 955/5091 [00:00<00:03, 1179.33it/s] 21%|██        | 1073/5091 [00:00<00:03, 1175.80it/s] 23%|██▎       | 1192/5091 [00:01<00:03, 1174.71it/s] 26%|██▌       | 1310/5091 [00:01<00:03, 1170.88it/s] 28%|██▊       | 1428/5091 [00:01<00:03, 1169.29it/s] 30%|███       | 1545/5091 [00:01<00:03, 1161.48it/s] 33%|███▎      | 1662/5091 [00:01<00:02, 1162.86it/s] 35%|███▍      | 1779/5091 [00:01<00:02, 1153.80it/s] 37%|███▋      | 1895/5091 [00:01<00:02, 1148.78it/s] 39%|███▉      | 2010/5091 [00:01<00:02, 1144.72it/s] 42%|████▏     | 2125/5091 [00:01<00:02, 1140.53it/s] 44%|████▍     | 2241/5091 [00:01<00:02, 1144.77it/s] 46%|████▋     | 2356/5091 [00:02<00:02, 1140.66it/s] 49%|████▊     | 2471/5091 [00:02<00:02, 1135.65it/s] 51%|█████     | 2585/5091 [00:02<00:02, 1130.77it/s] 53%|█████▎    | 2699/5091 [00:02<00:02, 1133.23it/s] 55%|█████▌    | 2813/5091 [00:02<00:02, 1127.25it/s] 57%|█████▋    | 2926/5091 [00:02<00:01, 1127.76it/s] 60%|█████▉    | 3039/5091 [00:02<00:01, 1118.18it/s] 62%|██████▏   | 3152/5091 [00:02<00:01, 1120.00it/s] 64%|██████▍   | 3265/5091 [00:02<00:01, 1120.49it/s] 66%|██████▋   | 3378/5091 [00:02<00:01, 1115.33it/s] 69%|██████▊   | 3490/5091 [00:03<00:01, 1110.08it/s] 71%|███████   | 3602/5091 [00:03<00:01, 1107.60it/s] 73%|███████▎  | 3713/5091 [00:03<00:01, 1102.80it/s] 75%|███████▌  | 3824/5091 [00:03<00:01, 1104.46it/s] 77%|███████▋  | 3935/5091 [00:03<00:01, 1100.69it/s] 79%|███████▉  | 4046/5091 [00:03<00:00, 1101.23it/s] 82%|████████▏ | 4157/5091 [00:03<00:00, 1096.89it/s] 84%|████████▍ | 4267/5091 [00:03<00:00, 1092.87it/s] 86%|████████▌ | 4377/5091 [00:03<00:00, 1087.62it/s] 88%|████████▊ | 4487/5091 [00:03<00:00, 1086.29it/s] 90%|█████████ | 4596/5091 [00:04<00:00, 1083.03it/s] 92%|█████████▏| 4705/5091 [00:04<00:00, 1083.94it/s] 95%|█████████▍| 4814/5091 [00:04<00:00, 1077.65it/s] 97%|█████████▋| 4922/5091 [00:04<00:00, 1073.23it/s] 99%|█████████▉| 5031/5091 [00:04<00:00, 1073.15it/s]100%|██████████| 5091/5091 [00:04<00:00, 1126.43it/s]
2026-05-28 11:55:11.587918: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:27.706650: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:55:27.708023: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:55:27.739734: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:ca:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:55:27.739843: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:27.755398: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:27.755522: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:27.761968: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:55:27.768348: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:55:27.777527: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:55:27.783794: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:55:27.789349: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:27.789814: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:55:27.790271: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:55:27.790429: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:55:27.790680: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:ca:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:55:27.790715: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:27.790730: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:27.790744: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:27.790756: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:55:27.790769: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:55:27.790781: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:55:27.790794: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:55:27.790824: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:27.791109: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:55:27.791140: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:28.214023: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:55:28.214089: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:55:28.214099: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:55:28.214787: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:ca:00.0, compute capability: 8.9)
2026-05-28 11:55:28.255381: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2026-05-28 11:55:28.270883: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:55:32.341473: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:32.832271: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:32.836379: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:34.321259: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:55:34.395274: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 11:55:48.690735: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
