Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------




The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2026-05-11 03:21:36.058300: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:26:35.551022: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-11 03:26:35.557031: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-11 03:26:35.576920: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-11 03:26:35.576975: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:26:36.088239: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 03:26:36.088334: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 03:26:36.104162: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-11 03:26:36.111292: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-11 03:26:36.119326: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-11 03:26:36.124898: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-11 03:26:36.130612: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 03:26:36.131040: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-11 03:26:36.131419: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-11 03:26:36.131576: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-11 03:26:36.131929: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-11 03:26:36.131980: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:26:36.132002: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 03:26:36.132012: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 03:26:36.132021: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-11 03:26:36.132030: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-11 03:26:36.132039: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-11 03:26:36.132055: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-11 03:26:36.132081: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 03:26:36.132377: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-11 03:26:36.132398: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:26:36.546763: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-11 03:26:36.546866: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-11 03:26:36.546877: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-11 03:26:36.547518: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-05-11 03:26:38.378463: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-11 03:26:38.378993: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-11 03:26:42.893675: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 03:26:43.407805: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 03:26:43.412368: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 03:26:45.030522: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-11 03:26:45.118343: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-11 03:28:21.671859: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2026-05-11 03:43:40.155819: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2026-05-11 03:45:38.118548: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:51:42.242482: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-11 03:51:42.243504: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-11 03:51:42.300574: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-11 03:51:42.300653: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:51:42.342466: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 03:51:42.342560: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 03:51:42.363527: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-11 03:51:42.384408: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-11 03:51:42.407179: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-11 03:51:42.428462: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-11 03:51:42.449391: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 03:51:42.449812: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-11 03:51:42.450123: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-11 03:51:42.450282: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-11 03:51:42.450491: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-11 03:51:42.450511: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:51:42.450524: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 03:51:42.450535: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 03:51:42.450545: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-11 03:51:42.450555: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-11 03:51:42.450565: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-11 03:51:42.450574: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-11 03:51:42.450584: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 03:51:42.450857: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-11 03:51:42.450879: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 03:51:42.867826: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-11 03:51:42.867926: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-11 03:51:42.867936: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-11 03:51:42.868589: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/68 [00:00<?, ?it/s]
2026-05-11 03:51:44.711426: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-11 03:51:44.711915: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-11 03:51:45.180142: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 03:51:45.718920: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 03:51:45.720835: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 03:51:47.666487: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-11 03:51:47.776011: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch: 100%|██████████| 68/68 [02:47<00:00,  2.46s/it]
100%|██████████| 4312/4312 [00:03<00:00, 1148.12it/s]
100%|██████████| 4312/4312 [00:03<00:00, 1147.43it/s]
2026-05-11 04:02:20.592142: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 04:09:14.398208: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-11 04:09:14.455321: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-11 04:09:14.509539: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-11 04:09:14.509638: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 04:09:14.548983: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 04:09:14.549087: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 04:09:14.568804: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-11 04:09:14.587778: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-11 04:09:14.609648: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-11 04:09:14.630503: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-11 04:09:14.650798: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 04:09:14.651287: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-11 04:09:14.651706: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-11 04:09:14.651860: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-11 04:09:14.652147: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-11 04:09:14.652247: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 04:09:14.652269: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 04:09:14.652283: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 04:09:14.652296: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-11 04:09:14.652309: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-11 04:09:14.652322: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-11 04:09:14.652335: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-11 04:09:14.652347: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 04:09:14.652678: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-11 04:09:14.652709: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-11 04:09:15.067572: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-11 04:09:15.067678: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-11 04:09:15.067687: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-11 04:09:15.068381: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-05-11 04:09:15.108074: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2026-05-11 04:09:15.122138: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-11 04:09:18.844234: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-11 04:09:19.325657: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-11 04:09:19.329425: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-11 04:09:21.189555: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-11 04:09:21.278029: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-11 04:11:16.753506: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
