Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------




The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2026-05-28 11:52:16.374334: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:52:50.607924: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:52:50.609290: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:52:50.640016: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:52:50.640080: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:52:50.651904: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:52:50.651972: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:52:50.657085: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:52:50.661481: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:52:50.669239: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:52:50.675724: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:52:50.679513: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:52:50.679967: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:52:50.680338: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:52:50.680500: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:52:50.680833: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:52:50.680858: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:52:50.680873: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:52:50.680883: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:52:50.680892: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:52:50.680901: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:52:50.680910: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:52:50.680919: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:52:50.680949: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:52:50.681233: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:52:50.681255: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:52:51.111646: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:52:51.111720: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:52:51.111731: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:52:51.112387: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:4a:00.0, compute capability: 8.9)
2026-05-28 11:52:52.694419: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 11:52:52.694968: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:52:57.626812: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:52:59.281164: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:52:59.286247: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:53:00.799673: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:53:00.902150: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 11:53:19.944528: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2026-05-28 12:03:36.413466: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2026-05-28 12:03:41.073917: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:03:59.881261: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:03:59.882248: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 12:03:59.910383: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:03:59.910485: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:03:59.915850: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:03:59.915950: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:03:59.918727: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:03:59.920842: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:03:59.925172: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:03:59.927333: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:03:59.929368: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:03:59.929774: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:03:59.930081: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 12:03:59.930215: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:03:59.930413: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:03:59.930432: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:03:59.930445: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:03:59.930456: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:03:59.930466: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:03:59.930476: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:03:59.930486: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:03:59.930495: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:03:59.930505: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:03:59.930786: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:03:59.930808: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:04:00.355568: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 12:04:00.355681: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 12:04:00.355691: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 12:04:00.356352: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:4a:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/92 [00:00<?, ?it/s]2026-05-28 12:04:01.909336: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 12:04:01.909864: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 12:04:02.119916: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:04:02.622090: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:04:02.623808: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:04:04.227975: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 12:04:04.332878: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch:   1%|          | 1/92 [00:25<38:44, 25.55s/it]batch:   2%|▏         | 2/92 [00:25<15:59, 10.66s/it]batch:   3%|▎         | 3/92 [00:26<08:45,  5.90s/it]batch:   4%|▍         | 4/92 [00:26<05:22,  3.67s/it]batch:   5%|▌         | 5/92 [00:26<03:31,  2.43s/it]batch:   7%|▋         | 6/92 [00:26<02:24,  1.68s/it]batch:   8%|▊         | 7/92 [00:26<01:42,  1.21s/it]batch:   9%|▊         | 8/92 [00:27<01:15,  1.11it/s]batch:  10%|▉         | 9/92 [00:27<00:57,  1.44it/s]batch:  11%|█         | 10/92 [00:27<00:45,  1.82it/s]batch:  12%|█▏        | 11/92 [00:27<00:36,  2.20it/s]batch:  13%|█▎        | 12/92 [00:28<00:31,  2.58it/s]batch:  14%|█▍        | 13/92 [00:28<00:26,  2.94it/s]batch:  15%|█▌        | 14/92 [00:28<00:24,  3.24it/s]batch:  16%|█▋        | 15/92 [00:28<00:22,  3.49it/s]batch:  17%|█▋        | 16/92 [00:29<00:20,  3.68it/s]batch:  18%|█▊        | 17/92 [00:29<00:19,  3.83it/s]batch:  20%|█▉        | 18/92 [00:29<00:18,  3.94it/s]batch:  21%|██        | 19/92 [00:29<00:18,  4.02it/s]batch:  22%|██▏       | 20/92 [00:30<00:17,  4.04it/s]batch:  23%|██▎       | 21/92 [00:30<00:17,  4.10it/s]batch:  24%|██▍       | 22/92 [00:30<00:16,  4.13it/s]batch:  25%|██▌       | 23/92 [00:30<00:16,  4.16it/s]batch:  26%|██▌       | 24/92 [00:30<00:16,  4.18it/s]batch:  27%|██▋       | 25/92 [00:31<00:16,  4.18it/s]batch:  28%|██▊       | 26/92 [00:31<00:15,  4.20it/s]batch:  29%|██▉       | 27/92 [00:31<00:15,  4.20it/s]batch:  30%|███       | 28/92 [00:31<00:15,  4.22it/s]batch:  32%|███▏      | 29/92 [00:32<00:14,  4.22it/s]batch:  33%|███▎      | 30/92 [00:32<00:14,  4.22it/s]batch:  34%|███▎      | 31/92 [00:32<00:14,  4.23it/s]batch:  35%|███▍      | 32/92 [00:32<00:14,  4.20it/s]batch:  36%|███▌      | 33/92 [00:33<00:13,  4.22it/s]batch:  37%|███▋      | 34/92 [00:33<00:13,  4.21it/s]batch:  38%|███▊      | 35/92 [00:33<00:13,  4.23it/s]batch:  39%|███▉      | 36/92 [00:33<00:13,  4.24it/s]batch:  40%|████      | 37/92 [00:34<00:13,  4.22it/s]batch:  41%|████▏     | 38/92 [00:34<00:12,  4.22it/s]batch:  42%|████▏     | 39/92 [00:34<00:12,  4.22it/s]batch:  43%|████▎     | 40/92 [00:34<00:12,  4.23it/s]batch:  45%|████▍     | 41/92 [00:35<00:11,  4.25it/s]batch:  46%|████▌     | 42/92 [00:35<00:11,  4.24it/s]batch:  47%|████▋     | 43/92 [00:35<00:11,  4.26it/s]batch:  48%|████▊     | 44/92 [00:35<00:11,  4.26it/s]batch:  49%|████▉     | 45/92 [00:35<00:11,  4.27it/s]batch:  50%|█████     | 46/92 [00:36<00:10,  4.28it/s]batch:  51%|█████     | 47/92 [00:36<00:10,  4.30it/s]batch:  52%|█████▏    | 48/92 [00:36<00:10,  4.30it/s]batch:  53%|█████▎    | 49/92 [00:36<00:10,  4.28it/s]batch:  54%|█████▍    | 50/92 [00:37<00:09,  4.28it/s]batch:  55%|█████▌    | 51/92 [00:37<00:09,  4.29it/s]batch:  57%|█████▋    | 52/92 [00:37<00:09,  4.29it/s]batch:  58%|█████▊    | 53/92 [00:37<00:09,  4.30it/s]batch:  59%|█████▊    | 54/92 [00:38<00:08,  4.28it/s]batch:  60%|█████▉    | 55/92 [00:38<00:08,  4.29it/s]batch:  61%|██████    | 56/92 [00:38<00:08,  4.28it/s]batch:  62%|██████▏   | 57/92 [00:38<00:08,  4.29it/s]batch:  63%|██████▎   | 58/92 [00:38<00:07,  4.29it/s]batch:  64%|██████▍   | 59/92 [00:39<00:07,  4.25it/s]batch:  65%|██████▌   | 60/92 [00:39<00:07,  4.27it/s]batch:  66%|██████▋   | 61/92 [00:39<00:07,  4.27it/s]batch:  67%|██████▋   | 62/92 [00:39<00:07,  4.27it/s]batch:  68%|██████▊   | 63/92 [00:40<00:06,  4.28it/s]batch:  70%|██████▉   | 64/92 [00:40<00:06,  4.29it/s]batch:  71%|███████   | 65/92 [00:40<00:06,  4.29it/s]batch:  72%|███████▏  | 66/92 [00:40<00:07,  3.70it/s]batch:  73%|███████▎  | 67/92 [00:41<00:06,  3.84it/s]batch:  74%|███████▍  | 68/92 [00:41<00:06,  3.96it/s]batch:  75%|███████▌  | 69/92 [00:41<00:05,  4.06it/s]batch:  76%|███████▌  | 70/92 [00:41<00:05,  4.11it/s]batch:  77%|███████▋  | 71/92 [00:42<00:05,  4.14it/s]batch:  78%|███████▊  | 72/92 [00:42<00:04,  4.19it/s]batch:  79%|███████▉  | 73/92 [00:42<00:04,  4.20it/s]batch:  80%|████████  | 74/92 [00:42<00:04,  4.22it/s]batch:  82%|████████▏ | 75/92 [00:43<00:04,  4.23it/s]batch:  83%|████████▎ | 76/92 [00:43<00:03,  4.23it/s]batch:  84%|████████▎ | 77/92 [00:43<00:03,  4.24it/s]batch:  85%|████████▍ | 78/92 [00:43<00:03,  4.25it/s]batch:  86%|████████▌ | 79/92 [00:44<00:03,  4.25it/s]batch:  87%|████████▋ | 80/92 [00:44<00:02,  4.25it/s]batch:  88%|████████▊ | 81/92 [00:44<00:02,  4.25it/s]batch:  89%|████████▉ | 82/92 [00:44<00:02,  4.26it/s]batch:  90%|█████████ | 83/92 [00:44<00:02,  4.26it/s]batch:  91%|█████████▏| 84/92 [00:45<00:01,  4.27it/s]batch:  92%|█████████▏| 85/92 [00:45<00:01,  4.27it/s]batch:  93%|█████████▎| 86/92 [00:45<00:01,  4.28it/s]batch:  95%|█████████▍| 87/92 [00:45<00:01,  4.29it/s]batch:  96%|█████████▌| 88/92 [00:46<00:00,  4.27it/s]batch:  97%|█████████▋| 89/92 [00:46<00:00,  4.27it/s]batch:  98%|█████████▊| 90/92 [00:46<00:00,  4.27it/s]batch:  99%|█████████▉| 91/92 [00:46<00:00,  4.44it/s]batch: 100%|██████████| 92/92 [00:46<00:00,  1.96it/s]
  0%|          | 0/5814 [00:00<?, ?it/s]  2%|▏         | 119/5814 [00:00<00:04, 1180.66it/s]  4%|▍         | 238/5814 [00:00<00:04, 1180.06it/s]  6%|▌         | 357/5814 [00:00<00:04, 1181.87it/s]  8%|▊         | 476/5814 [00:00<00:04, 1175.72it/s] 10%|█         | 594/5814 [00:00<00:04, 1171.94it/s] 12%|█▏        | 712/5814 [00:00<00:04, 1169.51it/s] 14%|█▍        | 830/5814 [00:00<00:04, 1172.24it/s] 16%|█▋        | 948/5814 [00:00<00:04, 1169.54it/s] 18%|█▊        | 1065/5814 [00:00<00:04, 1163.54it/s] 20%|██        | 1182/5814 [00:01<00:04, 1157.81it/s] 22%|██▏       | 1300/5814 [00:01<00:03, 1164.47it/s] 24%|██▍       | 1417/5814 [00:01<00:03, 1157.94it/s] 26%|██▋       | 1533/5814 [00:01<00:03, 1156.46it/s] 28%|██▊       | 1649/5814 [00:01<00:03, 1152.85it/s] 30%|███       | 1765/5814 [00:01<00:03, 1149.27it/s] 32%|███▏      | 1880/5814 [00:01<00:03, 1145.20it/s] 34%|███▍      | 1995/5814 [00:01<00:03, 1142.66it/s] 36%|███▋      | 2110/5814 [00:01<00:03, 1142.03it/s] 38%|███▊      | 2225/5814 [00:01<00:03, 1138.42it/s] 40%|████      | 2341/5814 [00:02<00:03, 1139.23it/s] 42%|████▏     | 2456/5814 [00:02<00:02, 1137.67it/s] 44%|████▍     | 2570/5814 [00:02<00:02, 1132.40it/s] 46%|████▌     | 2684/5814 [00:02<00:02, 1128.95it/s] 48%|████▊     | 2797/5814 [00:02<00:02, 1127.06it/s] 50%|█████     | 2910/5814 [00:02<00:02, 1120.69it/s] 52%|█████▏    | 3023/5814 [00:02<00:02, 1123.19it/s] 54%|█████▍    | 3136/5814 [00:02<00:02, 1122.42it/s] 56%|█████▌    | 3249/5814 [00:02<00:02, 1116.94it/s] 58%|█████▊    | 3361/5814 [00:02<00:02, 1116.57it/s] 60%|█████▉    | 3473/5814 [00:03<00:02, 1116.85it/s] 62%|██████▏   | 3585/5814 [00:03<00:02, 1113.97it/s] 64%|██████▎   | 3697/5814 [00:03<00:01, 1111.16it/s] 66%|██████▌   | 3809/5814 [00:03<00:01, 1109.13it/s] 67%|██████▋   | 3920/5814 [00:03<00:01, 1108.40it/s] 69%|██████▉   | 4031/5814 [00:03<00:01, 1102.05it/s] 71%|███████   | 4142/5814 [00:03<00:01, 1099.23it/s] 73%|███████▎  | 4253/5814 [00:03<00:01, 1099.81it/s] 75%|███████▌  | 4363/5814 [00:03<00:01, 1097.85it/s] 77%|███████▋  | 4473/5814 [00:03<00:01, 1094.50it/s] 79%|███████▉  | 4583/5814 [00:04<00:01, 1095.56it/s] 81%|████████  | 4694/5814 [00:04<00:01, 1096.67it/s] 83%|████████▎ | 4804/5814 [00:04<00:00, 1090.98it/s] 85%|████████▍ | 4914/5814 [00:04<00:00, 1089.57it/s] 86%|████████▋ | 5023/5814 [00:04<00:00, 1088.98it/s] 88%|████████▊ | 5132/5814 [00:04<00:00, 1083.54it/s] 90%|█████████ | 5241/5814 [00:04<00:00, 1078.96it/s] 92%|█████████▏| 5350/5814 [00:04<00:00, 1081.71it/s] 94%|█████████▍| 5459/5814 [00:04<00:00, 1076.67it/s] 96%|█████████▌| 5567/5814 [00:04<00:00, 1070.61it/s] 98%|█████████▊| 5675/5814 [00:05<00:00, 1072.47it/s] 99%|█████████▉| 5783/5814 [00:05<00:00, 1071.93it/s]100%|██████████| 5814/5814 [00:05<00:00, 1119.88it/s]
  0%|          | 0/5814 [00:00<?, ?it/s]  2%|▏         | 119/5814 [00:00<00:04, 1186.99it/s]  4%|▍         | 238/5814 [00:00<00:04, 1187.91it/s]  6%|▌         | 358/5814 [00:00<00:04, 1184.54it/s]  8%|▊         | 478/5814 [00:00<00:04, 1182.78it/s] 10%|█         | 597/5814 [00:00<00:04, 1181.42it/s] 12%|█▏        | 716/5814 [00:00<00:04, 1181.34it/s] 14%|█▍        | 835/5814 [00:00<00:04, 1177.59it/s] 16%|█▋        | 953/5814 [00:00<00:04, 1175.20it/s] 18%|█▊        | 1071/5814 [00:00<00:04, 1166.40it/s] 20%|██        | 1190/5814 [00:01<00:03, 1167.03it/s] 23%|██▎       | 1309/5814 [00:01<00:03, 1169.21it/s] 25%|██▍       | 1426/5814 [00:01<00:03, 1168.64it/s] 27%|██▋       | 1543/5814 [00:01<00:03, 1162.85it/s] 29%|██▊       | 1660/5814 [00:01<00:03, 1162.55it/s] 31%|███       | 1777/5814 [00:01<00:03, 1160.31it/s] 33%|███▎      | 1894/5814 [00:01<00:03, 1152.15it/s] 35%|███▍      | 2010/5814 [00:01<00:03, 1146.01it/s] 37%|███▋      | 2126/5814 [00:01<00:03, 1145.20it/s] 39%|███▊      | 2241/5814 [00:01<00:03, 1145.54it/s] 41%|████      | 2356/5814 [00:02<00:03, 1138.29it/s] 42%|████▏     | 2470/5814 [00:02<00:02, 1134.13it/s] 44%|████▍     | 2584/5814 [00:02<00:02, 1134.42it/s] 46%|████▋     | 2698/5814 [00:02<00:02, 1130.31it/s] 48%|████▊     | 2812/5814 [00:02<00:02, 1123.44it/s] 50%|█████     | 2925/5814 [00:02<00:02, 1124.49it/s] 52%|█████▏    | 3038/5814 [00:02<00:02, 1121.16it/s] 54%|█████▍    | 3151/5814 [00:02<00:02, 1122.12it/s] 56%|█████▌    | 3264/5814 [00:02<00:02, 1123.78it/s] 58%|█████▊    | 3377/5814 [00:02<00:02, 1120.72it/s] 60%|██████    | 3490/5814 [00:03<00:02, 1120.11it/s] 62%|██████▏   | 3603/5814 [00:03<00:01, 1114.31it/s] 64%|██████▍   | 3716/5814 [00:03<00:01, 1113.00it/s] 66%|██████▌   | 3828/5814 [00:03<00:01, 1109.22it/s] 68%|██████▊   | 3939/5814 [00:03<00:01, 1108.73it/s] 70%|██████▉   | 4050/5814 [00:03<00:01, 1102.81it/s] 72%|███████▏  | 4161/5814 [00:03<00:01, 1100.71it/s] 73%|███████▎  | 4272/5814 [00:03<00:01, 1099.39it/s] 75%|███████▌  | 4382/5814 [00:03<00:01, 1096.98it/s] 77%|███████▋  | 4492/5814 [00:03<00:01, 1093.92it/s] 79%|███████▉  | 4602/5814 [00:04<00:01, 1095.03it/s] 81%|████████  | 4713/5814 [00:04<00:01, 1099.00it/s] 83%|████████▎ | 4823/5814 [00:04<00:00, 1090.22it/s] 85%|████████▍ | 4933/5814 [00:04<00:00, 1086.25it/s] 87%|████████▋ | 5042/5814 [00:04<00:00, 1086.87it/s] 89%|████████▊ | 5151/5814 [00:04<00:00, 1083.86it/s] 90%|█████████ | 5260/5814 [00:04<00:00, 1085.29it/s] 92%|█████████▏| 5369/5814 [00:04<00:00, 1080.65it/s] 94%|█████████▍| 5478/5814 [00:04<00:00, 1075.13it/s] 96%|█████████▌| 5586/5814 [00:04<00:00, 1075.30it/s] 98%|█████████▊| 5694/5814 [00:05<00:00, 1070.03it/s]100%|█████████▉| 5802/5814 [00:05<00:00, 1068.68it/s]100%|██████████| 5814/5814 [00:05<00:00, 1122.50it/s]
2026-05-28 12:05:14.756840: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:05:21.927669: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:05:21.929154: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 12:05:21.956844: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:05:21.956954: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:05:21.960757: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:05:21.960858: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:05:21.962676: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:05:21.963771: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:05:21.966665: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:05:21.968056: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:05:21.968945: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:05:21.969366: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:05:21.969798: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 12:05:21.969960: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:05:21.970199: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:05:21.970229: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:05:21.970244: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:05:21.970258: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:05:21.970270: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:05:21.970283: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:05:21.970296: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:05:21.970308: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:05:21.970321: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:05:21.970613: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:05:21.970640: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:05:22.393946: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 12:05:22.394055: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 12:05:22.394065: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 12:05:22.394768: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:4a:00.0, compute capability: 8.9)
2026-05-28 12:05:22.435030: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2026-05-28 12:05:22.450032: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 12:05:27.022308: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:05:27.517561: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:05:27.521605: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:05:29.021944: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 12:05:29.127525: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 12:05:46.975116: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
