Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------




The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2026-05-28 11:54:45.363208: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:21.598384: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:55:21.599635: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:55:21.631629: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:55:21.643957: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:21.653219: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:21.653302: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:21.657460: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:55:21.661269: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:55:21.666982: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:55:21.671217: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:55:21.674250: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:21.674713: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:55:21.675097: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:55:21.675277: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:55:21.675614: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:55:21.675639: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:21.675656: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:21.675666: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:21.675675: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:55:21.675684: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:55:21.675693: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:55:21.675702: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:55:21.675727: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:21.676012: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:55:21.676033: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:22.106042: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:55:22.106143: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:55:22.106154: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:55:22.106808: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-05-28 11:55:23.786418: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 11:55:23.786926: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:55:27.661197: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:28.196959: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:28.201842: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:29.683905: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:55:29.779377: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 11:55:50.219617: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2026-05-28 12:01:58.358345: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2026-05-28 12:02:03.292578: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:02:12.331781: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:02:12.332670: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 12:02:12.360521: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:02:12.360602: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:02:12.366041: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:02:12.366165: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:02:12.368967: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:02:12.370843: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:02:12.374681: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:02:12.376809: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:02:12.378662: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:02:12.379105: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:02:12.379412: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 12:02:12.379567: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:02:12.379845: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:02:12.379912: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:02:12.379938: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:02:12.379949: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:02:12.379960: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:02:12.379970: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:02:12.379980: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:02:12.379990: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:02:12.379999: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:02:12.380322: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:02:12.380347: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:02:12.810393: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 12:02:12.810471: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 12:02:12.810480: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 12:02:12.811150: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/54 [00:00<?, ?it/s]
2026-05-28 12:02:14.344607: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 12:02:14.345107: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 12:02:14.579578: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:02:15.098407: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:02:15.100209: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:02:16.638818: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 12:02:16.754625: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch: 100%|██████████| 54/54 [00:34<00:00,  1.55it/s]
100%|██████████| 3361/3361 [00:02<00:00, 1125.52it/s]
100%|██████████| 3361/3361 [00:02<00:00, 1126.24it/s]
2026-05-28 12:03:06.327852: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:03:13.345652: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:03:13.347037: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 12:03:13.375786: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:03:13.375886: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:03:13.380666: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:03:13.380775: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:03:13.383094: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:03:13.384558: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:03:13.387869: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:03:13.389605: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:03:13.391203: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:03:13.391650: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:03:13.392076: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 12:03:13.392250: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 12:03:13.392498: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 12:03:13.392532: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:03:13.392548: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:03:13.392561: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:03:13.392574: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 12:03:13.392602: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 12:03:13.392615: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 12:03:13.392628: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 12:03:13.392641: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:03:13.392937: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 12:03:13.392963: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 12:03:13.824918: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 12:03:13.825014: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 12:03:13.825024: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 12:03:13.825743: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-05-28 12:03:13.866405: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2026-05-28 12:03:13.881271: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 12:03:19.049092: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 12:03:19.540206: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 12:03:19.544069: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 12:03:21.026496: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 12:03:21.127023: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 12:03:37.912600: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
