Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------




The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2026-05-28 11:41:15.089453: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:33.724922: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:43:33.735180: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:43:33.756633: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:43:33.756683: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:34.342531: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:43:34.342617: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:43:34.833603: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:43:35.231029: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:43:35.784686: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:43:35.967370: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:43:36.155477: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:43:36.155887: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:43:36.156260: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:43:36.156398: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:43:36.156607: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:43:36.156625: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:36.156638: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:43:36.156648: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:43:36.156657: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:43:36.156666: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:43:36.156674: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:43:36.156683: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:43:36.156705: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:43:36.156978: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:43:36.156997: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:43:38.407083: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:43:38.407161: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:43:38.407172: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:43:38.407820: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:4a:00.0, compute capability: 8.9)
2026-05-28 11:43:40.285357: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 11:43:40.285902: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:43:45.232637: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:43:47.269438: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:43:47.273891: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:44:01.696823: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:44:01.783084: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 11:44:21.964033: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2026-05-28 11:53:50.643365: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2026-05-28 11:53:54.759900: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:54:03.414313: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:54:03.415179: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:54:03.442821: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:54:03.442871: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:54:03.449226: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:54:03.449310: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:54:03.452120: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:54:03.454583: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:54:03.458546: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:54:03.461166: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:54:03.463053: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:54:03.463436: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:54:03.463660: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:54:03.463802: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:54:03.464050: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:54:03.464117: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:54:03.464144: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:54:03.464161: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:54:03.464171: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:54:03.464181: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:54:03.464191: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:54:03.464201: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:54:03.464212: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:54:03.464528: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:54:03.464551: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:54:03.876767: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:54:03.876868: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:54:03.876877: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:54:03.877537: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:4a:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/92 [00:00<?, ?it/s]
2026-05-28 11:54:05.481329: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-05-28 11:54:05.481792: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:54:05.674070: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:54:06.190320: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:54:06.192647: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:54:07.778040: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:54:07.869310: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch: 100%|██████████| 92/92 [00:43<00:00,  2.14it/s]
100%|██████████| 5814/5814 [00:05<00:00, 1111.24it/s]
100%|██████████| 5814/5814 [00:05<00:00, 1110.81it/s]
2026-05-28 11:55:27.895608: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:44.194395: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:55:44.195713: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-05-28 11:55:44.224687: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:55:44.224747: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:44.230169: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:44.230259: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:44.232715: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:55:44.234478: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:55:44.238022: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:55:44.239936: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:55:44.241401: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:44.241790: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:55:44.242128: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-05-28 11:55:44.242273: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-05-28 11:55:44.242489: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:4a:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-05-28 11:55:44.242517: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:44.242532: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:44.242545: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:44.242558: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-05-28 11:55:44.242570: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-05-28 11:55:44.242583: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-05-28 11:55:44.242595: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-05-28 11:55:44.242608: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:44.242890: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-05-28 11:55:44.242916: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-05-28 11:55:44.652245: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-05-28 11:55:44.652350: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-05-28 11:55:44.652359: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-05-28 11:55:44.653043: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:4a:00.0, compute capability: 8.9)
2026-05-28 11:55:44.691989: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2026-05-28 11:55:44.706496: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-05-28 11:55:49.225579: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-05-28 11:55:49.704477: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-05-28 11:55:49.708252: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-05-28 11:55:51.158734: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-05-28 11:55:51.246531: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-05-28 11:56:05.984485: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
