Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------

The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2026-01-05 00:02:45.668229: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:04.361552: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:05:04.375451: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-01-05 00:05:04.467978: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:05:04.468049: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:05.452845: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:05:05.452922: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:05:05.993370: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:05:06.714521: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:05:07.491143: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:05:07.896110: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:05:08.017343: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:05:08.017754: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:05:08.018173: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-01-05 00:05:08.019822: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:05:08.020157: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:05:08.020184: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:08.020201: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:05:08.020211: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:05:08.020221: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:05:08.020230: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:05:08.020239: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:05:08.020249: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:05:08.020269: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:05:08.020557: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:05:08.020578: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:05:10.934128: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-01-05 00:05:10.934215: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-01-05 00:05:10.934227: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-01-05 00:05:10.934891: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-01-05 00:05:24.061665: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-01-05 00:05:24.062280: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-01-05 00:05:25.176581: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:05:26.900104: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:05:26.905591: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:05:41.302848: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-01-05 00:05:41.382074: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-01-05 00:06:04.787028: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2026-01-05 00:15:56.352610: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2026-01-05 00:15:59.791944: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:05.553269: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:16:05.554086: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-01-05 00:16:05.579795: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:16:05.579843: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:05.583641: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:16:05.583725: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:16:05.585598: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:16:05.586780: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:16:05.589724: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:16:05.591231: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:16:05.592291: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:16:05.592667: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:16:05.592890: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-01-05 00:16:05.594357: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:16:05.594614: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:16:05.594636: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:05.594651: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:16:05.594661: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:16:05.594671: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:16:05.594686: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:16:05.594696: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:16:05.594705: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:16:05.594715: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:16:05.594997: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:16:05.595018: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:16:06.003087: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-01-05 00:16:06.003166: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-01-05 00:16:06.003176: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-01-05 00:16:06.003843: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/78 [00:00<?, ?it/s]
2026-01-05 00:16:07.521511: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2026-01-05 00:16:07.522032: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-01-05 00:16:07.715526: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:16:08.204704: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:16:08.207277: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:16:09.699621: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-01-05 00:16:09.775922: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch: 100%|██████████| 78/78 [00:54<00:00,  1.43it/s]
100%|██████████| 4945/4945 [00:04<00:00, 1110.90it/s]
100%|██████████| 4945/4945 [00:04<00:00, 1124.68it/s]
2026-01-05 00:17:24.505153: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:17:30.780460: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:17:30.781607: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2026-01-05 00:17:30.808996: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:17:30.809080: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:17:30.813131: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:17:30.813241: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:17:30.815174: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:17:30.816370: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:17:30.819438: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:17:30.820990: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:17:30.822053: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:17:30.822460: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:17:30.823107: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2026-01-05 00:17:30.824759: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2026-01-05 00:17:30.825066: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2026-01-05 00:17:30.825103: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:17:30.825120: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:17:30.825133: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:17:30.825147: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2026-01-05 00:17:30.825159: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2026-01-05 00:17:30.825172: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2026-01-05 00:17:30.825184: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2026-01-05 00:17:30.825197: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:17:30.825491: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2026-01-05 00:17:30.825531: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2026-01-05 00:17:31.235653: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2026-01-05 00:17:31.235741: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2026-01-05 00:17:31.235752: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2026-01-05 00:17:31.236495: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2026-01-05 00:17:31.278732: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2026-01-05 00:17:31.293130: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2026-01-05 00:17:35.340284: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2026-01-05 00:17:35.846010: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2026-01-05 00:17:35.850704: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2026-01-05 00:17:37.277046: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2026-01-05 00:17:37.350959: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2026-01-05 00:17:51.308696: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
