Lmod Warning:
-------------------------------------------------------------------------------
The following dependent module(s) are not currently loaded: curl/8.4.0
(required by: htslib/1.16)
-------------------------------------------------------------------------------




The following have been reloaded with a version change:
  1) curl/8.4.0 => curl/8.17.0     2) openssl/3.0.7 => openssl/3.6.0

2025-11-17 17:54:08.460868: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:54:26.368949: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2025-11-17 17:54:26.371600: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2025-11-17 17:54:26.395454: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2025-11-17 17:54:26.395504: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:54:26.946825: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 17:54:26.946896: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 17:54:27.332731: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2025-11-17 17:54:27.882839: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2025-11-17 17:54:28.423873: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2025-11-17 17:54:28.678388: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2025-11-17 17:54:28.821210: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 17:54:28.821606: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2025-11-17 17:54:28.822039: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2025-11-17 17:54:28.823675: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2025-11-17 17:54:28.823933: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2025-11-17 17:54:28.823955: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:54:28.823968: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 17:54:28.823977: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 17:54:28.823986: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2025-11-17 17:54:28.823996: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2025-11-17 17:54:28.824005: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2025-11-17 17:54:28.824014: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2025-11-17 17:54:28.824033: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 17:54:28.824324: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2025-11-17 17:54:28.824345: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:54:31.233670: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2025-11-17 17:54:31.233754: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2025-11-17 17:54:31.233765: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2025-11-17 17:54:31.234434: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2025-11-17 17:54:43.805886: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2025-11-17 17:54:43.806475: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2025-11-17 17:54:45.011800: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 17:54:46.816511: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 17:54:46.821718: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 17:55:00.338976: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2025-11-17 17:55:00.481119: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2025-11-17 17:55:24.780495: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'jitters', 'index', 'status', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
2025-11-17 17:58:58.286721: W tensorflow/python/util/util.cc:348] Sets are not currently considered sequences, but this may change in the future, so consider avoiding using them.
2025-11-17 17:59:02.568237: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:59:12.240772: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2025-11-17 17:59:12.241683: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2025-11-17 17:59:12.268736: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2025-11-17 17:59:12.268786: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:59:12.275474: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 17:59:12.275561: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 17:59:12.278434: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2025-11-17 17:59:12.280837: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2025-11-17 17:59:12.285441: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2025-11-17 17:59:12.288614: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2025-11-17 17:59:12.290828: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 17:59:12.291214: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2025-11-17 17:59:12.291448: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2025-11-17 17:59:12.292934: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2025-11-17 17:59:12.293193: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2025-11-17 17:59:12.293216: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:59:12.293230: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 17:59:12.293240: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 17:59:12.293250: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2025-11-17 17:59:12.293260: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2025-11-17 17:59:12.293270: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2025-11-17 17:59:12.293279: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2025-11-17 17:59:12.293290: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 17:59:12.293573: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2025-11-17 17:59:12.293594: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 17:59:12.725441: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2025-11-17 17:59:12.725531: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2025-11-17 17:59:12.725540: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2025-11-17 17:59:12.726217: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
batch:   0%|          | 0/36 [00:00<?, ?it/s]2025-11-17 17:59:14.318733: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2025-11-17 17:59:14.319326: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2025-11-17 17:59:14.530536: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 17:59:15.039662: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 17:59:15.041901: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 17:59:16.670617: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2025-11-17 17:59:16.802853: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py:595: UserWarning: Input dict contained keys ['coordinates', 'true_profiles', 'true_logcounts', 'rev_comp'] which did not match any model input. They will be ignored by the model.
  [n for n in tensors.keys() if n not in ref_input_names])
batch:   3%|▎         | 1/36 [00:29<17:05, 29.30s/it]batch:   6%|▌         | 2/36 [00:29<06:55, 12.21s/it]batch:   8%|▊         | 3/36 [00:29<03:42,  6.74s/it]batch:  11%|█         | 4/36 [00:30<02:13,  4.17s/it]batch:  14%|█▍        | 5/36 [00:30<01:25,  2.76s/it]batch:  17%|█▋        | 6/36 [00:30<00:56,  1.90s/it]batch:  19%|█▉        | 7/36 [00:30<00:39,  1.35s/it]batch:  22%|██▏       | 8/36 [00:30<00:27,  1.00it/s]batch:  25%|██▌       | 9/36 [00:31<00:20,  1.31it/s]batch:  28%|██▊       | 10/36 [00:31<00:15,  1.67it/s]batch:  31%|███       | 11/36 [00:31<00:12,  2.05it/s]batch:  33%|███▎      | 12/36 [00:31<00:09,  2.43it/s]batch:  36%|███▌      | 13/36 [00:32<00:08,  2.79it/s]batch:  39%|███▉      | 14/36 [00:32<00:07,  3.13it/s]batch:  42%|████▏     | 15/36 [00:32<00:06,  3.39it/s]batch:  44%|████▍     | 16/36 [00:32<00:05,  3.61it/s]batch:  47%|████▋     | 17/36 [00:33<00:05,  3.77it/s]batch:  50%|█████     | 18/36 [00:33<00:04,  3.93it/s]batch:  53%|█████▎    | 19/36 [00:33<00:04,  4.01it/s]batch:  56%|█████▌    | 20/36 [00:33<00:03,  4.07it/s]batch:  58%|█████▊    | 21/36 [00:34<00:03,  4.12it/s]batch:  61%|██████    | 22/36 [00:34<00:03,  4.17it/s]batch:  64%|██████▍   | 23/36 [00:34<00:03,  4.21it/s]batch:  67%|██████▋   | 24/36 [00:34<00:02,  4.23it/s]batch:  69%|██████▉   | 25/36 [00:34<00:02,  4.25it/s]batch:  72%|███████▏  | 26/36 [00:35<00:02,  4.25it/s]batch:  75%|███████▌  | 27/36 [00:35<00:02,  4.26it/s]batch:  78%|███████▊  | 28/36 [00:35<00:01,  4.28it/s]batch:  81%|████████  | 29/36 [00:35<00:01,  4.26it/s]batch:  83%|████████▎ | 30/36 [00:36<00:01,  4.25it/s]batch:  86%|████████▌ | 31/36 [00:36<00:01,  4.26it/s]batch:  89%|████████▉ | 32/36 [00:36<00:00,  4.24it/s]batch:  92%|█████████▏| 33/36 [00:36<00:00,  4.26it/s]batch:  94%|█████████▍| 34/36 [00:37<00:00,  4.25it/s]batch:  97%|█████████▋| 35/36 [00:37<00:00,  4.27it/s]batch: 100%|██████████| 36/36 [00:37<00:00,  1.04s/it]
  0%|          | 0/2255 [00:00<?, ?it/s]  5%|▌         | 119/2255 [00:00<00:01, 1188.65it/s] 11%|█         | 238/2255 [00:00<00:01, 1186.40it/s] 16%|█▌        | 357/2255 [00:00<00:01, 1175.87it/s] 21%|██        | 475/2255 [00:00<00:01, 1174.90it/s] 26%|██▋       | 593/2255 [00:00<00:01, 1161.27it/s] 31%|███▏      | 710/2255 [00:00<00:01, 1158.02it/s] 37%|███▋      | 826/2255 [00:00<00:01, 1146.36it/s] 42%|████▏     | 941/2255 [00:00<00:01, 1144.76it/s] 47%|████▋     | 1056/2255 [00:00<00:01, 1135.39it/s] 52%|█████▏    | 1170/2255 [00:01<00:00, 1129.01it/s] 57%|█████▋    | 1283/2255 [00:01<00:00, 1121.91it/s] 62%|██████▏   | 1396/2255 [00:01<00:00, 1116.52it/s] 67%|██████▋   | 1508/2255 [00:01<00:00, 1109.84it/s] 72%|███████▏  | 1619/2255 [00:01<00:00, 1099.98it/s] 77%|███████▋  | 1730/2255 [00:01<00:00, 1093.19it/s] 82%|████████▏ | 1840/2255 [00:01<00:00, 1085.59it/s] 86%|████████▋ | 1949/2255 [00:01<00:00, 1073.96it/s] 91%|█████████ | 2057/2255 [00:01<00:00, 1069.89it/s] 96%|█████████▌| 2164/2255 [00:01<00:00, 1062.38it/s]100%|██████████| 2255/2255 [00:02<00:00, 1110.41it/s]
  0%|          | 0/2255 [00:00<?, ?it/s]  5%|▌         | 119/2255 [00:00<00:01, 1187.71it/s] 11%|█         | 238/2255 [00:00<00:01, 1185.96it/s] 16%|█▌        | 357/2255 [00:00<00:01, 1175.55it/s] 21%|██        | 475/2255 [00:00<00:01, 1175.48it/s] 26%|██▋       | 593/2255 [00:00<00:01, 1161.56it/s] 31%|███▏      | 710/2255 [00:00<00:01, 1158.12it/s] 37%|███▋      | 826/2255 [00:00<00:01, 1151.64it/s] 42%|████▏     | 942/2255 [00:00<00:01, 1144.22it/s] 47%|████▋     | 1057/2255 [00:00<00:01, 1114.67it/s] 52%|█████▏    | 1169/2255 [00:01<00:00, 1111.19it/s] 57%|█████▋    | 1281/2255 [00:01<00:00, 1111.71it/s] 62%|██████▏   | 1393/2255 [00:01<00:00, 1108.00it/s] 67%|██████▋   | 1504/2255 [00:01<00:00, 1098.96it/s] 72%|███████▏  | 1614/2255 [00:01<00:00, 1094.86it/s] 76%|███████▋  | 1724/2255 [00:01<00:00, 1088.00it/s] 81%|████████▏ | 1833/2255 [00:01<00:00, 1083.12it/s] 86%|████████▌ | 1942/2255 [00:01<00:00, 1072.92it/s] 91%|█████████ | 2050/2255 [00:01<00:00, 1064.00it/s] 96%|█████████▌| 2157/2255 [00:01<00:00, 1057.08it/s]100%|██████████| 2255/2255 [00:02<00:00, 1105.59it/s]
2025-11-17 18:00:09.318949: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 18:00:18.071466: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2025-11-17 18:00:18.072659: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1
2025-11-17 18:00:18.101571: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2025-11-17 18:00:18.101654: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 18:00:18.106876: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 18:00:18.106984: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 18:00:18.109372: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2025-11-17 18:00:18.111135: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2025-11-17 18:00:18.115022: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2025-11-17 18:00:18.117148: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2025-11-17 18:00:18.118646: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 18:00:18.119064: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2025-11-17 18:00:18.119747: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2025-11-17 18:00:18.121348: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set
2025-11-17 18:00:18.121673: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties: 
pciBusID: 0000:61:00.0 name: NVIDIA L40S computeCapability: 8.9
coreClock: 2.52GHz coreCount: 142 deviceMemorySize: 44.52GiB deviceMemoryBandwidth: 804.75GiB/s
2025-11-17 18:00:18.121710: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 18:00:18.121727: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 18:00:18.121741: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 18:00:18.121754: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10
2025-11-17 18:00:18.121767: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10
2025-11-17 18:00:18.121780: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10
2025-11-17 18:00:18.121793: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.11
2025-11-17 18:00:18.121806: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 18:00:18.122103: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0
2025-11-17 18:00:18.122135: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
2025-11-17 18:00:18.572668: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix:
2025-11-17 18:00:18.572756: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267]      0 
2025-11-17 18:00:18.572767: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0:   N 
2025-11-17 18:00:18.573524: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 42313 MB memory) -> physical GPU (device: 0, name: NVIDIA L40S, pci bus id: 0000:61:00.0, compute capability: 8.9)
2025-11-17 18:00:18.617092: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:196] None of the MLIR optimization passes are enabled (registered 0 passes)
2025-11-17 18:00:18.631969: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 2500000000 Hz
2025-11-17 18:00:21.113967: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2025-11-17 18:00:21.636170: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
2025-11-17 18:00:21.640458: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.8
2025-11-17 18:00:23.202975: W tensorflow/stream_executor/gpu/asm_compiler.cc:63] Running ptxas --version returned 256
2025-11-17 18:00:23.322567: W tensorflow/stream_executor/gpu/redzone_allocator.cc:314] Internal: ptxas exited with non-zero error code 256, output: 
Relying on driver to perform ptx compilation. 
Modify $PATH to customize ptxas location.
This message will be only logged once.
2025-11-17 18:00:43.457896: I tensorflow/stream_executor/cuda/cuda_blas.cc:1838] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd
/home/users/shouvikm/miniconda3/envs/bpnet/lib/python3.7/site-packages/tensorflow/python/keras/layers/core.py:1059: UserWarning: bpnet.model.arch is not loaded, but a Lambda layer uses it. It may cause errors.
  , UserWarning)
