Comment
Author: Admin | 2025-04-27
Hi,Thanks for sharing this great work. I'm trying to run the samples on smaller gpu: GTX1060 6Gb.The "Einstein" examples runs fine, but when I run the fox example I get:32--[FullyFusedMLP(neurons=64,layers=3)]-->113:49:05 INFO Color model: 3--[SphericalHarmonics]-->16+16--[FullyFusedMLP(neurons=64,layers=4)]-->313:49:05 INFO total_encoding_params=13074912 total_network_params=972813:49:06 ERROR Uncaught exception: ***\dependencies\tiny-cuda-nn\include\tiny-cuda-nn/gpu_memory.h:531 cuMemSetAccess(m_base_address + m_size, n_bytes_to_allocate, &access_desc, 1) failed with error CUDA_ERROR_OUT_OF_MEMORYCould not free memory: ***\dependencies\tiny-cuda-nn\include\tiny-cuda-nn/gpu_memory.h:452 cuMemAddressFree(m_base_address, m_max_size) failed with error CUDA_ERROR_INVALID_VALUE">13:49:05 SUCCESS Loaded 50 images of size 1080x1920 after 1s13:49:05 INFO cam_aabb=[min=[1.0229,-1.33309,-0.378748], max=[2.46175,1.00721,1.41295]]13:49:05 INFO Loading network config from: configs\nerf\base.json13:49:05 INFO GridEncoding: Nmin=16 b=1.51572 F=2 T=2^19 L=16Warning: FullyFusedMLP is not supported for the selected architecture 61. Falling back to CutlassMLP. For maximum performance, raise the target GPU architecture to 75+.Warning: FullyFusedMLP is not supported for the selected architecture 61. Falling back to CutlassMLP. For maximum performance, raise the target GPU architecture to 75+.13:49:05 INFO Density model: 3--[HashGrid]-->32--[FullyFusedMLP(neurons=64,layers=3)]-->113:49:05 INFO Color model: 3--[SphericalHarmonics]-->16+16--[FullyFusedMLP(neurons=64,layers=4)]-->313:49:05 INFO total_encoding_params=13074912 total_network_params=972813:49:06 ERROR Uncaught exception: ***\dependencies\tiny-cuda-nn\include\tiny-cuda-nn/gpu_memory.h:531 cuMemSetAccess(m_base_address + m_size, n_bytes_to_allocate, &access_desc, 1) failed with error CUDA_ERROR_OUT_OF_MEMORYCould not free memory: ***\dependencies\tiny-cuda-nn\include\tiny-cuda-nn/gpu_memory.h:452 cuMemAddressFree(m_base_address, m_max_size) failed with error CUDA_ERROR_INVALID_VALUEIs it still possible to run this example with some modified parameters for gpu's with lower memory, or should I give up?Small note:atomicAdd(__half2) is also not supported on my architecture (=61). I needed to disable it in "common_device.cuh". You must be logged in to vote When building I get:, N_DIMS, 1> &, T *, T *, const Eigen::Vector2i &, const Eigen::Vector2f &) [with N_DIMS=2U, T=float]" D:\***\src\testbed_nerf.cu(1512): hereD:\***\include\neural-graphics-primitives/common_device.cuh(128): error : no instance of overloaded function "atomicAdd" matches the argument list [D:***\build\ngp.vcxproj] argument types are: (__half2 *, {...}) detected during instantiation of "void ngp::deposit_image_gradient(const Eigen::Matrix, N_DIMS, 1> &, T *, T *, const Eigen::Vector2i &, const Eigen::Vector2f &) [with N_DIMS=2U, T=float]" D:\***\src\testbed_nerf.cu(1512): here">D:***\include\neural-graphics-primitives/common_device.cuh(127): error : no instance of overloaded function "atomicAdd" matches the argument list [D:\***\build\ngp.vcxproj] argument types are: (__half2 *, {...}) detected during instantiation of "void ngp::deposit_image_gradient(const Eigen::Matrix, N_DIMS, 1> &, T *, T *, const Eigen::Vector2i &, const Eigen::Vector2f &) [with N_DIMS=2U, T=float]" D:\***\src\testbed_nerf.cu(1512): hereD:\***\include\neural-graphics-primitives/common_device.cuh(128): error : no instance of overloaded function "atomicAdd" matches the argument list [D:***\build\ngp.vcxproj]… View full answer Hi there,you might be able to further squeeze down the memory usage by reducing the resolution --width 1280 --height 720, but I'm unsure this will be enough.Regarding atomicAdd(__half2): I'm surprised actually. How does this error manifest? I'd like to make this codebase work on as wide a range of GPUs as possible and both the CUDA documentation and CI suggest it should work on compute capability 61. You must be logged in to vote 0 replies When building I get:, N_DIMS, 1> &, T *, T *, const Eigen::Vector2i &, const Eigen::Vector2f &) [with N_DIMS=2U, T=float]" D:\***\src\testbed_nerf.cu(1512): hereD:\***\include\neural-graphics-primitives/common_device.cuh(128): error : no instance of overloaded function "atomicAdd" matches the argument list [D:***\build\ngp.vcxproj] argument types
Add Comment