1112 Commits (cbc1c6d8cad7ac04be95fa947dd25fb3e0c01d6e)
 

Author SHA1 Message Date
Stefano Sinigardi cbc1c6d8ca improve CMake alternative, add instructions in README 6 years ago
Stefano Sinigardi c88ae98289 add CMakeLists.txt and necessary modules 6 years ago
AlexeyAB 3d9c8530a0 Use Tensor Cores only when (channels % 8 == 0) and (filters % 8 == 0) 6 years ago
AlexeyAB 28106c0fd8 Optimized memory allocation for XNOR on CPU 6 years ago
AlexeyAB 449fcfed75 Fix for GCC on ARM 32/64-bit 6 years ago
AlexeyAB 00e992a600 Compile fix 6 years ago
AlexeyAB 5448e07445 Try to fuse conv_xnor+shortcut -> conv_xnor 6 years ago
AlexeyAB 9e138adf09 more accurate time measurements in Demo 6 years ago
AlexeyAB 7dff7365cb Minor demo fix 6 years ago
AlexeyAB f154d2070a Fixed RNN (LSTM, RNN, CRNN, GRU) for CUDNN_HALF=1 6 years ago
AlexeyAB 9e07605bc5 get_connected_workspace_size() and get_convolutional_workspace_size() 6 years ago
AlexeyAB 6832290eee Fixed set_batch_network(), when workspace larger for smaller batch 6 years ago
AlexeyAB 58de6b2d3d Minor fix for CHECK_CUDA() 6 years ago
AlexeyAB 98103552fb Minor fix 6 years ago
AlexeyAB 6c28da5def Draw top5 accuracy on the Loss-chart for training Classifier 6 years ago
AlexeyAB fc663f6efe Another minor fix 6 years ago
AlexeyAB 9bb7455a0e Minor fix 6 years ago
Alexey c999f53e9d
Merge pull request #2359 from aughey/master 6 years ago
AlexeyAB 7587d47c46 Partial fixed 6 years ago
AlexeyAB 0543278a5b Partial fixed 6 years ago
John Aughey 022ce74fe9 Rewriting darknet_video.py to reuse darknet.py as a lib 6 years ago
Alexey 64b217aa86
Update Readme.md 6 years ago
AlexeyAB c50b0e0c8a Minor Python and C API improvement 6 years ago
Alexey b76f1c0006
Merge pull request #2352 from aughey/master 6 years ago
AlexeyAB 285088adc4 Fixed checking CC for enabling Tensor Cores 6 years ago
AlexeyAB e1bbeb8367 CUDNN_HALF and CC 7.5 by default in darknet.sln 6 years ago
AlexeyAB fa1415e3c2 CUDNN_HALF and CC 7.5 by default in darknet.sln 6 years ago
John Aughey c00d3c92db Making a fast API compatible way to copy image data. 6 years ago
John Aughey 7e9416aa80 Making a pointer version of network_predict for python. 6 years ago
John Aughey 8726d7b0db Optimizing network_predict_image to resize only if necessary. 6 years ago
AlexeyAB edfdf2c20e Fixed bug in Tensor Cores training 6 years ago
AlexeyAB 12b6e93893 CHECK_CUDA is used everywhere 6 years ago
AlexeyAB ce2e0eff00 DEBUG=1 fixed 6 years ago
AlexeyAB d767e8ca38 Minor fixes 6 years ago
AlexeyAB 5446d19576 Checks Compute Capability and forcibly disables Tensor Cores for CC < 7.0 6 years ago
AlexeyAB f7cb538b32 Compile fix 6 years ago
AlexeyAB 584f840b40 CUDA_CHECK definition for debug 6 years ago
AlexeyAB 61156239e0 Minor performance improvement 6 years ago
AlexeyAB dc7e7f035d improve XNOR Tensor Cores GEMM - N 2x unrolled - minor performance improvement 6 years ago
AlexeyAB 41814fc4b3 Minor fixes 6 years ago
AlexeyAB ff0733ed40 Speedup repack_input_kernel_bin() 6 years ago
AlexeyAB 2d747cab2b Minor fixes 6 years ago
AlexeyAB f91d5a5e09 Fixed __shfl() and __ballot() warnings 6 years ago
Alexey e1ec8a8b07
Update Readme.md 6 years ago
AlexeyAB f09a9c3315 XNOR uses Tensor Cores on Turing GPU CC>=7.3 (not Volta) 6 years ago
AlexeyAB e17bd9ba8f Minor fix 6 years ago
AlexeyAB a607784626 Added crnn.train.cfg just for test 6 years ago
AlexeyAB c7309c1fdb Fixed CRNN (RNN based on Convolution) layer 6 years ago
AlexeyAB bd91d0a908 Add try-catch to the http_stream.cpp 6 years ago
AlexeyAB c71354ab2e Added cudaGetLastError() for cudaHostAlloc() to reset last cuda error 6 years ago