Commit Graph

1180 Commits

Author SHA1 Message Date
AlexeyAB ce2e0eff00 DEBUG=1 fixed 2019-02-05 00:36:17 +03:00
AlexeyAB d767e8ca38 Minor fixes 2019-02-04 23:29:06 +03:00
AlexeyAB 5446d19576 Checks Compute Capability and forcibly disables Tensor Cores for CC < 7.0 2019-02-04 23:28:40 +03:00
AlexeyAB f7cb538b32 Compile fix 2019-02-03 00:37:00 +03:00
AlexeyAB 584f840b40 CUDA_CHECK definition for debug 2019-02-03 00:19:04 +03:00
AlexeyAB 61156239e0 Minor performance improvement 2019-02-03 00:18:30 +03:00
AlexeyAB dc7e7f035d improve XNOR Tensor Cores GEMM - N 2x unrolled - minor performance improvement 2019-02-02 17:57:30 +03:00
AlexeyAB 41814fc4b3 Minor fixes 2019-02-02 15:16:57 +03:00
AlexeyAB ff0733ed40 Speedup repack_input_kernel_bin() 2019-02-02 15:16:25 +03:00
AlexeyAB 2d747cab2b Minor fixes 2019-02-02 03:16:30 +03:00
AlexeyAB f91d5a5e09 Fixed __shfl() and __ballot() warnings 2019-02-02 03:16:05 +03:00
Alexey e1ec8a8b07
Update Readme.md 2019-02-02 00:58:09 +03:00
AlexeyAB f09a9c3315 XNOR uses Tensor Cores on Turing GPU CC>=7.3 (not Volta) 2019-02-02 00:24:34 +03:00
AlexeyAB e17bd9ba8f Minor fix 2019-02-01 01:32:26 +03:00
AlexeyAB a607784626 Added crnn.train.cfg just for test 2019-02-01 01:32:03 +03:00
AlexeyAB c7309c1fdb Fixed CRNN (RNN based on Convolution) layer 2019-02-01 01:30:02 +03:00
AlexeyAB bd91d0a908 Add try-catch to the http_stream.cpp 2019-01-31 14:33:05 +03:00
AlexeyAB c71354ab2e Added cudaGetLastError() for cudaHostAlloc() to reset last cuda error 2019-01-31 14:22:07 +03:00
AlexeyAB 381f90ebb8 Fixed CUDA error checking 2019-01-29 13:46:30 +03:00
AlexeyAB 2790464de1 Another compile fix 2019-01-29 00:11:32 +03:00
AlexeyAB ae8a8e6016 Compile fix 2019-01-29 00:05:08 +03:00
AlexeyAB 640bdbc063 LSTM, RNN, GRU - use connected_layer that uses cuDNN. Fixed CRNN for conv-layer with cuDNN. 2019-01-28 23:50:51 +03:00
AlexeyAB 0e1f3eaf35 Fixed DLL/SO 2019-01-28 20:32:30 +03:00
AlexeyAB 3692c174c5 Compile fix 2019-01-28 20:25:14 +03:00
AlexeyAB 110b5240a4 Fixed LSTM-layer 2019-01-28 20:22:14 +03:00
AlexeyAB 85b99872cb Use non-default stream for all CUDA-functions 2019-01-28 20:19:26 +03:00
AlexeyAB 00b87281f3 Fixed RNN (RNN, GRU, LSTM) with cuDNN (batch-norm) 2019-01-27 03:42:44 +03:00
AlexeyAB 9576cd4d89 Fixed memory allocation 2019-01-26 23:25:09 +03:00
AlexeyAB 090d934c0f Minor speedup on CPU 2019-01-26 19:12:46 +03:00
AlexeyAB 630f441e08 Minor CPU speedup - i7 6500K: 1000ms (AVX=1) instead of 1500ms (old AVX=1) and 2000ms (AVX=0) 2019-01-26 02:54:41 +03:00
AlexeyAB 1b15e2f8df Compile fix on Windows 2019-01-24 20:30:15 +03:00
Alexey da044776d1
Merge pull request #2282 from davidssmith/master
add LSTM layer
2019-01-24 20:19:57 +03:00
AlexeyAB a7366a5a0a Compile fix for CC < 7.3 2019-01-24 20:19:01 +03:00
David Smith 96773df469 add lstm_layer.o to Makefile 2019-01-24 09:38:45 -06:00
David Smith 5e778cd91e add LSTM layer 2019-01-23 22:02:09 -06:00
Alexey 29aa716bd9
Update Readme.md 2019-01-23 18:04:31 +03:00
AlexeyAB 2d3220cef5 Look at wmma::bmma_sync(), bmmaBitOpXOR, bmmaAccumulateOpPOPC 2019-01-23 00:35:44 +03:00
Alexey b47db904ee
Merge pull request #2272 from Sauraus/master
gcc on OSX required explicit return value for empty (char *) in detec…
2019-01-22 21:45:38 +03:00
Antek S. Baranski 8960fbfb3f gcc on OSX required explicit return value for empty (char *) in detection_to_json 2019-01-22 10:17:42 -08:00
AlexeyAB 2cd37ec73e Another minor fix 2019-01-22 17:31:07 +03:00
AlexeyAB 0541428f78 Merge branch 'master' of github.com:AlexeyAB/darknet 2019-01-22 16:24:22 +03:00
AlexeyAB 46be08db37 Minor fix 2019-01-22 16:23:44 +03:00
Alexey ec9b989c0a
Update Readme.md 2019-01-21 22:40:56 +03:00
AlexeyAB 81f7fc2c7b Fixed network resize memory allocation 2019-01-21 15:32:57 +03:00
AlexeyAB 226322523e Fixed calc_anchors 2019-01-21 14:54:51 +03:00
Alexey 8b7494d920
Update Readme.md 2019-01-21 02:38:58 +03:00
AlexeyAB 17019854c3 XNOR minor fix 2019-01-19 03:18:50 +03:00
AlexeyAB 6e99e852ff Network resize is fixed 2019-01-18 23:30:10 +03:00
AlexeyAB 0e022d0912 Fixed timer 2019-01-18 21:29:06 +03:00
AlexeyAB 4ed6fd1ada Fix for compilation on Google Colab 2019-01-18 19:54:05 +03:00