AlexeyAB
|
ce2e0eff00
|
DEBUG=1 fixed
|
2019-02-05 00:36:17 +03:00 |
AlexeyAB
|
d767e8ca38
|
Minor fixes
|
2019-02-04 23:29:06 +03:00 |
AlexeyAB
|
5446d19576
|
Checks Compute Capability and forcibly disables Tensor Cores for CC < 7.0
|
2019-02-04 23:28:40 +03:00 |
AlexeyAB
|
f7cb538b32
|
Compile fix
|
2019-02-03 00:37:00 +03:00 |
AlexeyAB
|
584f840b40
|
CUDA_CHECK definition for debug
|
2019-02-03 00:19:04 +03:00 |
AlexeyAB
|
61156239e0
|
Minor performance improvement
|
2019-02-03 00:18:30 +03:00 |
AlexeyAB
|
dc7e7f035d
|
improve XNOR Tensor Cores GEMM - N 2x unrolled - minor performance improvement
|
2019-02-02 17:57:30 +03:00 |
AlexeyAB
|
41814fc4b3
|
Minor fixes
|
2019-02-02 15:16:57 +03:00 |
AlexeyAB
|
ff0733ed40
|
Speedup repack_input_kernel_bin()
|
2019-02-02 15:16:25 +03:00 |
AlexeyAB
|
2d747cab2b
|
Minor fixes
|
2019-02-02 03:16:30 +03:00 |
AlexeyAB
|
f91d5a5e09
|
Fixed __shfl() and __ballot() warnings
|
2019-02-02 03:16:05 +03:00 |
Alexey
|
e1ec8a8b07
|
Update Readme.md
|
2019-02-02 00:58:09 +03:00 |
AlexeyAB
|
f09a9c3315
|
XNOR uses Tensor Cores on Turing GPU CC>=7.3 (not Volta)
|
2019-02-02 00:24:34 +03:00 |
AlexeyAB
|
e17bd9ba8f
|
Minor fix
|
2019-02-01 01:32:26 +03:00 |
AlexeyAB
|
a607784626
|
Added crnn.train.cfg just for test
|
2019-02-01 01:32:03 +03:00 |
AlexeyAB
|
c7309c1fdb
|
Fixed CRNN (RNN based on Convolution) layer
|
2019-02-01 01:30:02 +03:00 |
AlexeyAB
|
bd91d0a908
|
Add try-catch to the http_stream.cpp
|
2019-01-31 14:33:05 +03:00 |
AlexeyAB
|
c71354ab2e
|
Added cudaGetLastError() for cudaHostAlloc() to reset last cuda error
|
2019-01-31 14:22:07 +03:00 |
AlexeyAB
|
381f90ebb8
|
Fixed CUDA error checking
|
2019-01-29 13:46:30 +03:00 |
AlexeyAB
|
2790464de1
|
Another compile fix
|
2019-01-29 00:11:32 +03:00 |
AlexeyAB
|
ae8a8e6016
|
Compile fix
|
2019-01-29 00:05:08 +03:00 |
AlexeyAB
|
640bdbc063
|
LSTM, RNN, GRU - use connected_layer that uses cuDNN. Fixed CRNN for conv-layer with cuDNN.
|
2019-01-28 23:50:51 +03:00 |
AlexeyAB
|
0e1f3eaf35
|
Fixed DLL/SO
|
2019-01-28 20:32:30 +03:00 |
AlexeyAB
|
3692c174c5
|
Compile fix
|
2019-01-28 20:25:14 +03:00 |
AlexeyAB
|
110b5240a4
|
Fixed LSTM-layer
|
2019-01-28 20:22:14 +03:00 |
AlexeyAB
|
85b99872cb
|
Use non-default stream for all CUDA-functions
|
2019-01-28 20:19:26 +03:00 |
AlexeyAB
|
00b87281f3
|
Fixed RNN (RNN, GRU, LSTM) with cuDNN (batch-norm)
|
2019-01-27 03:42:44 +03:00 |
AlexeyAB
|
9576cd4d89
|
Fixed memory allocation
|
2019-01-26 23:25:09 +03:00 |
AlexeyAB
|
090d934c0f
|
Minor speedup on CPU
|
2019-01-26 19:12:46 +03:00 |
AlexeyAB
|
630f441e08
|
Minor CPU speedup - i7 6500K: 1000ms (AVX=1) instead of 1500ms (old AVX=1) and 2000ms (AVX=0)
|
2019-01-26 02:54:41 +03:00 |
AlexeyAB
|
1b15e2f8df
|
Compile fix on Windows
|
2019-01-24 20:30:15 +03:00 |
Alexey
|
da044776d1
|
Merge pull request #2282 from davidssmith/master
add LSTM layer
|
2019-01-24 20:19:57 +03:00 |
AlexeyAB
|
a7366a5a0a
|
Compile fix for CC < 7.3
|
2019-01-24 20:19:01 +03:00 |
David Smith
|
96773df469
|
add lstm_layer.o to Makefile
|
2019-01-24 09:38:45 -06:00 |
David Smith
|
5e778cd91e
|
add LSTM layer
|
2019-01-23 22:02:09 -06:00 |
Alexey
|
29aa716bd9
|
Update Readme.md
|
2019-01-23 18:04:31 +03:00 |
AlexeyAB
|
2d3220cef5
|
Look at wmma::bmma_sync(), bmmaBitOpXOR, bmmaAccumulateOpPOPC
|
2019-01-23 00:35:44 +03:00 |
Alexey
|
b47db904ee
|
Merge pull request #2272 from Sauraus/master
gcc on OSX required explicit return value for empty (char *) in detec…
|
2019-01-22 21:45:38 +03:00 |
Antek S. Baranski
|
8960fbfb3f
|
gcc on OSX required explicit return value for empty (char *) in detection_to_json
|
2019-01-22 10:17:42 -08:00 |
AlexeyAB
|
2cd37ec73e
|
Another minor fix
|
2019-01-22 17:31:07 +03:00 |
AlexeyAB
|
0541428f78
|
Merge branch 'master' of github.com:AlexeyAB/darknet
|
2019-01-22 16:24:22 +03:00 |
AlexeyAB
|
46be08db37
|
Minor fix
|
2019-01-22 16:23:44 +03:00 |
Alexey
|
ec9b989c0a
|
Update Readme.md
|
2019-01-21 22:40:56 +03:00 |
AlexeyAB
|
81f7fc2c7b
|
Fixed network resize memory allocation
|
2019-01-21 15:32:57 +03:00 |
AlexeyAB
|
226322523e
|
Fixed calc_anchors
|
2019-01-21 14:54:51 +03:00 |
Alexey
|
8b7494d920
|
Update Readme.md
|
2019-01-21 02:38:58 +03:00 |
AlexeyAB
|
17019854c3
|
XNOR minor fix
|
2019-01-19 03:18:50 +03:00 |
AlexeyAB
|
6e99e852ff
|
Network resize is fixed
|
2019-01-18 23:30:10 +03:00 |
AlexeyAB
|
0e022d0912
|
Fixed timer
|
2019-01-18 21:29:06 +03:00 |
AlexeyAB
|
4ed6fd1ada
|
Fix for compilation on Google Colab
|
2019-01-18 19:54:05 +03:00 |