Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/intgemm.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorKenneth Heafield <github@kheafield.com>2019-11-29 13:47:15 +0300
committerKenneth Heafield <github@kheafield.com>2019-11-29 13:47:15 +0300
commitfaa4a3aef2062d1dd2da51730caf80b5875a9f10 (patch)
tree72aaa2c7e53035a7fd2b5f9f6df0884ad9d2fd1a /README.md
parentc5f7b8462114001a32e4034dbfd83adb70af6895 (diff)
Update README in light of license
Diffstat (limited to 'README.md')
-rw-r--r--README.md30
1 files changed, 3 insertions, 27 deletions
diff --git a/README.md b/README.md
index 5b1c476..9e1550f 100644
--- a/README.md
+++ b/README.md
@@ -69,37 +69,13 @@ intgemm::Int8Shift::Multiply(A_prepared.begin(), B_prepared.begin(), A_rows, wid
```
## Quantization
-Floating-point values are multiplied by a user-specified constant then rounded to an integer.
+Floating-point values are multiplied by a user-specified constant then rounded to an integer.
In 16 bit, Jacob Devlin recommends 1024.0 for neural networks to prevent the aforementioned overflow.
-In 8 bit, use 127.0 / the largest value. Quantization will saturate so it's possible to use larger multipliers to obtain clipping.
+In 8 bit, use 127.0 / the largest value (use MaxAbsolute). Quantization will saturate so it's possible to use larger multipliers to obtain clipping.
## Acknowledgments
The original 16-bit SSE2 code came from:
-Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU by Jacob Devlin
-https://arxiv.org/abs/1705.01991
-
-Under a license:
-
-Copyright (c) 2017 Microsoft Corporation
-
-Permission is hereby granted, free of charge, to any person obtaining a copy
-of this software and associated documentation files (the "Software"), to deal
-in the Software without restriction, including without limitation the rights
-to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
-copies of the Software, and to permit persons to whom the Software is
-furnished to do so, subject to the following conditions:
-
-The above copyright notice and this permission notice shall be included in all
-copies or substantial portions of the Software.
-
-THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
-IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
-FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
-AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
-LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
-OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-SOFTWARE.
-
+Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU by Jacob Devlin https://arxiv.org/abs/1705.01991 under the MIT license.