author     Benoit Jacob <benoitjacob@google.com>          2020-08-24 20:47:06 +0300
committer  Copybara-Service <copybara-worker@google.com>  2020-08-24 20:47:47 +0300
commit     29a155b0b0cff2c3da2b54201f039e9c07a4a695 (patch)
tree       c44c40e3db9ddd5926a44f5a4e7f042047edc359
parent     ce0e559b2dcd6cefd18cd31a4e188a1b3a276e32 (diff)
Update README.md
PiperOrigin-RevId: 328169260
-rw-r--r--  README.md  13
1 files changed, 4 insertions, 9 deletions
@@ -10,18 +10,13 @@ ruy supports both floating-point and 8bit-integer-quantized matrices.
 
 ## Efficiency
 
-ruy is designed to achieve maximal performance not just on very large sizes, as
+ruy is designed to achieve high performance not just on very large sizes, as
 is the focus of many established libraries, but on whatever are the actual sizes
 and shapes of matrices most critical in current TensorFlow Lite applications.
 This often means quite small sizes, e.g. 100x100 or even 50x50, and all sorts of
-rectangular shapes.
-
-ruy is currently only optimized for the ARM architectures (both 64-bit and
-32-bit code). Optimization for the Intel x86 architecture is in progress.
-
-ruy is currently optimized only for the following combination of storage orders:
-LHS = row-major, RHS = column-major, destination = column-major. All other
-combinations of storage orders fall back to slow reference code at the moment.
+rectangular shapes. It's not as fast as completely specialized code for each
+shape, but it aims to offer a good compromise of speed across all shapes and a
+small binary size.
 
 ## Documentation
 
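The README text removed by this commit named one specific storage-order combination: LHS = row-major, RHS = column-major, destination = column-major. The sketch below is plain reference code illustrating what that combination means for index arithmetic; it is not ruy's implementation, and the function name `MulRowColToCol` is invented here for illustration.

```cpp
#include <cassert>
#include <vector>

// Multiply a row-major LHS (m x k) by a column-major RHS (k x n) into a
// column-major destination (m x n) -- the storage-order combination the old
// README text described. Plain reference code, not ruy's optimized kernels.
std::vector<float> MulRowColToCol(const std::vector<float>& lhs,
                                  const std::vector<float>& rhs,
                                  int m, int k, int n) {
  std::vector<float> dst(m * n, 0.0f);
  for (int col = 0; col < n; ++col) {
    for (int row = 0; row < m; ++row) {
      float acc = 0.0f;
      for (int d = 0; d < k; ++d) {
        // Row-major LHS: element (row, d) lives at lhs[row * k + d].
        // Column-major RHS: element (d, col) lives at rhs[col * k + d].
        acc += lhs[row * k + d] * rhs[col * k + d];
      }
      // Column-major destination: element (row, col) at dst[col * m + row].
      dst[col * m + row] = acc;
    }
  }
  return dst;
}
```

Note that with this combination the inner loop walks both operands contiguously (an LHS row and an RHS column are each a contiguous run of k elements), which is what makes it the kernel-friendly case.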