clear-pkgs-linux-iot-lts2018/0603-drm-i915-Use-64-bit-wr...

59 lines
2.3 KiB
Diff
Raw Normal View History

From 0000000000000000000000000000000000000000 Mon Sep 17 00:00:00 2001
2018-10-11 02:06:46 +08:00
From: Zhao Yakui <yakui.zhao@intel.com>
Date: Fri, 14 Sep 2018 16:10:20 +0800
Subject: [PATCH] drm/i915: Use 64-bit write to optimize writing fence_reg
2018-10-11 02:06:46 +08:00
On VGPU scenario the read/write operation of fence_reg will be trapped
by the GVT-g. Then gvt-g follows the HW spec to program the fence_reg.
And the gvt-g takes care of updating the fence reg correctly for any trapped
value of fence reg.
So it is unnecessary to read/write fence reg several times. It is enough that
the fence reg is written only value in 64-bit mode. This will help
to reduce the redundant trap of fence_reg mmio operation.
V1->V2: Add back the condition judgement of !pipelined
Signed-off-by: Zhao Yakui <yakui.zhao@intel.com>
Reviewed-by: He Min <min.he@intel.com>
---
drivers/gpu/drm/i915/i915_gem_fence_reg.c | 15 ++++++++++++---
1 file changed, 12 insertions(+), 3 deletions(-)
diff --git a/drivers/gpu/drm/i915/i915_gem_fence_reg.c b/drivers/gpu/drm/i915/i915_gem_fence_reg.c
2020-05-05 09:02:46 +08:00
index d548ac05c..317e376cc 100644
2018-10-11 02:06:46 +08:00
--- a/drivers/gpu/drm/i915/i915_gem_fence_reg.c
+++ b/drivers/gpu/drm/i915/i915_gem_fence_reg.c
@@ -63,6 +63,7 @@ static void i965_write_fence_reg(struct drm_i915_fence_reg *fence,
i915_reg_t fence_reg_lo, fence_reg_hi;
int fence_pitch_shift;
u64 val;
+ struct drm_i915_private *dev_priv = fence->i915;
if (INTEL_GEN(fence->i915) >= 6) {
fence_reg_lo = FENCE_REG_GEN6_LO(fence->id);
@@ -92,9 +93,17 @@ static void i965_write_fence_reg(struct drm_i915_fence_reg *fence,
val |= I965_FENCE_REG_VALID;
}
- if (!pipelined) {
- struct drm_i915_private *dev_priv = fence->i915;
-
+ if (intel_vgpu_active(dev_priv)) {
+ /* Use the 64-bit RW to write fence reg on VGPU mode.
+ * The GVT-g can trap the written val of VGPU to program the
+ * fence reg. And the fence write in gvt-g follows the
+ * sequence of off/read/double-write/read. This assures that
+ * the fence reg is configured as expected.
+ * At the same time the 64-bit op can help to reduce the num
+ * of VGPU trap for the fence reg.
+ */
+ I915_WRITE64_FW(fence_reg_lo, val);
+ } else if (!pipelined) {
/* To w/a incoherency with non-atomic 64-bit register updates,
* we split the 64-bit update into two 32-bit writes. In order
* for a partial fence not to be evaluated between writes, we
--
https://clearlinux.org
2018-10-11 02:06:46 +08:00