acrn-hypervisor

Commit Graph

Author	SHA1	Message	Date
Wu Zhou	6a430de814	hv: remove CPU frequency control from guests The design of ACRN CPU performance management is to let hardware do the autonomous frequency selection(or set to a fixed value), and remove guest's ability to control CPU frequency. This patch is to remove guest's ability to control CPU frequency by removing the guests' HWP/EIST CPUIDs and blocking the related MSR accesses. Including: - Remove CPUID.06H:EAX[7..11] (HWP) - Remove CPUID.01H:ECX[7] (EIST) - Inject #GP(0) upon accesses to MSR_IA32_PM_ENABLE, MSR_IA32_HWP_CAPABILITIES, MSR_IA32_HWP_REQUEST, MSR_IA32_HWP_STATUS, MSR_IA32_HWP_INTERRUPT, MSR_IA32_HWP_REQUEST_PKG - Emulate MSR_IA32_PERF_CTL. Value written to MSR_IA32_PERF_CTL is just stored for reading. This is like how the native environment would behavior when EIST is disabled from BIOS. - Emulate MSR_IA32_PERF_STATUS by filling it with base frequency state. This is consistent with Windows, which displays current frequency as base frequency when running in VM. - Hide the IA32_MISC_ENABLE bit 16 (EIST enable) from guests. This bit is dependent to CPUID.01H:ECX[7] according to SDM. - Remove CPID.06H:ECX[0] (hardware coordination feedback) - Inject #GP(0) upon accesses to IA32_MPERF, IA32_APERF Also DM do not need to generate _PSS/_PPC for post-launched VMs anymore. This is done by letting hypercall HC_PM_GET_CPU_STATE sub command ACRN_PMCMD_GET_PX_CNT and ACRN_PMCMD_GET_PX_DATA return (-1). Tracked-On: #8168 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-21 03:48:58 +08:00
Jian Jun Chen	1bf984890b	hv: tsc: start HPET counter before calibration HPET is used to calibrate the tsc frequency if system fails to get the accurate frequency from CPUID 0x15. But on some platforms (for example: the emulated ACRN on QEMU) HPET is not started by default, which causes the failure of calibration TSC by HPET. Tracked-On: #8113 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-15 03:14:01 +08:00
Wu Zhou	8c5bb8b471	hv: change 'DISABLED' settings to 'ENABLED' In order to improve DX, 'DISABLED' style configurator settings are changed to 'ENABLED' style. The meaning of the MACROs are reversed, so in the hv code we have to change '#ifndef' -> '#ifdef' or '#ifdef' -> '#ifndef'. Including: - MCE_ON_PSC_DISABLED -> MCE_ON_PSC_ENABLED - ENFORCE_TURNOFF_AC -> SPLIT_LOCK_DETECTION_ENABLED - ENFORCE_TURNOFF_GP -> UC_LOCK_DETECTION_ENABLED Tracked-On: #7661 Acked-by: Eddie Dong <eddie.dong@intel.com> Signed-off-by: Wu Zhou <wu.zhou@intel.com>	2022-08-17 09:23:33 +08:00
Minggui Cao	83164d6030	hv: shell: improve console to modify input easier 1. make memcpy_erms as a public API; add a new one memcpy_erms_backwards, which supports to copy data from tail to head. 2. improve to use right/left/home/end key to move cursor, and support delete/backspace key to modify current input command. Tracked-On: #7931 Signed-off-by: Minggui Cao <minggui.cao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-07-28 23:31:43 +08:00
Jian Jun Chen	22a302599a	hv: tlfs: fix the incorrect vLAPIC freq MSR When LAPIC timer is working in oneshot or periodic mode, OS uses initial counter register/current counter register to program a timer. Both initial counter and current counter depend on the LAPIC frequency. ACRN emulated vLAPIC timer based on the TSC. vLAPIC freq is the same as TSC freq. Tracked-On: #7876 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com>	2022-07-26 05:53:19 +08:00
Yifan Liu	4f4da08490	hv: cve hotfix: Disable RRSBA on platform using retpoline For platform that supports RRSBA (Restricted Return Stack Buffer Alternate), using retpoline may not be sufficient to guard against branch history injection or intra-mode branch target injection. RRSBA must be disabled to prevent CPUs from using alternate predictors for RETs. Quoting Intel CVE-2022-0001/CVE-2022-0002: Where software is using retpoline as a mitigation for BHI or intra-mode BTI, and the processor both enumerates RRSBA and enumerates RRSBA_DIS controls, it should disable this behavior. ... Software using retpoline as a mitigation for BHI or intra-mode BTI should use these new indirect predictor controls to disable alternate predictors for RETs. See: https://www.intel.com/content/www/us/en/developer/articles/technical/ software-security-guidance/technical-documentation/branch-history-injection.html Tracked-On: #7907 Signed-off-by: Yifan Liu <yifan1.liu@intel.com>	2022-07-22 09:38:41 +08:00
Jian Jun Chen	c88860250e	hv: tlfs: add tlfs TSC freq MSR support for WaaG TLFS defined 2 vMSRs which can be used by Windows guest to get the TSC/APIC frequencies from hypervisor. This patch adds the support of HV_X64_MSR_TSC_FREQUENCY/HV_X64_MSR_APIC_FREQUENCY vMSRS whose availability is exposed by CPUID.0x40000003:EAX[bit11] and EDX[bit8]. v1->v2: - revise commit message to highlight that the changes are for WaaG Tracked-On: #7876 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-07-18 16:15:29 +08:00
Jian Jun Chen	97a2919138	hv: tsc: calibrate TSC by HPET On some platforms CPUID.0x15:ECX is zero and CPUID.0x16 can only return the TSC frequency in MHZ which is not accurate. For example the TSC frequency obtained by CPUID.0x16 is 2300 MHZ and the TSC frequency calibrated by HPET is 2303.998 MHZ which is much closer to the actual TSC frequency 2304.000 MHZ. This patch adds the support of using HPET to calibrate TSC when HPET is available and CPUID.0x15:ECX is zero. v3->v4: - move calc_tsc_by_hpet into hpet_calibrate_tsc v2->v3: - remove the NULL check in hpet_init - remove ""& 0xFFFFFFFFU" in tsc_read_hpet - add comment for the counter wrap in the low 32 bits in calc_tsc_by_hpet - use a dedicated function for hpet_calibrate_tsc v1->v2: - change native_calibrate_tsc_cpuid_0x15/0x16 to native_calculate_tsc_cpuid_0x15/0x16 - move hpet_init to BSP init - encapsulate both HPET and PIT calibration to one function - revise the commit message with an example" Tracked-On: #7876 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-07-17 16:48:47 +08:00
Ziheng Li	eb8bcb06b3	Update copyright year range in code headers Modified the copyright year range in code, and corrected "int32_tel" into "Intel" in two "hypervisor/include/debug/profiling.h" and "hypervisor/include/debug/profiling_internal.h". Tracked-On: #7559 Signed-off-by: Ziheng Li <ziheng.li@intel.com>	2022-07-15 11:48:35 +08:00
Yifan Liu	05460f151a	hv: Serialize WBINVD using wbinvd_lock As mentioned in previous patch, wbinvd utilizes the vcpu_make_request and signal_event call pair to stall other vcpus. Due to the fact that these two calls are not thread-safe, we need to avoid concurrent call to this API pair. This patch adds wbinvd lock to serialize wbinvd emulation. Tracked-On: #7887 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-07-14 09:05:37 +08:00
Yifan Liu	745e70fb06	hv: Change sched_event back to boolean-based implementation Commit `d575edf79a` changes the internal implementation of wait_event and signal_event to use a counter instead of a boolean value. The background was: ACRN utilizes vcpu_make_request and signal_event pair to shoot down other vcpus and let them wait for signals. vcpu_make_request eventually leads to target vcpu calling wait_event. However vcpu_make_request/signal_event pair was not thread-safe, and concurrent calls of this pair of API could lead to problems. One such example is the concurrent wbinvd emulation, where vcpus may concurrently issue vcpu_make_request/signal_event to synchronize wbinvd emulation. `d575edf` commit uses a counter in internal implementation of wait_event/signal_event to avoid data races. However by using a counter, the wait/signal pair now carries semantics of semaphores instead of events. Semaphores require caller to carefully plan their calls instead of multiply signaling any number of times to the same event, which deviates from the original "event" semantics. This patch changes the API implementation back to boolean-based, and re-resolve the issue of concurrent wbinvd in next patch. This also partially reverts commit `10963b04d1`, which was introduced because of the `d575edf`. Tracked-On: #7887 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-07-14 09:05:37 +08:00
Fei Li	df3390f401	hv: vtd: reset the one-shot bits for GCMD_REG If multiple control fields in GCMD_REG register need to be modified, software must serialize the modifications through multiple writes to this register. So one-shot bits (bits 30-29, 27 and 24) in gcmd should not been set. Otherwise, other control field may be written to GCMD_REG at the same time with one-shot bit (Clearing one-shot bit has no effect, software sets this field would set/update this control field used by hardware). Tracked-On: #7381 Signed-off-by: Fei Li <fei1.li@intel.com>	2022-05-20 09:30:25 +08:00
Chenli Wei	5f0588b5f8	hv: move the define of MAX_IR_ENTRIES to offline tool There is an issue of calculate 2^n roundup of CONFIG_MAX_PT_IRQ_ENTRIES, and the code style is very ugly when we use macro to fix it. So this patch move MAX_IR_ENTRIES to offline tool which could do align check and calculate it automatically. Signed-off-by: Chenli Wei <chenli.wei@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-05-20 09:08:47 +08:00
Yonghua Huang	961b5d16f4	hv: update SSRAM regions EPT memory type to WB when SSRAM regions are assigned to service VM to support virtulization of SSRAM for post-launched RTVMs, service VM need to access all SSRAM regions for management, typically, service VM does data cleanup in SSRAM region when it is reclaimed from a shutdown RTVM. This patch update memory type from UC(by default) to WB, else SSARM region will be evicted when access from guest happens. Tracked-On: #7425 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2022-05-10 15:45:52 +08:00
Zhou, Wu	3ba5b1522f	hv: fix post RTVM booting failure with SSRAM When booting prelaunch RTVM with SSRAM enabled, we need to delete the SSRAM region that is used by prelaunch RTVM from Service VM EPT mapping. If it is not used, or it is not fully used, the SSRAM or the rest SSRAM should be in Service VM map. But current code has a issue that it always deletes all SSRAM region from Service VM EPT, even when no SSRAM is enabled for prelaunch RTVM. This could cause the post RTVM with SSRAM boot failure, as DM checks and removes SSRAM region from Service VM EPT during post RTVM setup. Changing get_software_sram_size() to PRE_RTVM_SW_SRAM_MAX_SIZE could solve the issue, as PRE_RTVM_SW_SRAM_MAX_SIZE is the SSRAM size that prelaunch RTVM actually uses. Tracked-On: #7401 Signed-off-by: Zhou, Wu <wu.zhou@intel.com>	2022-05-06 14:41:58 +08:00
Jiang, Yanting	599894e571	Fix: write xmm registers correctly The movdqu instruction moves unaligned double quadword (128 bit) contained in XMM registers. This patch uses pointers as input parameters of the function write_xmm_0_2() to get 128-bit value from 64-bit array for each XMM register. Tracked-On: #7380 Reviewed-by: Fei Li <fei1.li@intel.com> Signed-off-by: Jiang, Yanting <yanting.jiang@intel.com>	2022-05-06 10:29:33 +08:00
Tw	e2f7b1fc51	hv: remove obsolete declarations related to RDT Since CAT support for hybrid platform is landed, let's remove some old declarations which are no longer used. Tracked-On: #6690 Signed-off-by: Tw <wei.tan@intel.com>	2022-04-26 14:27:01 +08:00
Chenli Wei	ed1c638c87	hv: refine for HPAn setting The current code only supports 2 HPA regions per VM. This patch extended ACRN to support 2+ HPA regions per VM, to use host memory better if it is scatted among multiple regions. This patch uses an array to describe the hpa region for the VM, and change the logic of ve820 to support multiple regions. This patch dependent on the config tool and GPA SSRAM change Tracked-On: #6690 Signed-off-by: Chenli Wei <chenli.wei@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-04-22 14:46:05 +08:00
Tw	e246ada6b0	hv: fix core type error when launch RTVM use atom core When CPUID executes with EAX set to 1AH, the processor returns information about hybrid capabilities. This information is percpu related, and should be obtained directly from the physical cpu. Tracked-On: #6899 Signed-off-by: Tw <wei.tan@intel.com>	2022-04-21 09:21:16 +08:00
Yonghua Huang	80292a482d	hv: remove pgentry_present field in struct pgtable Page table entry present check is page table type specific and static, e.g. just need to check bit0 of page entry for entries of MMU page table and bit2~bit0 for EPT page table case. hence no need to check it by callback function every time. This patch remove 'pgentry_present' callback field and add a new bitmask field for this page entry present check. It can get better performance especially when this check is executed frequently. Tracked-On: #7327 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Reviewed-by: Yu Wang <yu1.wang@intel.com>	2022-04-20 17:38:02 +08:00
Tw	deb35a0de9	hv: fix cpuid 0x2 mismatch when launch RTVM use atom core When CPUID executes with EAX set to 02H, the processor returns information about cache and TLB information. This information is percpu related, and should be obtained directly from the physical cpu. BTW, this patch is backported from v2.7 branch. Tracked-On: #6931 Signed-off-by: Tw <wei.tan@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-04-20 16:13:10 +08:00
Zhou, Wu	32cb5954f2	hv: refine the hard-coded GPA SSRAM area size Using the SSRAM area size extracted by config_tools, the patch changes the hard-coded GPA SSRAM area size to its actual size, so that pre-launched VMs can support large(>8MB) SSRAM area. When booting service VM, the SSRAM area has to be removed from Service VM's mem space, because they are passed-through to the pre-rt VM. The code was bugged since it was using the SSRAM area's GPA in the pre-rt VM. Changed it to GPA in Service VM. Tracked-On: #7212 Acked-by: Eddie Dong <eddie.dong@intel.com> Signed-off-by: Zhou, Wu <wu.zhou@intel.com>	2022-04-18 16:47:23 +08:00
Tw	3c384a489c	hv: support CAT on hybrid platform On hybrid platform(e.g. ADL), there may be multiple instances of same level caches for different type of processors, The current design only supports one global `rdt_info` for each RDT resource type. In order to support hybrid platform, this patch introduce `rdt_ins` to represents the "instance". Also, the number of `rdt_info` is dynamically generated by config-tool to match with physical board. Tracked-On: projectacrn#6690 Signed-off-by: Tw <wei.tan@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2022-04-18 15:33:11 +08:00
Tw	19da21c898	hv: remove RDT information detection As RDT related information will be offered by config-tool dynamically, and HV is just a consumer of that. So there's no need to do this detection at startup anymore. Tracked-On: projectacrn#6690 Signed-off-by: Tw <wei.tan@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2022-04-18 15:33:11 +08:00
Geoffroy Van Cutsem	8b16be9185	Remove "All rights reserved" string headers Many of the license and Intel copyright headers include the "All rights reserved" string. It is not relevant in the context of the BSD-3-Clause license that the code is released under. This patch removes those strings throughout the code (hypervisor, devicemodel and misc). Tracked-On: #7254 Signed-off-by: Geoffroy Van Cutsem <geoffroy.vancutsem@intel.com>	2022-04-06 13:21:02 +08:00
Fei Li	3f7501db38	hv: mmio: replace hi_mmio with mmio64 Now HI_MMIO_xxx is duplicate with MMIO64_xxx. This patch replace HI_MMIO_xxx with MMIO64_xxx. Tracked-On: #6011 Signed-off-by: Fei Li <fei1.li@intel.com>	2022-03-29 15:34:29 +08:00
Minggui Cao	05ca1d7641	hv: fix a bug about host/guest msr store/load Unify the handling of host/guest MSR area in VMCS. Remove the emum value as the element index when there are a few of MSRs in host/guest area. Because the index could be changed if one element not used. So, use a variable to save the index which will be used. Tracked-On: #6966 Acked-by: Eddie Dong <eddie.dong@intel.com> Signed-off-by: Minggui Cao <minggui.cao@intel.com>	2022-03-28 12:00:01 +08:00
Minggui Cao	b3bd153180	hv: expose PEBS capability and MSR as PMU_PT flag Requirement: in CPU partition VM (RTVM), vtune or perf can be used to sample hotspot code path to tune the RT performance, It need support PMU/PEBS (Processor Event Based Sampling). Intel TCC asks for it, too. It exposes PEBS related capabilities/features and MSRs to CPU partition VM, like RTVM. PEBS is a part of PMU. Also PEBS needs DS (Debug Store) feature to support. So DS is exposed too. Limitation: current it just support PEBS feature in VM level, when CPU traps to HV, the performance counter will stop. Perf global control MSR is used to do this work. So, the counters shall be close to native. Tracked-On: #6966 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Minggui Cao <minggui.cao@intel.com>	2022-03-10 14:34:33 +08:00
Minggui Cao	299c56bb68	hv: add a flag for PMU passthrough to guest VM Add a flag: GUEST_FLAG_PMU_PASSTHROUGH to indicate if PMU (Performance Monitor Unit) is passthrough to guest VM. Tracked-On: #6966 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Minggui Cao <minggui.cao@intel.com>	2022-03-10 14:34:33 +08:00
Minggui Cao	3b1deda0eb	hv: revert NMI notification by INIT signal NMI is used to notify LAPIC-PT RTVM, to kick its CPU into hypervisor. But NMI could be used by system devices, like PMU (Performance Monitor Unit). So use INIT signal as the partition CPU notification function, to replace injecting NMI. Also remove unused NMI as notification related code. Tracked-On: #6966 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Minggui Cao <minggui.cao@intel.com>	2022-03-10 14:34:33 +08:00
Chenli Wei	c4c7835c12	hv: refine the vept module Now the vept module uses a mixture of nept and vept, it's better to refine it. So this patch rename nept to vept and simplify the interface of vept init module. Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei <chenli.wei@intel.com>	2022-03-08 16:41:46 +08:00
Wen Qian	e2f1990548	hv: change error code of undefined hypercall This patch adds ENOTTY and ENOSYS to indicate undefined and obsoleted request hyercall respectively, and uses ENOTTY as error code for undefined hypercall instead of EINVAL to consistent with the ACRN kernel's return value. Tracked-On: #7029 Signed-off-by: Wen Qian <qian.wen@intel.com> Signed-off-by: Li Fei <fei1.li@intel.com> Acked-by: Wang, Yu1 <yu1.wang@intel.com>	2022-02-21 09:25:50 +08:00
Mingqiang Chi	3d5c3c4754	hv:fix violations of coding guideline C-ST-04 The coding guideline rule C-ST-04 requires that a 'if' statement followed by one or more 'else if' statement shall be terminated by an 'else' statement which contains either appropriate action or a comment. Tracked-On: #6776 Signed-off-by: Mingqiang Chi <mingqiang.chi@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2022-02-18 18:41:07 +08:00
Chenli Wei	b7a99f4530	hv: replace the CONFIG_PLATFORM_RAM_SIZE with get_e820_ram_size for vept Now the vept table was allocate dynamically, but the table size of vept was calculated by the CONFIG_PLATFORM_RAM_SIZE which was predefined by config tool. It's not complete change and can't support single binary for different boards/platforms. So this patch will replace the CONFIG_PLATFORM_RAM_SIZE and get the top ram size from hv_E820 interface for vept. Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei <chenli.wei@linux.intel.com>	2022-02-18 18:39:43 +08:00
Chenli Wei	5432b52b12	hv: replace the CONFIG_PLATFORM_RAM_SIZE with get_e820_ram_size for ept Now the EPT module use predefined parameter "CONFIG_PLATFORM_RAM_SIZE" to calculate the ept table size. After change the EPT table to dynamic allocate to support single binary for different boards/platforms, the ept table size should dynamic calculate too. So this patch replace CONFIG_PLATFORM_RAM_SIZE by the hv_e820_ram_size to get the RAM info on run time. Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei <chenli.wei@linux.intel.com>	2022-02-18 18:39:43 +08:00
Chenli Wei	148a3d334c	hv: replace the CONFIG_PLATFORM_RAM_SIZE with get_e820_ram_size for mmu CONFIG_PLATFORM_RAM_SIZE is predefined by config tool and mmu use it to calculate the table size and predefine the ppt table. This patch will change the ppt to allocate dynamically and get the table size by the hv_e820_ram_size interface which could get the RAM info on run time and replace the CONFIG_PLATFORM_RAM_SIZE. Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei <chenli.wei@linux.intel.com>	2022-02-18 18:39:43 +08:00
Chenli Wei	22e08d541f	hv: calculate hv_e820_ram_size dynamically The e820 module could get the RAM info on run time, but the RAM size and MAX address was limited by CONFIG_PLATFORM_RAM_SIZE which was predefined by config tool. Current solution can't support single binary for different boards or platforms and the CONFIG_PLATFORM_RAM_SIZE can't matching the RAM size if user have not update config tools setting after the device changed. So this patch remove the CONFIG_PLATFORM_RAM_SIZE and calculate ram size on run time. Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei <chenli.wei@linux.intel.com>	2022-02-18 18:39:43 +08:00
Chenli Wei	01fa6de42c	hv: refine the e820 add new entry logic Sometimes the memory to be allocated is not at the end of an entry, that means we have to break one enty into 2 smaller entries, there are two ways to add the new entry to hv_e820, adds to the end or insert it. The initial e820 table is ordered, that's why the e820_alloc_memory interface asssum all entries was sorted, but add new entry to the end will break the orde of hv_e820. So we use insert_e820_entry to replace the add_e820_entry, the new interfeac will keep the orde and users do not need sort again after alloc region Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei<chenli.wei@linux.intel.com>	2022-02-09 10:14:01 +08:00
Yonghua Huang	364b2b1428	hv: remove HC_GET_PLATFORM_INFO hypercall support HC_GET_PLATFORM_INFO hypercall is not supported anymore, hence to remove related function and data structure definition. Tracked-On: #6690 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2022-02-09 10:11:11 +08:00
Chenli Wei	635a6da1c0	hv:refine the min address logic of high memory Current mmu assum the high memory start from 4G,it's not true for some platform. The map logic use "high64_max_ram - 4G" to calculate the high ram size without any check,it's an issue when the platform have no high memory. So this patch add high64_min_ram variable to calculate the min address of high memory and check the high64_min_ram to fix the previou issue. Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei <chenli.wei@linux.intel.com>	2022-01-27 09:11:40 +08:00
Wen Qian	5d7465a055	hv: fix bug when set MSR_IA32_COPY_PLATFORM_TO_LOCAL before setting MSR_IA32_COPY_LOCAL_TO_PLATFORM The current code would inject GP to guest, when there's no IWKeyBackup, and the guest tried to write MSR MSR_IA32_COPY_PLATFORM_TO_LOCAL(0xd92) to copy IWKeyBackup for the platform to the IWKey for this logical processor. This patch fixes it by adjusting the code logic, and it'll do nothing instead of inject GP if no valid IWKeyBackup. This patch alse add checking for the value being written to avoid setting reserved MSR bits. Tracked-On: #7018 Signed-off-by: Wen Qian <qian.wen@intel.com> Signed-off-by: Li Fei <fei1.li@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2022-01-21 14:35:59 +08:00
Chenli Wei	c93c2224e0	hv: alloc multiboot modules memory Now the multiboot modules memory have not reserve,it's an issue if these memory alloc and write before VM start. Incorrect allocation of multiboot modules memory will cause VM lost data or start faild. So we find these modules memory range and reserve these memory from e820 entry. All these memory will realloc to VM which own them before the vm start. Tracked-On: #6690 Acked-by: Anthony Xu <anthony.xu@intel.com> Signed-off-by: Chenli Wei <chenli.wei@intel.com>	2022-01-21 13:38:06 +08:00
Mingqiang Chi	b6b69f2178	hv: fix violations of coding guideline C-FN-16 The coding guideline rule C-FN-16 requires that 'Mixed-use of C code and assembly code in a single function shall not be allowed', this patch wraps inline assembly to inline functions. Tracked-On: #6776 Signed-off-by: Mingqiang Chi <mingqiang.chi@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com> v1-->v2: use inline functions for read/write XMM registers	2022-01-13 08:29:02 +08:00
Mingqiang Chi	83c0a97fb1	hv: fix clang analyzer deadcode remove the dead code since the variable is never used. Tracked-On: #6776 Signed-off-by: Mingqiang Chi <mingqiang.chi@intel.com>	2022-01-07 13:47:32 +08:00
Yuanyuan Zhao	8f114d82af	hv: rename CONFIG_IOMMU_BUS_NUM Rename `CONFIG_IOMMU_BUS_NUM` to `ACFG_MAX_PCI_BUS_NUM`. Configure tool will calculate `ACFG_MAX_PCI_BUS_NUM` base on the max pci num which is used by VF. So user needn't care about `ACFG_MAX_PCI_BUS_NUM`, and memory will be used resonable. Tracked-On: #6942 Signed-off-by: Yuanyuan Zhao <yuanyuan.zhao@linux.intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Victor Sun <victor.sun@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2021-12-13 11:06:10 +08:00
Mingqiang Chi	3555aae4ac	hv:remove 2 bits vmx capability check remove is_valid_xsave_combination api, assume the hardware or QEMU can guarantee that support XSAVE on CPU side and XSAVE_XRSTR on VMX side or not. will add offline-tool in QEMU platform to avoid the user use wrong XSAVE configurations. remov check VMX_PROCBASED_CTLS2_XSVE_XRSTR based on the above reason. for VMX_PROCBASED_CTLS2_PAUSE_LOOP, now it will panic if run ACRN over QEMU, here remove it from essential check, and it will print error information when set this bit if there is no the hardware capability. v1-v2: remove is_valid_xsave_combination Tracked-On: #6584 Signed-off-by: Mingqiang Chi <mingqiang.chi@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2021-12-13 08:52:52 +08:00
Yifan Liu	5c9456462b	hv && config-tool: Add compilation option to disable all interrupts in HV This patch adds an option CONFIG_KEEP_IRQ_DISABLED to hv (default n) and config-tool so that when this option is 'y', all interrupts in hv root mode will be permanently disabled. With this option to be 'y', all interrupts received in root mode will be handled in external interrupt vmexit after next VM entry. The postpone latency is negligible. This new configuration is a requirement from x86 TEE's secure/non-secure interrupt flow support. Many race conditions can be avoided when keeping IRQ off. v5: Rename CONFIG_ACRN_KEEP_IRQ_DISABLED to CONFIG_KEEP_IRQ_DISABLED v4: Change CPU_IRQ_ENABLE/DISABLE to CPU_IRQ_ENABLE_ON_CONFIG/DISABLE_ON_CONFIG and guard them using CONFIG_ACRN_KEEP_IRQ_DISABLED v3: CONFIG_ACRN_DISABLE_INTERRUPT -> CONFIG_ACRN_KEEP_IRQ_DISABLED Add more comment in commit message Tracked-On: #6571 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2021-12-10 09:50:17 +08:00
Yifan Liu	fa6b55db68	hv: tee: Handling x86_tee secure interrupts corner cases Previous upstreamed patches handles the secure/non-secure interrupts in handle_x86_tee_int. However there is a corner case in which there might be unhandled secure interrupts (in a very short time window) when TEE yields vCPU. For this case we always make sure that no secure interrupts are pending in TEE's vlapic before scheduling REE. Also in previous patches, if non-secure interrupt comes when TEE is handling its secure interrupts, hypervisor injects a predefined vector into TEE's vlapic. TEE does not consume this vector in secure interrupt handling routine so it stays in vIRR, but it should be cleared because the actual interrupt will be consumed in REE after VM Entry. v3: Fix comments on interrupt priority v2: Add comments explaining the priority of secure/non-secure interrupts Tracked-On: #6571 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2021-12-09 10:47:16 +08:00
Yifan Liu	fd7ab300a8	hv: tee: Rename TEE_NOTIFICATION_VECTOR to TEE_FIXED_NONSECURE_VECTOR The TEE_NOTIFICATION_VECTOR can sometimes be confused with TEE's PI notification vector. So rename it to TEE_FIXED_NONSECURE_VECTOR for better readability. No logic change. v3: Add more comments in commit message. Tracked-On: #6571 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2021-12-09 10:47:16 +08:00
Yifan Liu	702a71639f	hv: Add two vlapic APIs Sometimes HV would like to know if there are specific interrupt pending in vIRR, and clears them if necessary (such as in x86_tee case). This patch adds two APIs: get_next_pending_intr and clear_pending_intr. This patch also moves the inline api prio() from vlapic.c to vlapic.h v3: Remove apicv_get_next_pending_intr and apicv_clear_pending_intr and use vlapic_get_next_pending_intr and vlapic_clear_pending_intr directly. v2: get_pending_intr -> get_next_pending_intr apicv_basic/advanced_clear_pending_intr -> apicv_clear_pending_intr apicv_basic/advanced_get_pending_intr -> apicv_get_next_pending_intr has_pending_intr kept Tracked-On: #6571 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Anthony Xu <anthony.xu@intel.com>	2021-12-09 10:47:16 +08:00

1 2 3 4 5 ...

2286 Commits