acrn-hypervisor

Commit Graph

Author	SHA1	Message	Date
Zide Chen	ccfdf9cdd7	hv: nested: enable nested virtualization Allow guest set CR4_VMXE if CONFIG_NVMX_ENABLED is set: - move CR4_VMXE from CR4_EMULATED_RESERVE_BITS to CR4_TRAP_AND_EMULATE_BITS so that CR4_VMXE is removed from cr4_reserved_bits_mask. - force CR4_VMXE to be removed from cr4_rsv_bits_guest_value so that CR4_VMXE is able to be set. Expose VMX feature (CPUID01.01H:ECX[5]) to L1 guests whose GUEST_FLAG_NVMX_ENABLED is set. Assuming guest hypervisor (L1) is KVM, and KVM uses EPT for L2 guests. Constraints on ACRN VM. - LAPIC passthrough should be enabled. - use SCHED_NOOP scheduler. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-05-13 16:16:30 +08:00
Zide Chen	dd90eccc25	hv: move invvpid and invept helper code from mmu.c to mmu.h moving invvpid and invept helper code from mmu.c to mmu.h, so that they can be accessed by the nested virtualization code. No logical changes. Tracked-On: #5923 Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-05-13 16:16:30 +08:00
Shuo A Liu	3fffa68665	hv: Support WAITPKG instructions in guest VM TPAUSE, UMONITOR or UMWAIT instructions execution in guest VM cause a #UD if "enable user wait and pause" (bit 26) of VMX_PROCBASED_CTLS2 is not set. To fix this issue, set the bit 26 of VMX_PROCBASED_CTLS2. Besides, these WAITPKG instructions uses MSR_IA32_UMWAIT_CONTROL. So load corresponding vMSR value during context switch in of a vCPU. Please note, the TPAUSE or UMWAIT instruction causes a VM exit if the "RDTSC exiting" and "enable user wait and pause" are both 1. In ACRN hypervisor, "RDTSC exiting" is always 0. So TPAUSE or UMWAIT doesn't cause a VM exit. Performance impact: MSR_IA32_UMWAIT_CONTROL read costs ~19 cycles; MSR_IA32_UMWAIT_CONTROL write costs ~63 cycles. Tracked-On: #6006 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com>	2021-05-13 14:19:50 +08:00
dongshen	ebadf00de8	hv: some coding style fixes Fix issues reported by checkpatch.pl Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com>	2021-05-12 16:50:34 +08:00
Junjie Mao	ea4eadf0a5	hv: hypercalls: refactor permission-checking and dispatching logic The current permission-checking and dispatching mechanism of hypercalls is not unified because: 1. Some hypercalls require the exact vCPU initiating the call, while the others only need to know the VM. 2. Different hypercalls have different permission requirements: the trusty-related ones are enabled by a guest flag, while the others require the initiating VM to be the Service OS. Without a unified logic it could be hard to scale when more kinds of hypercalls are added later. The objectives of this patch are as follows. 1. All hypercalls have the same prototype and are dispatched by a unified logic. 2. Permissions are checked by a unified logic without consulting the hypercall ID. To achieve the first objective, this patch modifies the type of the first parameter of hcall_* functions (which are the callbacks implementing the hypercalls) from `struct acrn_vm ` to `struct acrn_vcpu `. The doxygen-style documentations are updated accordingly. To achieve the second objective, this patch adds to `struct hc_dispatch` a `permission_flags` field which specifies the guest flags that must ALL be set for a VM to be able to invoke the hypercall. The default value (which is 0UL) indicates that this hypercall is for SOS only. Currently only the `permission_flag` of trusty-related hypercalls have the non-zero value GUEST_FLAG_SECURE_WORLD_ENABLED. With `permission_flag`, the permission checking logic of hypercalls is unified as follows. 1. General checks i. If the VM is neither SOS nor having any guest flag that allows certain hypercalls, it gets #UD upon executing the `vmcall` instruction. ii. If the VM is allowed to execute the `vmcall` instruction, but attempts to execute it in ring 1, 2 or 3, the VM gets #GP(0). 2. Hypercall-specific checks i. If the hypercall is for SOS (i.e. `permission_flag` is 0), the initiating VM must be SOS and the specified target VM cannot be a pre-launched VM. Otherwise the hypercall returns -EINVAL without further actions. ii. If the hypercall requires certain guest flags, the initiating VM must have all the required flags. Otherwise the hypercall returns -EINVAL without further actions. iii. A hypercall with an unknown hypercall ID makes the hypercall returns -EINVAL without further actions. The logic above is different from the current implementation in the following aspects. 1. A pre-launched VM now gets #UD (rather than #GP(0)) when it attempts to execute `vmcall` in ring 1, 2 or 3. 2. A pre-launched VM now gets #UD (rather than the return value -EPERM) when it attempts to execute a trusty hypercall in ring 0. 3. The SOS now gets the return value -EINVAL (rather than -EPERM) when it attempts to invoke a trusty hypercall. 4. A post-launched VM with trusty support now gets the return value -EINVAL (rather than #UD) when it attempts to invoke a non-trusty hypercall or an invalid hypercall. v1 -> v2: - Update documentation that describe hypercall behavior. - Fix Doxygen warnings Tracked-On: #5924 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-05-12 13:43:41 +08:00
Liang Yi	688a41c290	hv: mod: do not use explicit arch name when including headers Instead of "#include <x86/foo.h>", use "#include <asm/foo.h>". In other words, we are adopting the same practice in Linux kernel. Tracked-On: #5920 Signed-off-by: Liang Yi <yi.liang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-05-08 11:15:46 +08:00
Li Fei1	f3327364c3	hv: mmu: fix a minor bug We should only map [low32_max_ram, 4G) MMIO region as UC attribute, not map [low32_max_ram, low32_max_ram + 4G) region as UC attribute. Otherwise, the HV will complain [4G, low32_max_ram + 4G) region has already mapped. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-04-29 08:57:13 +08:00
Shuo A Liu	dc88c2e397	hv: Save/restore MSR_IA32_CSTAR during context switch Both Windows guest and Linux guest use the MSR MSR_IA32_CSTAR, while Linux uses it rarely. Now vcpu context switch doesn't save/restore it. Windows detects the change of the MSR and rises a exception. Do the save/resotre MSR_IA32_CSTAR during context switch. Tracked-On: #5899 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-04-23 11:21:52 +08:00
Jian Jun Chen	31b8b698ce	hv: TLFS: Add tsc_offset support for reference time TLFS spec defines that when a VM is created, the value of HV_X64_MSR_TIME_REF_COUNT is set to zero. Now tsc_offset is not supported properly, so guest get a drifted reference time. This patch implements tsc_offset. tsc_scale and tsc_offset are calculated when a VM is launched and are saved in struct acrn_hyperv of struct acrn_vm. Tracked-On: #5956 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-04-23 10:48:07 +08:00
Jian Jun Chen	b4312efbd7	hv: TLFS: inject #GP to guest VM for writing of read-only MSRs TLFS spec defines that HV_X64_MSR_VP_INDEX and HV_X64_MSR_TIME_REF_COUNT are read-only MSRs. Any attempt to write to them results in a #GP fault. Fix the issue by returning error in handler hyperv_wrmsr() of MSRs HV_X64_MSR_VP_INDEX/HV_X64_MSR_TIME_REF_COUNT emulation. Tracked-On: #5956 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-04-23 10:48:07 +08:00
Jian Jun Chen	dd524d076d	hv: TLFS: Setup hypercall page according to the vcpu mode TLFS spec defines different hypercall ABIs for X86 and x64. Currently x64 hypercall interface is not supported well. Setup the hypercall interface page according to the vcpu mode. Tracked-On: #5956 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-04-23 10:48:07 +08:00
Li Fei1	628bca5cad	hv: pgtable: use new algo to calculate PPT/EPT_PD_PAGE_NUM In order to support platform (such as Ander Lake) which physical address width bits is 46, the current code need to reserve 2^16 PD page ((2^46) / (2^30)). This is a complete waste of memory. This patch would reserve PD page by three parts: 1. DRAM - may take PD_PAGE_NUM(CONFIG_PLATFORM_RAM_SIZE) PD pages at most; 2. low MMIO - may take PD_PAGE_NUM(MEM_1G << 2U) PD pages at most; 3. high MMIO - may takes (CONFIG_MAX_PCI_DEV_NUM * 6U) PD pages (may plus PDPT entries if its size is larger than 1GB ) at most for: (a) MMIO BAR size must be a power of 2 from 16 bytes; (b) MMIO BAR base address must be power of two in size and are aligned with its size. Tracked-On: #5929 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-04-22 14:35:57 +08:00
Li Fei1	053c09e764	hv: cpu_cap: PAW over 39 bits must support 1GB large page The platform which physical-address width over 39 bits must support 1GB large page (Both MMU and VMX sides ). This could save lots of page table pages for EPT MMIO mapping. Tracked-On: #5929 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-04-22 14:35:57 +08:00
Li Fei1	41e2d40d1f	hv: e820: remove get_mem_range_info No one uses get_mem_range_info to get the top/bottom/size of the physical memory. We could get these informations by e820 table easily. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: eddie Dong <eddie.dong@intel.com>	2021-04-21 14:00:44 +08:00
Li Fei1	3a465388d4	hv: guest: remove get_mem_range_info in prepare_sos_vm_memmap We used get_mem_range_info to get the top memory address and then use this address as the high 64 bits max memory address of SOS. This assumes the platform must have high memory space. This patch removes the assumption. It will set high 64 bits max memory address of SOS to 4G by default (Which means there's no 64 bits high memory), then update the high 64 bits max memory address if the SOS really has high memory space. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: eddie Dong <eddie.dong@intel.com>	2021-04-21 14:00:44 +08:00
Li Fei1	901e8c869e	hv: vE820: calculate SOS memory size by vE820 tables SOS's memory size could be calculated by its vE820 Tables easily. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: eddie Dong <eddie.dong@intel.com>	2021-04-21 14:00:44 +08:00
Li Fei1	ad15053304	hv: mmu: remove get_mem_range_info in init_paging We used get_mem_range_info to get the top memory address and then use this address as the high 64 bits max memory address. This assumes the platform must have high memory space. This patch calculates the high 64 bits max memory address according the e820 tables and removes the assumption "The platform must have high memory space" by map the low RAM region and high RAM region separately. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: eddie Dong <eddie.dong@intel.com>	2021-04-21 14:00:44 +08:00
Li Fei1	6137347411	hv: smp: fix an isuue about SMP sync Now BSP may launch VMs before APs have not done its initilization, for example, sched_control for per-cpu. However, when we initilize the vcpu thread data, it will access the object (scheduler) of the sched_control of APs. As a result, it will trigger the PF. This patch would waits each physical has done its initilization before to continue to execute. Tracked-On: #5929 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-04-21 10:54:48 +08:00
Li Fei1	5f281df548	hv: serializng: use mfence to ensure trampoline code was updated Using the MFENCE to make sure trampoline code has been updated (clflush) into memory beforing start APs. Tracked-On: #5929 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-04-21 10:54:48 +08:00
Li Fei1	e049abb542	hv: vcpuid: hide new cpuid 0x1b/0x1f Hide CPUID 0x1b (PCONFIG) and 0x1f (Extended Topology Enumeration Leaf) Tracked-On: #5929 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-04-20 13:28:44 +08:00
Li Fei1	31f48d12a2	hv: memory order: use mfence to strengthen the fast string operations order Use MFENCE to strengthen the fast string operations execute order to ensure all trampoline code was updated before flush it into the memory. Tracked-On: #5929 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-04-20 13:28:44 +08:00
Yifan Liu	b80c388b52	hv: Hide HLAT to guest For platform with HLAT (Hypervisor-managed Linear Address Translation) capability, the hypervisor shall hide this feature to its guest. This patch adds MSR_IA32_VMX_PROCBASED_CTLS3 MSR to unsupported MSR list. The presence of this MSR is determined by 1-setting of bit 49 of MSR MSR_IA32_VMX_PROCBASED_CTLS. which is already in unsupported MSR list. [2] Related documentations: [1] Intel Architecture Instruction Set Extensions, version Feb 16, 2021, Ch 6.12 [2] Intel KeyLocker Specification, Sept 2020, Ch 7.2 Tracked-On: #5895 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-04-07 13:47:47 +08:00
Li Fei1	d1ae797742	hv: pgtable: move sanitize_pte into pagetable.c sanitize_pte is used to set page table entry to map to an sanitized page to mitigate l1tf. It should belongs to pgtable module. So move it to pagetable.c Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-03-29 13:28:55 +08:00
Li Fei1	ef90bb6db3	hv:pgtable: rename lookup_address to pgtable_lookup_entry lookup_address is used to lookup a pagetable entry by an address. So rename it to pgtable_lookup_entry to indicate this clearly. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-29 13:28:55 +08:00
Li Fei1	36ddd87a09	hv: pgtable: remove alloc_ept_page alloc_page/free_page should been called in pagetable module. In order to do this, we add pgtable_create_root and pgtable_create_trusty_root to create PML4 page table page for normal world and secure world. After this done, no one uses alloc_ept_page. So remove it. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-29 13:28:55 +08:00
Li Fei1	ea701c63c7	hv: pgtable: add pgtable_create_trusty_root Add pgtable_create_trusty_root to allocate a page for trusty PML4 page table page. This function also copy PDPT entries from Normal world to Secure world. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-29 13:28:55 +08:00
Li Fei1	596c349600	hv: pgtable: add pgtable_create_root Add pgtable_create_root to allocate a page for PMl4 page table page. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-29 13:28:55 +08:00
Li Fei1	eb52e2193a	hv: pgtable: refine name for pgtable add/modify/del Rename mmu_add to pgtable_add_map; Rename mmu_modify_or_del to pgtable_modify_or_del_map. And move these functions declaration into pgtable.h Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-29 13:28:55 +08:00
Liang Yi	33ef656462	hv/mod-irq: use arch specific header files Requires explicit arch path name in the include directive. The config scripts was also updated to reflect this change. Tracked-On: #5825 Signed-off-by: Peter Fang <peter.fang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	df36da1b80	hv/mod_irq: do not include x86/irq.h in common/irq.h Each .c file includes the arch specific irq header file (with full path) by itself if required. Tracked-On: #5825 Signed-off-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	741a208a02	hv/mod_irq: cleanup x86 lapic/ioapic header files Declarations referenced nowhere else are moved into the c file. Tracked-On: #5825 Signed-off-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	ff732cfb2a	hv/mod_irq: move guest interrupt API out of x86/irq.h A new x86/guest/virq.h head file now contains all guest related interrupt handling API. Tracked-On: #5825 Signed-off-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	798015876c	hv/mod_irq: move NMI and exception handler out of x86/irq.c Each of them now resides in a separate .c file. Tracked-On: #5825 Signed-off-by: Yang, Yu-chu <yu-chu.yang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	3a50f949e1	hv/mod_irq: split irq.c into arch/x86/irq.c and common/irq.c The common irq file is responsible for managing the central irq_desc data structure and provides the following APIs for host interrupt handling. - init_interrupt() - reserve_irq_num() - request_irq() - free_irq() - set_irq_trigger_mode() - do_irq() API prototypes, constant and data structures belonging to common interrupt handling are all moved into include/common/irq.h. Conversely, the following arch specific APIs are added which are called from the common code at various points: - init_irq_descs_arch() - setup_irqs_arch() - init_interrupt_arch() - free_irq_arch() - request_irq_arch() - pre_irq_arch() - post_irq_arch() Tracked-On: #5825 Signed-off-by: Peter Fang <peter.fang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	c46e3c71ac	hv/mod_irq: decouple irq number reservation from ioapic This is done be adding irq_rsvd_bitmap as an auxiliary bitmap besides irq_alloc_bitmap. Tracked-On: #5825 Signed-off-by: Peter Fang <peter.fang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	038e0cae92	hv/mod_irq: split IRQ handling into common and arch specific parts The common IRQ handling routine calls arch specific functions pre_irq_arch() and post_irq_arch() before and after calling the registered action function respectively. Tracked-On: #5825 Signed-off-by: Peter Fang <peter.fang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	ac3e0a1718	hv/mod_irq: split irq initialization into common and arch specific parts The common part initializes the global irq_desc data structure while the arch specific part initialize the HW and its own irq data. This is one of the preparation steps for spliting IRQ handling into common and architecture specific parts. Tracked-On: #5825 Signed-off-by: Peter Fang <peter.fang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Liang Yi	f3cae9e258	hv/mod_irq: hide arch specific data in irq_desc Arch specific IRQ data is now an opaque pointer in irq_desc. This is a preparation step for spliting IRQ handling into common and architecture specific parts. Tracked-On: #5825 Signed-off-by: Peter Fang <peter.fang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-24 11:38:14 +08:00
Li Fei1	9000381f34	hv: pgtable: move pgtable definition to pgtable.h This patch moves pgtable definition to pgtable.h and include the proper header file for page module. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-03-11 13:48:52 +08:00
Li Fei1	0278a3f46e	hv: pgatble: move the EPT page table related APIs to ept.c Move the EPT page table related APIs to ept.c. page module only provides APIs to allocate/free page for page table page. pagetabl module only provides APIs to add/modify/delete/lookup page table entry. The page pool and the page table related APIs for EPT should defined in EPT module. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-11 13:48:52 +08:00
Li Fei1	5c71ca456a	hv: pgatble: move the MMU page table related APIs to mmu.c Move the MMU page table related APIs to mmu.c. page module only provides APIs to allocate/free page for page table page. pagetabl module only provides APIs to add/modify/delete/lookup page table entry. The page pool and the page table related APIs for MMU should defined in MMU module. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-03-11 13:48:52 +08:00
Li Fei1	15d68675e9	hv: pgtable: separate common APIs for MMU/EPT We would move the MMU page table related APIs to mmu.c and move the EPT related APIs to EPT.c. The page table module only provides APIs to add/modify/delete/lookup page table entry. This patch separates common APIs and adds separate APIs of page table module for MMU/EPT. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-03-11 13:48:52 +08:00
Li Fei1	80bd3ac02a	hv: trusty: move post_uos_sworld_memory into vm.c post_uos_sworld_memory are used for post-launched VM which support trusty. It's more VM related. So move it definition into vm.c Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-03-11 13:48:52 +08:00
Yonghua Huang	1a011bd91b	hv: disable guest MONITOR-WAIT support when SW SRAM is configured Per-core software SRAM L2 cache may be flushed by 'mwait' extension instruction, which guest VM may execute to enter core deep sleep. Such kind of flushing is not expected when software SRAM is enabled for RTVM. Hypervisor disables MONITOR-WAIT support on both hypervisor and VMs sides to protect above software SRAM from being flushed. This patch disable ACRN guest MONITOR-WAIT support if software SRAM is configured. Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-03-11 09:42:44 +08:00
Yonghua Huang	ae43b2a847	hv: disable host MONITOR-WAIT support when SW SRAM is enabled Per-core software SRAM L2 cache may be flushed by 'mwait' extension instruction, which guest VM may execute to enter core deep sleep. Such kind of flushing is not expected when software SRAM is enabled for RTVM. Hypervisor disables MONITOR-WAIT support on both hypervisor and VMs sides to protect above software SRAM from being flushed. This patch disable hypervisor(host) MONITOR-WAIT support and refine software sram initializaion flow. Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-03-11 09:42:44 +08:00
Yonghua Huang	ea44bb6c4d	hv: wrap function to check software SRAM support Below boolean function are defined in this patch: - is_software_sram_enabled() to check if SW SRAM feature is enabled or not. - set global variable 'is_sw_sram_initialized' to file static. Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-03-11 09:42:44 +08:00
Li Fei1	768e483cd2	hv: pgtable: rename 'struct memory_ops' to 'struct pgtable' The fields and APIs in old 'struct memory_ops' are used to add/modify/delete page table (page or entry). So rename 'struct memory_ops' to 'struct pgtable'. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-03-10 11:42:13 +08:00
Li Fei1	ef98fa69ce	hv: pgtable: remove get_default_access_right API Use default_access_right field to replace get_default_access_right API. Tracked-On: #5830 Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-03-10 11:42:13 +08:00
Li Fei1	7c6a52037a	refine ept_flush_leaf_page Refine the logic how to skip the pSRAM region when flushing cache. Tracked-On: #5330 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-03-03 14:44:25 +08:00
Li Fei1	1db32f4d03	hv: ept: build 4KB page mapping in EPT for code pages of rtvm RTVM is enforced to use 4KB pages to mitigate CVE-2018-12207 and performance jitter, which may be introduced by splitting large page into 4KB pages on demand. It works fine in previous hardware platform where the size of address space for the RTVM is relatively small. However, this is a problem when the platforms support 64 bits high MMIO space, which could be super large and therefore consumes large # of EPT page table pages. This patch optimize it by using large page for purely data pages, such as MMIO spaces, even for the RTVM. Signed-off-by: Li Fei1 <fei1.li@intel.com> Tracked-On: #5788	2021-03-03 13:46:49 +08:00
Li Fei1	01b54241c6	hv: ept: only treak execution right for large pages To mitigate the page size change MCE vulnerability (CVE-2018-12207), ACRN would clear the execution permission in the EPT paging-structure entries for large pages and then intercept an EPT execution-permission violation caused by an attempt to execution an instruction in the guest. However, the current code would clear the execution permission in the EPT paging- structure entries for small pages too when we clearing the the execution permission for large pages. This would trigger extra EPT violation VM exits. This patch fix this issue. Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com> Tracked-On: #5788	2021-03-03 13:46:49 +08:00
Li Fei1	97a9c5151b	kv: kconfig: remove some unused ram size kconfig SOS_RAM_SIZE/UOS_RAM_SIZE Kconfig are only used to calculate how many pages we should reserve for the VM EPT mapping. Now we reserve pages for each VM EPT pagetable mapping by the PLATFORM_RAM_SIZE not the VM RAM SIZE. This could simplify the reserve logic for us: not need to take care variable corner cases. We could make assume we reserve enough pages base on the VM could not use the resources beyond the platform hardware resources. So remove these two unused VM ram size kconfig. Signed-off-by: Li Fei1 <fei1.li@intel.com> Tracked-On: #5788	2021-03-01 13:10:04 +08:00
Li Fei1	0579e2ee24	hv: page: add free_page Add free_page to free page when unmap pagetable. Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com> Tracked-On: #5788	2021-03-01 13:10:04 +08:00
Li Fei1	8d9f12f3b7	hv: page: use dynamic page allocation for pagetable mapping For FuSa's case, we remove all dynamic memory allocation use in ACRN HV. Instead, we use static memory allocation or embedded data structure. For pagetable page, we prefer to use an index (hva for MMU, gpa for EPT) to get a page from a special page pool. The special page pool should be big enougn for each possible index. This is not a big problem when we don't support 64 bits MMIO. Without 64 bits MMIO support, we could use the index to search addrss not larger than DRAM_SIZE + 4G. However, if ACRN plan to support 64 bits MMIO in SOS, we could not use the static memory alocation any more. This is because there's a very huge hole between the top DRAM address and the bottom 64 bits MMIO address. We could not reserve such many pages for pagetable mapping as the CPU physical address bits may very large. This patch will use dynamic page allocation for pagetable mapping. We also need reserve a big enough page pool at first. For HV MMU, we don't use 4K granularity page table mapping, we need reserve PML4, PDPT and PD pages according the maximum physical address space (PPT va and pa are identical mapping); For each VM EPT, we reserve PML4, PDPT and PD pages according to the maximum physical address space too, (the EPT address sapce can't beyond the physical address space), and we reserve PT pages by real use cases of DRAM, low MMIO and high MMIO. Signed-off-by: Li Fei1 <fei1.li@intel.com> Tracked-On: #5788	2021-03-01 13:10:04 +08:00
Li Fei1	5621fabbcb	hv: memory: remove get_sworld_memory_base API memory_ops structure will be changed to store page table related fields. However, secure world memory base address is not one of them, it's VM related. So save sworld_memory_base_hva in vm_arch structure directly. Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com> Tracked-On: #5788	2021-03-01 13:10:04 +08:00
Victor Sun	26abc82f3c	HV: panic on 0 address when do e820_alloc_memory Current memory allocation algorithm is to find the available address from the highest possible address below max_address. If the function returns 0, means all memory is used up and we have to put the resource at address 0, this is dangerous for a running hypervisor. Also returns 0 would make code logic very complicated, since memcpy_s() doesn't support address 0 copy. Tracked-On: #5626 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-02-26 16:38:32 +08:00
Victor Sun	2e72bb97e7	HV: refine acpi rsdp initialize interface In previous code, the rsdp initialization is done in get_rsdp() api implicitly. The function is called multiple times in following acpi table parsing functions and the condition (rsdp == NULL) need to be added in each parsing function. This is not needed since the panic would occur if rsdp is NULL when do acpi initialization. Tracked-On: #5626 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-02-26 16:38:32 +08:00
Yonghua Huang	fdfd28b140	hv: unmap software region of pre-RTVM from Service VM EPT Accessing to software SRAM region is not allowed when software SRAM is pass-thru to prelaunch RTVM. This patch removes software SRAM region from service VM EPT if it is enabled for prelaunch RTVM. Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2021-02-25 09:35:31 +08:00
Tao Yuhong	50d8525618	HV: deny HV owned PCI bar access from SOS This patch denies Service VM the access permission to device resources owned by hypervisor. HV may own these devices: (1) debug uart pci device for debug version (2) type 1 pci device if have pre-launched VMs. Current implementation exposes the mmio/pio resource of HV owned devices to SOS, should remove them from SOS. Tracked-On: #5615 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com>	2021-02-03 14:01:23 +08:00
Tao Yuhong	6e7ce4a73f	HV: deny pre-launched VM ptdev bar access from SOS This patch denies Service VM the access permission to device resources owned by pre-launched VMs. Rationale: * Pre-launched VMs in ACRN are independent of service VM, and should be immune to attacks from service VM. However, current implementation exposes the bar resource of passthru devices to service VM for some reason. This makes it possible for service VM to crash or attack pre-launched VMs. * It is same for hypervisor owned devices. NOTE: * The MMIO spaces pre-allocated to VFs are still presented to Service VM. The SR-IOV capable devices assigned to pre-launched VMs doesn't have the SR-IOV capability. So the MMIO address spaces pre-allocated by BIOS for VFs are not decoded by hardware and couldn't be enabled by guest. SOS may live with seeing the address space or not. We will revisit later. Tracked-On: #5615 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-02-03 14:01:23 +08:00
Shuo A Liu	d4aaf99d86	hv: keylocker: Support keylocker backup MSRs for Guest VM The logical processor scoped IWKey can be copied to or from a platform-scope storage copy called IWKeyBackup. Copying IWKey to IWKeyBackup is called ‘backing up IWKey’ and copying from IWKeyBackup to IWKey is called ‘restoring IWKey’. IWKeyBackup and the path between it and IWKey are protected against software and simple hardware attacks. This means that IWKeyBackup can be used to distribute an IWKey within the logical processors in a platform in a protected manner. Linux keylocker implementation uses this feature, so they are introduced by this patch. Tracked-On: #5695 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-02-03 13:54:45 +08:00
Shuo A Liu	38cd5b481d	hv: keylocker: host keylocker iwkey context switch Different vCPU may have different IWKeys. Hypervisor need do the iwkey context switch. This patch introduce a load_iwkey() function to do that. Switches the host iwkey when the switch_in vCPU satisfies: 1) keylocker feature enabled 2) Different from the current loaded one. Two opportunities to do the load_iwkey(): 1) Guest enables CR4.KL bit. 2) vCPU thread context switch. load_iwkey() costs ~600 cycles when do the load IWKey action. Tracked-On: #5695 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-02-03 13:54:45 +08:00
Shuo A Liu	c11c07e0fe	hv: keylocker: Support Key Locker feature for guest VM KeyLocker is a new security feature available in new Intel CPUs that protects data-encryption keys for the Advanced Encryption Standard (AES) algorithm. These keys are more valuable than what they guard. If stolen once, the key can be repeatedly used even on another system and even after vulnerability closed. It also introduces a CPU-internal wrapping key (IWKey), which is a key- encryption key to wrap AES keys into handles. While the IWKey is inaccessible to software, randomizing the value during the boot-time helps its value unpredictable. Keylocker usage: - New “ENCODEKEY” instructions take original key input and returns HANDLE crypted by an internal wrap key (IWKey, init by “LOADIWKEY” instruction) - Software can then delete the original key from memory - Early in boot/software, less likely to have vulnerability that allows stealing original key - Later encrypt/decrypt can use the HANDLE through new AES KeyLocker instructions - Note: * Software can use original key without knowing it (use HANDLE) * HANDLE cannot be used on other systems or after warm/cold reset * IWKey cannot be read from CPU after it's loaded (this is the nature of this feature) and only 1 copy of IWKey inside CPU. The virtualization implementation of Key Locker on ACRN is: - Each vCPU has a 'struct iwkey' to store its IWKey in struct acrn_vcpu_arch. - At initilization, every vCPU is created with a random IWKey. - Hypervisor traps the execution of LOADIWKEY (by 'LOADIWKEY exiting' VM-exectuion control) of vCPU to capture and save the IWKey if guest set a new IWKey. Don't support randomization (emulate CPUID to disable) of the LOADIWKEY as hypervisor cannot capture and save the random IWKey. From keylocker spec: "Note that a VMM may wish to enumerate no support for HW random IWKeys to the guest (i.e. enumerate CPUID.19H:ECX[1] as 0) as such IWKeys cannot be easily context switched. A guest ENCODEKEY will return the type of IWKey used (IWKey.KeySource) and thus will notice if a VMM virtualized a HW random IWKey with a SW specified IWKey." - In context_switch_in() of each vCPU, hypervisor loads that vCPU's IWKey into pCPU by LOADIWKEY instruction. - There is an assumption that ACRN hypervisor will never use the KeyLocker feature itself. This patch implements the vCPU's IWKey management and the next patch implements host context save/restore IWKey logic. Tracked-On: #5695 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-02-03 13:54:45 +08:00
Shuo A Liu	4483e93bd1	hv: keylocker: Enable the tertiary VM-execution controls In order for a VMM to capture the IWKey values of guests, processors that support Key Locker also support a new "LOADIWKEY exiting" VM-execution control in bit 0 of the tertiary processor-based VM-execution controls. This patch enables the tertiary VM-execution controls. Tracked-On: #5695 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-02-03 13:54:45 +08:00
Shuo A Liu	e9247dbca0	hv: keylocker: Simulate CPUID of keylocker caps for guest VM KeyLocker is a new security feature available in new Intel CPUs that protects data-encryption keys for the Advanced Encryption Standard (AES) algorithm. This patch emulates Keylocker CPUID leaf 19H to support Keylocker feature for guest VM. To make the hypervisor being able to manage the IWKey correctly, this patch doesn't expose hardware random IWKey capability (CPUID.0x19.ECX[1]) to guest VM. Tracked-On: #5695 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-02-03 13:54:45 +08:00
Shuo A Liu	15c967ad34	hv: keylocker: Add CR4 bit CR4_KL as CR4_TRAP_AND_PASSTHRU_BITS Bit19 (CR4_KL) of CR4 is CPU KeyLocker feature enable bit. Hypervisor traps the bit's writing to track the keylocker feature on/off of guest. While the bit is set by guest, - set cr4_kl_enabled to indicate the vcpu's keylocker feature enabled status - load vcpu's IWKey in host (will add in later patch) While the bit is clear by guest, - clear cr4_kl_enabled This patch trap and passthru the CR4_KL bit to guest for operation. Tracked-On: #5695 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-02-03 13:54:45 +08:00
Li Fei1	94a980c923	hv: hypercall: prevent sos can touch hv/pre-launched VM resource Current implementation, SOS may allocate the memory region belonging to hypervisor/pre-launched VM to a post-launched VM. Because it only verifies the start address rather than the entire memory region. This patch verifies the validity of the entire memory region before allocating to a post-launched VM so that the specified memory can only be allocated to a post-launched VM if the entire memory region is mapped in SOS’s EPT. Tracked-On: #5555 Signed-off-by: Li Fei1 <fei1.li@intel.com> Reviewed-by: Yonghua Huang <yonghua.huang@intel.com>	2021-02-02 16:55:40 +08:00
Yonghua Huang	8bec63a6ea	hv: remove the hardcoding of Software SRAM GPA base Currently, we hardcode the GPA base of Software SRAM to an address that is derived from TGL platform, as this GPA is identical with HPA for Pre-launch VM, This hardcoded address may not work on other platforms if the HPA bases of Software SRAM are different. Now, Offline tool configures above GPA based on the detection of Software SRAM on specific platform. This patch removes the hardcoding GPA of Software SRAM, and also renames MACRO 'SOFTWARE_SRAM_BASE_GPA' to 'PRE_RTVM_SW_SRAM_BASE_GPA' to avoid confusing, as it is for Prelaunch VM only. Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-01-30 13:41:02 +08:00
Yonghua Huang	c9ca23d268	hv: refine RTCM initialization code - RTCM is initialized in hypervisor only if RTCM binaries are detected. - Remove address space of RTCM binary from Software SRAM region. - Refine parse_rtct() function, validity of ACPI RTCT table shall be checked by caller. Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-01-28 11:29:25 +08:00
Yonghua Huang	a6e666dbe7	hv: remove hardcoding of SW SRAM HPA base Physical address to SW SRAM region maybe different on different platforms, this hardcoded address may result in address mismatch for SW SRAM operations. This patch removes above hardcoded address and uses the physical address parsed from native RTCT. Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-01-28 11:29:25 +08:00
Yonghua Huang	a6420e8cfa	hv: cleanup legacy terminologies in RTCM module This patch updates below terminologies according to the latest TCC Spec: PTCT -> RTCT PTCM -> RTCM pSRAM -> Software SRAM Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2021-01-28 11:29:25 +08:00
Yonghua Huang	806f479108	hv: rename RTCM source files 'ptcm' and 'ptct' are legacy name according to the latest TCC spec, hence rename below files to avoid confusing: ptcm.c -> rtcm.c ptcm.h -> rtcm.h ptct.h -> rtct.h Tracked-On: #5649 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2021-01-28 11:29:25 +08:00
Liang Yi	e8a76868c9	hv: modularization: remove global variable efiloader_sig. Simplify multiboot API by removing the global variable efiloader_sig. Replaced by constant at the use site. Tracked-On: #5661 Signed-off-by: Yi Liang <yi.liang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-01-27 15:59:47 +08:00
Liang Yi	67926cee81	hv: modularization: remove include/boot.h. Remove include/boot.h since it contains only assembly variables that should only be accessed in arch/x86/init.c. Tracked-On: #5661 Signed-off-by: Yi Liang <yi.liang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-01-27 15:59:47 +08:00
Liang Yi	1de396363f	hv: modularization: avoid dependency of multiboot on zeropage.h. Split off definition of "struct efi_info" into a separate header file lib/efi.h. Tracked-On: #5661 Signed-off-by: Jason Chen CJ <jason.cj.chen@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-01-27 15:59:47 +08:00
Liang Yi	681688fbe4	hv: modularization: change of multiboot API. The init_multiboot_info() and sanitize_multiboot_ifno() APIs now require parameters instead of implicitly relying on global boot variables. Tracked-On: #5661 Signed-off-by: Vijay Dhanraj <vijay.dhanraj@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-01-27 15:59:47 +08:00
Liang Yi	66599e0aa7	hv: modularization: multiboot Calling sanitize_multiboot() from init.c instead of cpu.c. Tracked-On: #5661 Signed-off-by: Vijay Dhanraj <vijay.dhanraj@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-01-27 15:59:47 +08:00
Liang Yi	c23e557a18	hv: modularization: make parse_hv_cmdline() an internal function. This way, we void exposing acrn_mbi as a global variable. Tracked-On: #5661 Signed-off-by: Vijay Dhanraj <vijay.dhanraj@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-01-27 15:59:47 +08:00
Liang Yi	8f9ec59a53	hv: modularization: cleanup boot.h Move multiboot specific declarations from boot.h to multiboot.h. Tracked-On: #5661 Signed-off-by: Vijay Dhanraj <vijay.dhanraj@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-01-27 15:59:47 +08:00
Jie Deng	5c5d272358	hv: remove bitmap_clear_lock of split-lock after completing emulation When "signal_event" is called, "wait_event" will actually not block. So it is ok to remove this line. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com>	2021-01-13 15:32:27 +08:00
Yin Fengwei	ef411d4ac3	hv: ptirq: Shouldn't change sid if intx irq mapping was added Now, we use hash table to maintain intx irq mapping by using the key generated from sid. So once the entry is added,we can not update source ide any more. Otherwise, we can't locate the entry with the key generated from new source ide. For source id change, remove_remapping/add_remapping is used instead of update source id directly if entry was added already. Tracked-On: #5640 Signed-off-by: Yin Fengwei <fengwei.yin@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-01-12 15:23:44 +08:00
Jie Deng	8aebf5526f	hv: move split-lock logic into dedicated file This patch move the split-lock logic into dedicated file to reduce LOC. This may make the logic more clear. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com>	2021-01-08 17:37:20 +08:00
Jie Deng	27d5711b62	hv: add a cache register for VMX_PROC_VM_EXEC_CONTROLS This patch adds a cache register for VMX_PROC_VM_EXEC_CONTROLS to avoid the frequent VMCS access. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com>	2021-01-08 17:37:20 +08:00
Jie Deng	f291997811	hv: split-lock: using MTF instead of TF(#DB) The TF is visible to guest which may be modified by the guest, so it is not a safe method to emulate the split-lock. While MTF is specifically designed for single-stepping in x86/Intel hardware virtualization VT-x technology which is invisible to the guest. Use MTF to single step the VCPU during the emulation of split lock. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com>	2021-01-08 17:37:20 +08:00
Jie Deng	6852438e3a	hv: Support concurrent split-lock emulation on SMP. For a SMP guest, split-lock check may happen on multiple vCPUs simultaneously. In this case, one vCPU at most can be allowed running in the split-lock emulation window. And if the vCPU is doing the emulation, it should never be blocked in the hypervisor, it should go back to the guest to execute the lock instruction immediately and trap back to the hypervisor with #DB to complete the split-lock emulation. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com>	2021-01-08 17:37:20 +08:00
Li Fei1	0b18389d95	hv: vcpuid: expose mce feature to guest Windows64 seems only support processor which has MCE (Machine Check Error) feature. Tracked-On: #5638 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-01-08 17:22:34 +08:00
Jie Deng	b14c32a110	hv: Retain RIP only for fault exception. We have trapped the #DB for split-lock emulation. Only fault exception need RIP being retained. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2020-12-31 11:12:33 +08:00
Jie Deng	977e862192	hv: Add split-lock emulation for xchg xchg may also cause the #AC for split-lock check. This patch adds this emulation. 1. Kick other vcpus of the guest to stop execution if the guest has more than one vcpu. 2. Emulate the xchg instruction. 3. Notify other vcpus (if any) to restart execution. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2020-12-31 11:12:33 +08:00
Jie Deng	47e193a7bb	hv: Add split-lock emulation for LOCK prefix instruction This patch adds the split-lock emulation. If a #AC is caused by instruction with LOCK prefix then emulate it, otherwise, inject it back as it used to be. 1. Kick other vcpus of the guest to stop execution and set the TF flag to have #DB if the guest has more than one vcpu. 2. Skip over the LOCK prefix and resume the current vcpu back to guest for execution. 3. Notify other vcpus to restart exception at the end of handling the #DB since we have completed the LOCK prefix instruction emulation. Tracked-On: #5605 Signed-off-by: Jie Deng <jie.deng@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2020-12-31 11:12:33 +08:00
Yonghua Huang	643bbcfe34	hv: check the availability of guest CR4 features Check hardware support for all features in CR4, and hide bits from guest by vcpuid if they're not supported for guests OS. Tracked-On: #5586 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2020-12-18 11:21:22 +08:00
Yonghua Huang	442fc30117	hv: refine virtualization flow for cr0 and cr4 - The current code to virtualize CR0/CR4 is not well designed, and hard to read. This patch reshuffle the logic to make it clear and classify those bits into PASSTHRU, TRAP_AND_PASSTHRU, TRAP_AND_EMULATE & reserved bits. Tracked-On: #5586 Signed-off-by: Eddie Dong <eddie.dong@intel.com> Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2020-12-18 11:21:22 +08:00
Yonghua Huang	08c42f91c9	hv: rename hypercall for hv-emulated device management Coding style cleanup, use add/remove instead of create/destroy. Tracked-On: #5586 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2020-12-07 16:25:17 +08:00
Shiqing Gao	6f10bd00bf	hv: coding style clean-up related to Boolean While following two styles are both correct, the 2nd one is simpler. bool is_level_triggered; 1. if (is_level_triggered == true) {...} 2. if (is_level_triggered) {...} This patch cleans up the style in hypervisor. Tracked-On: #861 Signed-off-by: Shiqing Gao <shiqing.gao@intel.com>	2020-11-28 14:51:32 +08:00
Junming Liu	1cd932e568	hv: refine code style refine code style Tracked-On: #4020 Signed-off-by: Junming Liu <junming.liu@intel.com>	2020-11-26 12:56:28 +08:00
Junming Liu	56eb859ea4	hv: vmexit: refine xsetbv_vmexit_handler API From SDM Vol.2C - XSETBV instruction description, If CR4.OSXSAVE[bit 18] = 0, execute "XSETBV" instruction will generate #UD exception. From SDM Vol.3C 25.1.1,#UD exception has priority over VM exits, So if vCPU execute "XSETBV" instruction when CR4.OSXSAVE[bit 18] = 0, VM exits won't happen. While hv inject #GP if vCPU execute "XSETBV" instruction when CR4.OSXSAVE[bit 18] = 0. It's a wrong behavior, this patch will fix the bug. Tracked-On: #4020 Signed-off-by: Junming Liu <junming.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2020-11-26 12:56:28 +08:00
Peter Fang	68dc8d9f8f	hv: pm: avoid duplicate shutdowns on RTVM It is possible for more than one vCPUs to trigger shutdown on an RTVM. We need to avoid entering VM_READY_TO_POWEROFF state again after the RTVM has been paused or shut down. Also, make sure an RTVM enters VM_READY_TO_POWEROFF state before it can be paused. v1 -> v2: - rename to poweroff_if_rt_vm for better clarity Tracked-On: #5411 Signed-off-by: Peter Fang <peter.fang@intel.com>	2020-11-11 14:05:39 +08:00
dongshen	ca5683f78d	hv: add support for shutdown for pre-launched VMs Currently, ACRN only support shutdown when triple fault happens, because ACRN doesn't present/emulate a virtual HW, i.e. port IO, to support shutdown. This patch emulate a virtual shutdown component, and the vACPI method for guest OS to use. Pre-launched VM uses ACPI reduced HW mode, intercept the virtual sleep control/status registers for pre-launched VMs shutdown Tracked-On: #5411 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com>	2020-11-04 10:33:31 +08:00
dongshen	8f79ceefbd	hv: fix out-of-date comments related to pre-launched VMs rebooting Like post-launched VMs, for pre-launched VMs, the ACPI reset register is also fixed at 0xcf9 and the reset value is 0xE, so pre-launched VMs now also use ACPI reset register for rebooting. Tracked-On: #5411 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com>	2020-11-04 10:33:31 +08:00
Peter Fang	70b1218952	hv: pm: support shutting down multiple VMs when pCPUs are shared More than one VM may request shutdown on the same pCPU before shutdown_vm_from_idle() is called in the idle thread when pCPUs are shared among VMs. Use a per-pCPU bitmap to store all the VMIDs requesting shutdown. v1 -> v2: - use vm_lock to avoid a race on shutdown Tracked-On: #5411 Signed-off-by: Peter Fang <peter.fang@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2020-11-04 10:33:31 +08:00
Li Fei1	c6f9404f55	hv: psram: add kconfig to enable psram Add two Kconfig pSRAM config: one for whether to enable the pSRAM on the platfrom or not; another for if the pSRAM is enabled on the platform whether to enable the pSRAM in the pre-launched RTVM. If we enable the pSRAM on the platform, we should remove the pSRAM EPT mapping from the SOS to prevent it could flush the pSRAM cache. Tracked-On: #5330 Signed-off-by: Qian Wang <qian1.wang@intel.com>	2020-11-02 15:56:30 +08:00

1 2 3 4 5 ...

2118 Commits