acrn-hypervisor

Commit Graph

Author	SHA1	Message	Date
Jie Deng	e97b171ca2	hv: tee: x86_tee interrupt support Secure interrupt (interrupt belongs to TEE) comes when TEE vcpu is running, the interrupt will be injected to TEE directly. But when REE vcpu is running at that time, we need to switch to TEE for handling. Non-Secure interrupt (interrupt belongs to REE) comes when REE vcpu is running, the interrupt will be injected to REE directly. But when TEE vcpu is running at that time, we need to inject a predefined vector to TEE for notification and continue to switch back to TEE for running. To sum up, when secure interrupt comes, switch to TEE immediately regardless of whether REE is running or not; when non-Secure interrupt comes and TEE is running, just notify the TEE and keep it running, TEE will switch to REE on its own initiative after completing its work. Tracked-On: projectacrn#6571 Signed-off-by: Jie Deng <jie.deng@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-11-17 15:25:14 +08:00
Jie Deng	314d9ca8af	hv: tee: implement the x86_tee hypercalls This patch implements the following x86_tee hypercalls, - HC_TEE_VCPU_BOOT_DONE - HC_SWITCH_EE Tracked-On: #6571 Signed-off-by: Jie Deng <jie.deng@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-11-17 15:25:14 +08:00
Jie Deng	3c9c41b656	hv: tee: add x86_tee hypercall interfaces This patch adds the x86_tee hypercall interfaces. - HC_TEE_VCPU_BOOT_DONE This hypercall is used to notify the hypervisor that the TEE VCPU Boot is done, so that we can sleep the corresponding TEE VCPU. REE will be started at the last time this hypercall is called by TEE. - HC_SWITCH_EE For REE VM, it uses this hypercall to request TEE service. For TEE VM, it uses this hypercall to switch back to REE when it completes the REE service. Tracked-On: #6571 Signed-off-by: Jie Deng <jie.deng@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-11-17 15:25:14 +08:00
Jie Deng	f3792a74a3	hv: tee: add TEE VM memmap support TEE is a secure VM which has its own partitioned resources while REE is a normal VM which owns the rest of platform resources. The TEE, as a secure world, it can see the memory of the REE VM, also known as normal world, but not the other way around. But please note, TEE and REE can only see their own devices. So this patch does the following things: 1. go through physical e820 table, to ept add all system memory entries. 2. remove hv owned memory. Tracked-On: #6571 Signed-off-by: Jie Deng <jie.deng@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-11-17 15:25:14 +08:00
Jie Deng	0b1418d395	hv: tee: add an API for creating identical memmap according to e820 Given an e820, this API creates an identical memmap for specified e820 memory type, EPT memory cache type and access right. Tracked-On: #6571 Signed-off-by: Jie Deng <jie.deng@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-11-17 15:25:14 +08:00
Mingqiang Chi	bb0327e700	hv: remove UUID With current arch design the UUID is used to identify ACRN VMs, all VM configurations must be deployed with given UUIDs at build time. For post-launched VMs, end user must use UUID as acrn-dm parameter to launch specified user VM. This is not friendly for end users that they have to look up the pre-configured UUID before launching VM, and then can only launch the VM which its UUID in the pre-configured UUID list,otherwise the launch will fail.Another side, VM name is much straight forward for end user to identify VMs, whereas the VM name defined in launch script has not been passed to hypervisor VM configuration so it is not consistent with the VM name when user list VM in hypervisor shell, this would confuse user a lot. This patch will resolve these issues by removing UUID as VM identifier and use VM name instead: 1. Hypervisor will check the VM name duplication during VM creation time to make sure the VM name is unique. 2. If the VM name passed from acrn-dm matches one of pre-configured VM configurations, the corresponding VM will be launched, we call it static configured VM. If there is no matching found, hypervisor will try to allocate one unused VM configuration slot for this VM with given VM name and get it run if VM number does not reach CONFIG_MAX_VM_NUM, we will call it dynamic configured VM. 3. For dynamic configured VMs, we need a guest flag to identify them because the VM configuration need to be destroyed when it is shutdown or creation failed. v7->v8: -- rename is_static_vm_configured to is_static_configured_vm -- only set DM owned guest_flags in hcall_create_vm -- add check dynamic flag in get_unused_vmid v6->v7: -- refine get_vmid_by_name, return the first matching vm_id -- the GUEST_FLAG_STATIC_VM is added to identify the static or dynamic VM, the offline tool will set this flag for all the pre-defined VMs. -- only clear name field for dynamic VM instead of clear entire vm_config Tracked-On: #6685 Signed-off-by: Mingqiang Chi <mingqiang.chi@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com> Reviewed-by: Victor Sun<victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-16 14:42:59 +08:00
Chenli Wei	022df1fb2e	hv: align the MAX_IR_ENTRIES to MAX_PT_IRQ_ENTRIES The CONFIG_MAX_IR_ENTRIES and CONFIG_MAX_PT_IRQ_ENTRIES are separate configuration items, and they can be configured through configuration tool When the number of PT irq entries are more than IR entries, then some passthrough devices' irqs may failed to be protected by interrupt remapping or automatically injected by post-interrupt mechanism. And it waste memory if the CONFIG_MAX_IR_ENTRIES is larger. This patch replace the CONFIG_MAX_IR_ENTRIES to MAX_IR_ENTRIES and enforce it align to CONFIG_PT_IRQ_ENTRIES and round up to > 2^n as the IRTA_REG spec.This way can enforce all PT irqs works with IR or PI mechanism. Tracked-On: #6745 Signed-off-by: Chenli Wei <chenli.wei@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com>	2021-11-15 09:00:27 +08:00
Yuanyuan Zhao	c111dd2e2c	hv : encapsulate page align in e820_alloc_memory e820_alloc_memory requires 4k alignment, so conversion to size is encapsulated in the function. And then the pre-condition of `size_arg` is removed. Tracked-On: #6805 Signed-off-by: Yuanyuan Zhao <yuanyuan.zhao@linux.intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-12 11:56:03 +08:00
Yuanyuan Zhao	4f6aa38ea5	hv: remove CONFIG_LOW_RAM_SIZE The CONFIG_LOW_RAM_SIZE is used to describe the size of trampoline code that is never changed. And it totally confused user to configure it. This patch hard code it to 1MB and remove the macro for configuration. In the trampoline related code, use ld_trampoline_end and ld_trampoline_start symbol to calculate the real size. Tracked-On: #6805 Signed-off-by: Yuanyuan Zhao <yuanyuan.zhao@linux.intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-12 11:56:03 +08:00
Shiqing Gao	7bbd17ce80	hv: initialize and save/restore IA32_TSC_AUX MSR for guest Commit `cbf3825` "hv: Pass-through IA32_TSC_AUX MSR to L1 guest" lets guest own the physical MSR IA32_TSC_AUX and does not handle this MSR in the hypervisor. If multiple vCPUs share the same pCPU, when one vCPU reads MSR IA32_TSC_AUX, it may get the value set by other vCPUs. To fix this issue, this patch does: - initialize the MSR content to 0 for the given vCPU, which is consistent with the value specified in SDM Vol3 "Table 9-1. IA-32 and Intel 64 Processor States Following Power-up, Reset, or INIT" - save/restore the MSR content for the given vCPU during context switch v1 -> v2: * According to Table 9-1, the content of IA32_TSC_AUX MSR is unchanged following INIT, v2 updates the initialization logic so that the content for vCPU is consistent with SDM. Tracked-On: #6799 Signed-off-by: Shiqing Gao <shiqing.gao@intel.com> Reviewed-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-12 09:30:12 +08:00
Zhou, Wu	1bc25ed198	HV: refine the ve820 tab for pre-VMs This patch moves the ssram area in ve820 tab, and reunites the hpa1_low_part1/2 areas. The ve820 building code is refined. before: \|<---low_1M--->\| \|<---hpa1_low_part1--->\| \|<---SSRAM--->\| \|<---hpa1_low_part2--->\| \|<---GPU_OpRegion--->\| \|<---ACPI DATA--->\| \|<---ACPI NVS--->\| ---2G--- after: \|<---low_1M--->\| \|<---hpa_low--->\| \|<---SSRAM--->\| \|<---GPU_OpRegion--->\| \|<---ACPI DATA--->\| \|<---ACPI NVS--->\| ---2G--- The SSRAM area's address is described in the ACPI's RTCT/PTCT table. To simplify the SSRAM implementation, SSRAM area was identical mapped to GPA, and resulted in the divition of hpa_low. Then the ve820 building logic became too complicated. Now we managed to edit the guest's RTCT/PTCT table by offline tools in the former patch, so we can move the guest's SSRAM area, and reunite the hpa_low areas again. After doing this, this patch rewrites the ve820 building code in a much simpler way. Tracked-On: #6674 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Reviewed-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com>	2021-11-08 13:13:14 +08:00
Zhou, Wu	f1f6fe11c1	HV: move the ve820 GPU OpRegion address The ve820 table' hpa1_low area is divided into two parts, which is making the code too complicated and causing problems. Moving the entries that divides the hpa1_low could make things easier. This patch moves the GPU OpRegion to the tail area of 2G, consecutive to the acpi data/nvs area. before: \|<---low_1M--->\| \|<---hpa1_low_part1--->\| \|<---SSRAM--->\| \|<---GPU_OpRegion--->\| \|<---hpa1_low_part2--->\| \|<---ACPI DATA--->\| \|<---ACPI NVS--->\| ---2G--- after: \|<---low_1M--->\| \|<---hpa1_low_part1--->\| \|<---SSRAM--->\| \|<---hpa1_low_part2--->\| \|<---GPU_OpRegion--->\| \|<---ACPI DATA--->\| \|<---ACPI NVS--->\| ---2G--- Tracked-On: #6674 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Reviewed-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com>	2021-11-08 13:13:14 +08:00
Zhou, Wu	e00421d5be	HV: Fix the problems in ve820 acpi area The length of the ACPI data entry in ve820 tab was 960K, while the ACPI file is 1MB. It causes the ACPI file copy failed due to reserved ACPI regions in ve820 table is not enough when loading pre-launched VMs. This patch changes ACPI data area to 1MB to fix the problem. And the ACPI data length was missed when calculating ENTRY_HPA1_LOW_PART2 length. Fixed here too. Also adds some refinement to the hard-coded ACPI base/addr definations Tracked-On: #6674 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Reviewed-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Wang, Yu1 <yu1.wang@intel.com>	2021-11-08 13:13:14 +08:00
Junjie Mao	83a938bae6	HV: treewide: fix violations of coding guideline C-TY-27 & C-TY-28 The coding guideline rules C-TY-27 and C-TY-28, combined, requires that assignment and arithmetic operations shall be applied only on operands of the same kind. This patch either adds explicit type casts or adjust types of variables to align the types of operands. The only semantic change introduced by this patch is the promotion of the second argument of set_vmcs_bit() and clear_vmcs_bit() to uint64_t (formerly uint32_t). This avoids clear_vmcs_bit() to accidentally clears the upper 32 bits of the requested VMCS field. Other than that, this patch has no semantic change. Specifically this patch is not meant to fix buggy narrowing operations, only to make these operations explicit. Tracked-On: #6776 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-04 18:15:47 +08:00
Junjie Mao	2c86795fa8	HV: arch: fix a violation of coding guideline C-TY-24 The coding guideline rule C-TY-24 requires that 'cast shall not be performed on a function pointer'. This patch removes a duplicated explicit cast on timer_expired_handler in tsc_deadline_timer.c. This patch has no semantic impacts. Tracked-On: #6776 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-04 18:15:47 +08:00
Junjie Mao	9781873e77	HV: treewide: fix violations of coding guideline C-TY-12 The coding guideline rule C-TY-12 requires that 'all type conversions shall be explicit'. Especially implicit cases on the signedness of variables shall be avoided. This patch either adds explicit type casts or adjust local variable types to make sure that Booleans, signed and unsigned integers are not used mixedly. This patch has no semantic changes. Tracked-On: #6776 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-04 18:15:47 +08:00
Junjie Mao	db20e277b6	HV: treewide: fix violations of coding guideline C-TY-02 The coding guideline rule C-TY-02 requires that 'the operands of bit operations shall be unsigned'. This patch adds explicit casts or literal suffixes to make explicit the type of values involved in bit operations. Explicit casts to widen integers before shift operations are also introduced to make explicit that the variables are expanded BEFORE it is shifted (which is already so in C99 but implicitly). This patch has no semantic changes. Tracked-On: #6776 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-04 18:15:47 +08:00
Junjie Mao	fba343bd05	HV: treewide: fix violations of coding guideline C-FN-06 The coding guideline rule C-FN-06 requires that 'a parameter passed by value to a function shall not be modified directly'. This patch rewrites two functions which does modify its parameters today. This patch has no semantic impact. Tracked-On: #6776 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-04 18:15:47 +08:00
Junjie Mao	3f3f4be642	HV: treewide: fix violations of coding guideline C-EP-05 The coding guideline rule C-EP-05 requires that 'parentheses shall be used to set the operator precedence explicitly'. This patch adds the missing parentheses detected by the static analyzer. Tracked-On: #6776 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-04 18:15:47 +08:00
Junjie Mao	4cf6c288cd	HV: treewide: fix warnings raised by Clang This patch fixes the following warnings detected by the LLVM/Clang compiler: 1. Unused static functions in C sources, which are fixed by explicitly tagging them with __unused 2. Duplicated parentheses around branch conditions 3. Assigning 64-bit constants to 32-bit variables, which is fixed by promoting the variables to uint64_t 4. Using { '\0' } to zero-fill an array, which is fixed by replacing it with { 0 } 5. Taking a bit out of a variable using && (which should be & instead) Most changes do not have a semantic impact, except item 5 which is probably a real code issue. Tracked-On: #6776 Signed-off-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-11-04 18:15:47 +08:00
Yifan Liu	10963b04d1	hv: Fix vcpu signaling racing problem in lock instruction emulation In lock instruction emulation, we use vcpu_make_request and signal_event pairs to shoot down/release other vcpus. However, vcpu_make_request is async and does not guarantee an execution of wait_event on target vcpu, and we want wait_event to be consistent with signal_event. Consider following scenarios: 1, When target vcpu's state has not yet turned to VCPU_RUNNING, vcpu_make_request on ACRN_REQUEST_SPLIT_LOCK does not make sense, and will not result in wait_event. 2, When target vcpu is already requested on ACRN_REQUEST_SPLIT_LOCK (i.e., the corresponding bit in pending_req is set) but not yet handled, the vcpu_make_request call does not result in wait_event as 1 bit is not enough to cache multiple requests. This patch tries to add checks in vcpu_kick_lock_instr_emulation and vcpu_complete_lock_instr_emulation to resolve these issues. Tracked-On: #6502 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-11-02 15:01:20 +08:00
Liu Long	3f4ea38158	ACRN: misc: Unify terminology for service vm/user vm Rename SOS_VM type to SERVICE_VM rename UOS to User VM in XML description rename uos_thread_pid to user_vm_thread_pid rename devname_uos to devname_user_vm rename uosid to user_vmid rename UOS_ACK to USER_VM_ACK rename SOS_VM_CONFIG_CPU_AFFINITY to SERVICE_VM_CONFIG_CPU_AFFINITY rename SOS_COM to SERVICE_VM_COM rename SOS_UART1_VALID_NUM" to SERVICE_VM_UART1_VALID_NUM rename SOS_BOOTARGS_DIFF to SERVICE_VM_BOOTARGS_DIFF rename uos to user_vm in launch script and xml Tracked-On: #6744 Signed-off-by: Liu Long <long.liu@linux.intel.com> Reviewed-by: Geoffroy Van Cutsem <geoffroy.vancutsem@intel.com>	2021-11-02 10:00:55 +08:00
Liu Long	14c6e21efa	ACRN: misc: Unify terminology for sos/uos rin macro Rename SOS_VM_NUM to SERVICE_VM_NUM. rename SOS_SOCKET_PORT to SERVICE_VM_SOCKET_PORT. rename PROCESS_RUN_IN_SOS to PROCESS_RUN_IN_SERVICE_VM. rename PCI_DEV_TYPE_SOSEMUL to PCI_DEV_TYPE_SERVICE_VM_EMUL. rename SHUTDOWN_REQ_FROM_SOS to SHUTDOWN_REQ_FROM_SERVICE_VM. rename PROCESS_RUN_IN_SOS to PROCESS_RUN_IN_SERVICE_VM. rename SHUTDOWN_REQ_FROM_UOS to SHUTDOWN_REQ_FROM_USER_VM. rename UOS_SOCKET_PORT to USER_VM_SOCKET_PORT. rename SOS_CONSOLE to SERVICE_VM_OS_CONSOLE. rename SOS_LCS_SOCK to SERVICE_VM_LCS_SOCK. rename SOS_VM_BOOTARGS to SERVICE_VM_OS_BOOTARGS. rename SOS_ROOTFS to SERVICE_VM_ROOTFS. rename SOS_IDLE to SERVICE_VM_IDLE. rename SEVERITY_SOS to SEVERITY_SERVICE_VM. rename SOS_VM_UUID to SERVICE_VM_UUID. rename SOS_REQ to SERVICE_VM_REQ. rename RTCT_NATIVE_FILE_PATH_IN_SOS to RTCT_NATIVE_FILE_PATH_IN_SERVICE_VM. rename CBC_REQ_T_UOS_ACTIVE to CBC_REQ_T_USER_VM_ACTIVE. rename CBC_REQ_T_UOS_INACTIVE to CBC_REQ_T_USER_VM_INACTIV. rename uos_active to user_vm_active. Tracked-On: #6744 Signed-off-by: Liu Long <long.liu@linux.intel.com> Reviewed-by: Geoffroy Van Cutsem <geoffroy.vancutsem@intel.com>	2021-11-02 10:00:55 +08:00
Liu Long	e9c4ced460	ACRN: hv: Unify terminology for user vm Rename gpa_uos to gpa_user_vm rename base_gpa_in_uos to base_gpa_in_user_vm rename UOS_VIRT_PCI_MMCFG_BASE to USER_VM_VIRT_PCI_MMCFG_BASE rename UOS_VIRT_PCI_MMCFG_START_BUS to USER_VM_VIRT_PCI_MMCFG_START_BUS rename UOS_VIRT_PCI_MMCFG_END_BUS to USER_VM_VIRT_PCI_MMCFG_END_BUS rename UOS_VIRT_PCI_MEMBASE32 to USER_VM_VIRT_PCI_MEMBASE32 rename UOS_VIRT_PCI_MEMLIMIT32 to USER_VM_VIRT_PCI_MEMLIMIT32 rename UOS_VIRT_PCI_MEMBASE64 to USER_VM_VIRT_PCI_MEMBASE64 rename UOS_VIRT_PCI_MEMLIMIT64 to USER_VM_VIRT_PCI_MEMLIMIT64 rename UOS in comments message to User VM. Tracked-On: #6744 Signed-off-by: Liu Long <long.liu@linux.intel.com> Reviewed-by: Geoffroy Van Cutsem <geoffroy.vancutsem@intel.com>	2021-11-02 10:00:55 +08:00
Liu Long	92b7d6a9a3	ACRN: hv: Terminology modification in hv code Rename sos_vm to service_vm. rename sos_vmid to service_vmid. rename sos_vm_ptr to service_vm_ptr. rename get_sos_vm to get_service_vm. rename sos_vm_gpa to service_vm_gpa. rename sos_vm_e820 to service_vm_e820. rename sos_efi_info to service_vm_efi_info. rename sos_vm_config to service_vm_config. rename sos_vm_hpa2gpa to service_vm_hpa2gpa. rename vdev_in_sos to vdev_in_service_vm. rename create_sos_vm_e820 to create_service_vm_e820. rename sos_high64_max_ram to service_vm_high64_max_ram. rename prepare_sos_vm_memmap to prepare_service_vm_memmap. rename post_uos_sworld_memory to post_user_vm_sworld_memory rename hcall_sos_offline_cpu to hcall_service_vm_offline_cpu. rename filter_mem_from_sos_e820 to filter_mem_from_service_vm_e820. rename create_sos_vm_efi_mmap_desc to create_service_vm_efi_mmap_desc. rename HC_SOS_OFFLINE_CPU to HC_SERVICE_VM_OFFLINE_CPU. rename SOS to Service VM in comments message. Tracked-On: #6744 Signed-off-by: Liu Long <long.liu@linux.intel.com> Reviewed-by: Geoffroy Van Cutsem <geoffroy.vancutsem@intel.com>	2021-11-02 10:00:55 +08:00
Liu Long	26e507a06e	ACRN: hv: Unify terminology for service vm Rename is_sos_vm to is_service_vm Tracked-On: #6744 Signed-off-by: Liu Long <longliu@intel.com>	2021-11-02 10:00:55 +08:00
dongshen	dcafcadaf9	hv: rename some C preprocessor macros Rename some C preprocessor macros: NUM_GUEST_MSRS --> NUM_EMULATED_MSRS CAT_MSR_START_INDEX --> FLEXIBLE_MSR_INDEX NUM_VCAT_MSRS --> NUM_CAT_MSRS NUM_VCAT_L2_MSRS --> NUM_CAT_L2_MSRS NUM_VCAT_L3_MSRS --> NUM_CAT_L3_MSRS Tracked-On: #5917 Signed-off-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-28 19:12:29 +08:00
dongshen	c0d95558c1	hv: vCAT: propagate vCBM to other vCPUs that share cache with vcpu Implement the propagate_vcbm() function: Set vCBM to to all the vCPUs that share cache with vcpu to mimic hardware CAT behavior Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-28 19:12:29 +08:00
dongshen	a7014f4654	hv: vCAT: implementing the vCAT MSRs write handler Implement the write_vcbm() function to handle the MSR_IA32_type_MASK_n vCBM MSRs write request Call write_vclosid() to handle MSR_IA32_PQR_ASSOC MSR write request Several vCAT P2V (physical to virtual) and V2P (virtual to physical) mappings exist: struct acrn_vm_config *vm_config = get_vm_config(vm_id) max_pcbm = vm_config->max_type_pcbm (type: l2 or l3) mask_shift = ffs64(max_pcbm) vclosid = vmsr - MSR_IA32_type_MASK_0 pclosid = vm_config->pclosids[vclosid] pmsr = MSR_IA32_type_MASK_0 + pclosid pcbm = vcbm << mask_shift vcbm = pcbm >> mask_shift Where MSR_IA32_type_MASK_n: L2 or L3 mask msr address for CLOSIDn, from 0C90H through 0D8FH (inclusive). max_pcbm: a bitmask that selects all the physical cache ways assigned to the VM vclosid: virtual CLOSID, always starts from 0 pclosid: corresponding physical CLOSID for a given vclosid vmsr: virtual msr address, passed to vCAT handlers by the caller functions rdmsr_vmexit_handler()/wrmsr_vmexit_handler() pmsr: physical msr address vcbm: virtual CBM, passed to vCAT handlers by the caller functions rdmsr_vmexit_handler()/wrmsr_vmexit_handler() pcbm: physical CBM Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-28 19:12:29 +08:00
dongshen	3ab50f2ef5	hv: vCAT: implementing the vCAT MSRs read handlers Implement the read_vcbm() and read_vclosid() functions to handle the MSR_IA32_PQR_ASSOC and MSR_IA32_type_MASK_n vCAT MSRs read request. Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-28 19:12:29 +08:00
dongshen	be855d2352	hv: vCAT: expose CAT capabilities to vCAT-enabled VM Expose CAT feature to vCAT VM by reporting the number of cache ways/CLOSIDs via the 04H/10H cpuid instructions, so that the VM can take advantage of CAT to prioritize and partition cache resource for its own tasks. Add the vcat_pcbm_to_vcbm() function to map pcbm to vcbm Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-28 19:12:29 +08:00
dongshen	77ae989379	hv: vCAT: initialize vCAT MSRs during vmcs init Initialize vCBM MSRs Initialize vCLOSID MSR Add some vCAT functions: Retrieve max_vcbm and max_pcbm Check if vCAT is configured or not for the VM Map vclosid to pclosid write_vclosid: vCLOSID MSR write handler write_vcbm: vCBM MSR write handler Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-28 19:12:29 +08:00
Fei Li	1905ed6124	hv: vMSR: minor fix about rdmsr_vmexit_handler Specifying a reserved or unimplemented MSR address in ECX for rdmsr will cause a general protection exception. In this case, we should not change the contents of registers EDX:EAX. Tracked-On: #4550 Signed-off-by: Fei Li <fei1.li@intel.com>	2021-10-27 08:23:43 +08:00
dongshen	39461ef9dd	hv: vCAT: initialize the emulated_guest_msrs array for CAT msrs during platform initialization Initialize the emulated_guest_msrs[] array at runtime for MSR_IA32_type_MASK_n and MSR_IA32_PQR_ASSOC msrs, there is no good way to do this initialization statically at build time Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-26 11:48:27 +08:00
dongshen	cb2bb78b6f	hv/config_tools: amend the struct acrn_vm_config to make it compatible with vCAT For vCAT, it may need to store more than MAX_VCPUS_PER_VM of closids, change clos in vm_config.h to a pointer to accommodate this situation Rename clos to pclosids pclosids now is a pointer to an array of physical CLOSIDs that is defined in vm_configurations.c by vmconfig. The number of elements in the array must be equal to the value given by num_pclosids Add max_type_pcbm (type: l2 or l3) to struct acrn_vm_config, which stores a bitmask that selects/covers all the physical cache ways assigned to the VM Change vmsr.c to accommodate this amended data structure Change the config-tools to generate vm_configurations.c, and fill in the num_closids and clos pointers based on the information from the scenario file. Now vm_configurations.c.xsl generates all the clos related code so remove the same code from misc_cfg.h.xsl. Examples: Scenario file: <RDT> <RDT_ENABLED>y</RDT_ENABLED> <CDP_ENABLED>n</CDP_ENABLED> <VCAT_ENABLED>y</VCAT_ENABLED> <CLOS_MASK>0x7ff</CLOS_MASK> <CLOS_MASK>0x7ff</CLOS_MASK> <CLOS_MASK>0x7ff</CLOS_MASK> <CLOS_MASK>0xff800</CLOS_MASK> <CLOS_MASK>0xff800</CLOS_MASK> <CLOS_MASK>0xff800</CLOS_MASK> <CLOS_MASK>0xff800</CLOS_MASK> <CLOS_MASK>0xff800</CLOS_MASK> /RDT> <vm id="0"> <guest_flags> <guest_flag>GUEST_FLAG_VCAT_ENABLED</guest_flag> </guest_flags> <clos> <vcpu_clos>3</vcpu_clos> <vcpu_clos>4</vcpu_clos> <vcpu_clos>5</vcpu_clos> <vcpu_clos>6</vcpu_clos> <vcpu_clos>7</vcpu_clos> </clos> </vm> <vm id="1"> <clos> <vcpu_clos>1</vcpu_clos> <vcpu_clos>2</vcpu_clos> </clos> </vm> vm_configurations.c (generated by config-tools) with the above vCAT config: static uint16_t vm0_vcpu_clos[5U] = {3U, 4U, 5U, 6U, 7U}; static uint16_t vm1_vcpu_clos[2U] = {1U, 2U}; struct acrn_vm_config vm_configs[CONFIG_MAX_VM_NUM] = { { .guest_flags = (GUEST_FLAG_VCAT_ENABLED), .pclosids = vm0_vcpu_clos, .num_pclosids = 5U, .max_l3_pcbm = 0xff800U, }, { .pclosids = vm1_vcpu_clos, .num_pclosids = 2U, }, }; Tracked-On: #5917 Signed-off-by: dongshen <dongsheng.x.zhang@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-26 11:48:27 +08:00
Yonghua Huang	c8e2060d37	hv: unmap IOMMU register pages from service VM EPT IOMMU hardware resource is owned by hypervisor, while IOMMU capability is reported to service VM in its ACPI table. In this case, Service VM may access IOMMU hardware resource, which is not expected. This patch unmaps all Intel IOMMU register pages for service VM EPT. Tracked-On: #6677 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Reviewed-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-22 09:31:10 +08:00
Fei Li	6c5bf4a642	hv: enhance e820_alloc_memory could allocate memory than 4G Enhance e820_alloc_memory could allocate memory than 4G. Tracked-On: #5830 Signed-off-by: Fei Li <fei1.li@intel.com>	2021-10-14 15:04:36 +08:00
Fei Li	df7ffab441	hv: remove CONFIG_HV_RAM_SIZE It's difficult to configure CONFIG_HV_RAM_SIZE properly at once. This patch not only remove CONFIG_HV_RAM_SIZE, but also we use ld linker script to dynamically get the size of HV RAM size. Tracked-On: #6663 Signed-off-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-14 15:04:36 +08:00
Zide Chen	e48962faa6	hv: optimize run_vcpu() for nested This patch implements a separate path for L2 VMEntry in run_vcpu(), which has several benefits: - keep run_vcpu() clean, to reduce the number of is_vcpu_in_l2_guest() statements: - current code has three is_vcpu_in_l2_guest() already. - supposed to have another 2 statement so that nested VMEntry won't hit the "Starting vCPU" and "vCPU launched" pr_info() and a few other statements in the VM launch path. - save few other things in run_vcpu() that are not needed for nested. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-13 15:55:31 +08:00
Zide Chen	89bbc44962	hv: inject external interrupts only if LAPIC is not passthru Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-08 09:18:34 +08:00
Zide Chen	228b052fdb	hv: operations on vcpu->reg_cached/reg_updated don't need LOCK prefix In run time, one vCPU won't read or write a register on other vCPUs, thus we don't need the LOCK prefixed instructions on reg_cached and reg_updated. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-08 09:11:10 +08:00
Zide Chen	2b683f8f5b	hv: call vcpu_inject_exception() only when ACRN_REQUEST_EXCP is set move the bitmap test call out of vcpu_inject_exception(), then we call the expensive bitmap_test_and_clear_lock() only pending_req_bits is non-zero and call vcpu_inject_exception() only if needed. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-10-07 20:48:43 +08:00
Zide Chen	f801ba4ed7	hv: update guest RIP only if vcpu->arch.inst_len is non zero In very large number of VM extis, the VM-exit instruction length could be zero, and it's no need to update VMX_GUEST_RIP. Some examples: - all external interrupt VM exits in non LAPIC passthru setup. - for all the nested VM-exits that are reflecting to L1 hypervisor. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-07 20:47:07 +08:00
Zide Chen	b7e9a68923	hv: code cleanup in run_vcpu() - wrap a new function exec_vmentry() to reduce code duplication. - remove exec_vmread(VMX_GUEST_RSP) since ACRN doesn't need to know guest RSP in run time. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-07 20:47:07 +08:00
Zide Chen	ee12daff84	hv: nested: refine vmcs12_read/write_field APIs Change "uint64_t vmcs_hva" to "void *vmcs_hva" in the input argument, list, so that no type casting is needed when calling them from pointers. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-10-07 20:45:34 +08:00
Liu,Junming	4105ca2cb4	hv: deny the launch of VM if pass-thru PIO bar isn't identical mapping In current design, when pass-thru dev, for the PIO bar, need to ensure the guest PIO start address equals to host PIO start address. Then set the VMCS io bitmap to pass-thru the corresponding port io to guest for performance. ACRN-DM and acrn-config should ensure the identical mapping of PIO bar. If ACRN-DM or acrn-config failed to achieve this, we should deny the launch of VM Tracked-On: #6508 Signed-off-by: Liu,Junming <junming.liu@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2021-09-28 08:49:01 +08:00
Victor Sun	28824c1e74	HV: init e820 before init paging In the commit of `4e1deab3d9`, we changed the init sequence that init paging first and then init e820 because we worried about the efi memory map could be beyond 4GB space on some platform. After we double checked multiboot2 spec, when system boot from multiboot2 protocol, the efi memory map info will be embedded in multiboot info so it is guaranteed that the efi memory map must be under 4GB space. Consider that the page table will be allocated in free memory space in future, we have to change the init sequence back that init e820 first and then init paging. If we need to support other boot protocol in future that the efi memory map might be put beyond 4GB, we could have below options: 1. Request bootloader put efi memory map below 4GB; 2. Call EFI_BOOT_SERVICES.GetMemoryMap() before ExitBootServices(); 3. Enable a early 64bit page table to get the efi memory map only; Tracked-On: #5626 Signed-off-by: Victor Sun <victor.sun@intel.com>	2021-09-27 09:03:15 +08:00
Zide Chen	a62dd6ad8a	hv: nested: fixed vmxoff_vmexit_handler() issue In VMXOFF vmexit handler, it's supposed to remove VMCS shadowing. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com>	2021-09-26 08:49:35 +08:00
Zide Chen	45b036e028	hv: nested: enable multiple active VMCS12 support This patch changes the size of vvmcs[] array from 1 to PER_VCPU_ACTIVE_VVMCS_NUM, and actually enables multiple active VMCS12 support in ACRN. The basic operations: - if L1 VMPTRLDs a VMCS12 without previously VMCLEAR the current VMCS12, ACRN no longer unconditionally flushes the current VMCS12 back to L1. Instead, it tries to keep both the current and the newly loaded VMCS12 in the nested->vvmcs[] array, unless: - if there is no more available vvmcs[] entry, ACRN flushes one active VMCS12 to make room for this new VMCS12. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-09-26 08:49:35 +08:00
Mingqiang Chi	f39c882359	hv:change log level for check_vmx_ctrl Some processors don't support VMX_PROCBASED_CTLS_TERTIARY bit and VMX_PROCBASED_CTLS2_UWAIT_PAUSE bit in MSRs (IA32_VMX_PROCBASED_CTLS & IA32_VMX_PROCBASED_CTLS2), HV will output error log which will cause confusion, change the log level from pr_err to pr_info. Tracked-On: #6397 Signed-off-by: Mingqiang Chi <mingqiang.chi@intel.com>	2021-09-24 10:17:19 +08:00

1 2 3 4 5 ...

2228 Commits