acrn-hypervisor

Commit Graph

Author	SHA1	Message	Date
Yuan Lu	dbc3ff39aa	hv: vm_reset: simulate RESET_CONTROL(0xCF9) register Add reset_control in acrn_vm. Use this reset_control to simulate RESET_CONTROL(0xCF9) register in hypervisor. Tracked-On: #8724 Signed-off-by: Yuan Lu <yuan.y.lu@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2024-09-12 14:09:17 +08:00
Jiaqing Zhao	eae668268e	hv: handle reboot from Service VM properly Service VM may write 0x6 to port 0xcf9 to trigger a warm reset, but current hypervisor always performs a cold reset by writing 0xE to CF9. Hypervisor should reboot the system in the same mode as Service VM specified. Specific OS features (like linux pstore) requires warm reset to keep data across reboot. The behavior of hv console's reboot command (cold reset) remains unchanged. Tracked-On: #8539 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-09-09 14:37:16 +08:00
Haiwei Li	17c4ce75a1	hv: cpuid: expose CPUID.EAX=07H subleaf to VMs Per SDM, VPDPBUSD/VPDPBUSDS/VPDPWSSD/VPDPWSSDS instructions depend on CPUID Feature Flag 'AVX-VNNI, AVX512_VNNI, AVX512VL'. 'AVX512_VNNI' and 'AVX512VL' are already exposed to any VM. 'AVX-VNNI' is in CPUID.(EAX=07H,ECX=1):EAX.AVX-VNNI[bit 4]. This patch is to expose all the CPUID.EAX=07H subleaf features to VMs. Mask corresponding bits if want to disable some features in the future. Tracked-On: #8710 Reviewed-by: Fei Li <fei1.li@intel.com> Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-09-09 14:03:51 +08:00
Haiwei Li	aba53e78ef	doc: add module design for peripheral vuart device GAI Tooling Notice: These contents may have been developed with support from one or more generative artificial intelligence solutions. This patch is to add doxygen style comments for some elements in vp-dm_vperipheral vuart module. Tracked-On: #8665 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-08-20 17:23:42 +08:00
Haiwei Li	172c56fe0a	doc: add module design for peripheral vrtc device GAI Tooling Notice: These contents may have been developed with support from one or more generative artificial intelligence solutions. This patch is to add doxygen style comments for some elements in vp-dm_vperipheral vrtc module. Tracked-On: #8665 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-08-20 13:36:13 +08:00
Jiaqing Zhao	5c351bee0f	hv: vtd: allocate drhd_dev_scope based on board file Determine the size of drhd_dev_scope based on DRHD_MAX_DEVSCOPE_COUNT in board file instead of hardcoding. The current default value 16 will be used if it is not defined in board file to keep compatibility, a warning will be raised in this case. Tracked-On: #8494 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-08-05 15:51:17 +08:00
Haiwei Li	fa2b8fcfbe	doc: add module design for some defines in hwmgmt_page GAI Tooling Notice: These contents may have been developed with support from one or more generative artificial intelligence solutions. ACRN hypervisor is decomposed into a series of components and modules. The module design in hypervisor is to add inline doxygen style comments above functions, macros, structures, etc. This patch is to add comments for some elements in hwmgmt_page module. Tracked-On: #8665 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-08-01 14:50:27 +08:00
Jiaqing Zhao	2dc56a8f23	hv: add GUEST_FLAG_STATELESS flag GUEST_FLAG_STATELESS indicates guest is running a stateless operating system and need to be shutdown forcefully without data loss. This flag is only appalicable to pre-launched VM. For TEE_VM, this flag will be set implicitly. Tracked-On: #8671 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-07-30 09:26:50 +08:00
Haiwei Li	529ade37a4	config_tools: support vUART Timer pCPU configuration This patch is to allow user to pin vUART timer to specific pCPU via ACRN config tool. User can configure by setting "vUART timer pCPU ID" under Hypervisor->Advanced Parameters. Tracked-On: #8648 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-07-10 10:26:21 +08:00
Haiwei Li	f47b2b6860	config_tools: support vUART Tx/Rx buffer size configuration Introduce an interface to define Tx/Tx buffer size via ACRN config tool. User can configure under Hypervisor->Advanced Parameters. Tracked-On: #8644 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-07-10 10:26:21 +08:00
YuanXin-Intel	e4b1584577	Change Service VM to supervisor role 1. Enable Service VM to power off or restart the whole platform even when RTVM is running. 2. Allow Service VM stop the RTVM using acrnctl tool with option "stop -f". 3. Add 'Service VM supervisor role enabled' option in ACRN configurator Tracked-On: #8618 Signed-off-by: YuanXin-Intel <xin.yuan@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com> Reviewed-by: Jian Jun Chen <jian.jun.chen@intel.com>	2024-06-28 13:35:07 +08:00
nacui	512c98fd79	hv: trace: show cpu usage of vms in pcpu sharing case To maximize the cpu utilization, core 0 is usually shared by service vm and guest vm. But there are no statistics to show the cpu occupation of each vm. This patch is to provide cpu usage statistic for users. To calculate it, a new trace event is added and marked in scheduling context switch, accompanying with a new python script to analyze the data from acrntrace output. Tracked-On: #8621 Signed-off-by: nacui <na.cui@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com> Reviewed-by: Haiwei Li <haiwei.li@intel.com>	2024-06-28 12:55:23 +08:00
Haiwei Li	3d6ca845e2	hv: s3: add timer support When resume from s3, Service VM OS will hang because timer interrupt on BSP is not triggered. Hypervisor won't update physical timer because there are expired timers on pcpu timer list. Add suspend and resume ops for modules that use timers. This patch is just for Service VM OS. Support for User VM will be added in the future. Tracked-On: #8623 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-06-27 11:26:09 +08:00
Haiwei Li	5283c147ef	hv: pci: Add guest cfg header access handling of type 1 device When guests resume form s3, an error occurs in guest: ``` pcieport 0000:00:1c.0: refused to change power state from D0 to D3hot ``` PCI bridge (type 1 device) will access configuration space header but now acrn is not supported. So add handling support. Tracked-On: #8623 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-06-27 11:26:09 +08:00
Haiwei Li	2cd0edaf9c	hv: pci: restore bus and memory/IO info after reset After some kind of reset, such as s3, pci bridge tries to restore the bus and memory/IO info (from 0x18 to 0x32, except for Secondary Latency Timer 0x1b) to resume device state. This patch is to restore these info by hypervisor. Tracked-On: #8623 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-06-27 11:26:09 +08:00
Haiwei Li	81935737ff	hv: s3: reset vm after resume Now only BSP is reset. After Service VM OS resumes from s3, APs' apic_base_msr are incorrect with x2apic bit en. To avoid incorrect states, do `reset_vm` after resume. Tracked-On: #8623 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-06-27 11:26:09 +08:00
Jian Jun Chen	74bc2f7cfb	hv: asyncio: support data match of the same addr Virtio legacy device (ver < 1.0) uses a single PIO for all virtqueues. Notifications from different virtqueues are implemented by writing virtqueue index to the PIO. Writing different values to the same addr needs to be mapped to different eventfds by asyncio. This is called data match feature of asyncio. v3 -> v4: * Update the definition of `struct asyncio_desc` Use `struct acrn_asyncio_info` inside it, instaed of defining the duplicated fileds. * Update `add_asyncio` to use `memcpy_s` rather than assigning all the fields using 5 assignment statements. * Update `asyncio_is_conflict` for coding style 120-character line is sufficient to write all conditions. * Update the checks related to `wildcard` Because we require every conditional clause to have a Boolean type in the coding guideline. v2 -> v3: No change v1 -> v2: No change Tracked-On: #8612 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Signed-off-by: Shiqing Gao <shiqing.gao@intel.com> Acked-by: Wang, Yu1 <yu1.wang@intel.com>	2024-06-05 15:23:33 +08:00
Jiaqing Zhao	91e0612e88	hv: dm: refine create/destroy functions The create function of hv-emulated device must check the return value of vpci_init_vdev() as it returns NULL pointer on failure, and that function should be called atomically. Also, the destory function should deinit the vpci devices created to prevent resource leak. Tracked-On: #8590 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-06-04 09:38:34 +08:00
Jiaqing Zhao	626e2f1d17	hv: vpci: clear vdev structure on device deassign In devicemodel, a passthrough device is deassigned and then assigned to guest on guest reboot. Each time hypervisor allocates a new pci_vdev structure to keep its info. As it was stored in a statically-allocated array, it will eventually use up all slots, resulting both resource leak and out-of-bounds access. Fix it by clearing the corresponding vdev structure on device deassign, thus a bitmap is introduced to track the usage, replacing the existing array count. Tracked-On: #8590 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-06-04 09:38:34 +08:00
Haiwei Li	b885d02396	hv: cpuid: add several leaf to per-cpu list in hybrid architecture P-cores and E-cores accessing leaf 0x2U/0x14U/0x16U/0x18U/0x1A/0x1C/0x80000006U will have different information in hybrid architecture. So add them to per-cpu list in hybrid architecture and directly return the physical value. Note: 0x14U is hided and return 0. Tracked-On: #8608 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-05-28 11:02:56 +08:00
Haiwei Li	d6fe8b0892	hv: cpuid: make leaf 0x6 per-cpu in hybrid architecture Leaf 0x6 returns thermal and power management information. In hybrid architecture, P-cores and E-cores have different information. Add leaf 0x6 to per-cpu list in hybrid architecture and handle specific cpuid access. Tracked-On: #8608 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-05-28 11:02:56 +08:00
Haiwei Li	59a8cc4c28	hv: cpuid: make leaf 0x4 per-cpu in hybrid architecture Leaf 0x4 returns deterministic cache parameters for each level. In hybrid architecture, P-cores and E-cores have different cache information. Add leaf 0x4 to per-cpu list in hybrid architecture and handle specific cpuid access. Tracked-On: #8608 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-05-28 11:02:56 +08:00
Haiwei Li	f7506424e4	hv: cpuid: refactor per-cpu leaves definition CPUID returns processor identification and feature information. Different pcpus may return different infos. That is, the info is per-cpu. In hybrid architecture, per-cpu leaf is different from the previous. So introduce a struct percpu_cpuids to indicate the per-cpu leaf. struct percpu_cpuids will consist of two parts: generic percpu leaves and hybrid related percpu leaves. This patch is just to add generic percpu leaves. Tracked-On: #8608 Signed-off-by: Haiwei Li <haiwei.li@intel.com>	2024-05-28 11:02:56 +08:00
Zhangwei6	ddfcb8c3fc	hv: enable thermal lvt interrupt This patch can fetch the thermal lvt irq and propagate it to VM. At this stage we support the case that there is only one VM governing thermal. And we pass the hardware thermal irq to this VM. First, we register the handler for thermal lvt interrupt, its irq vector is THERMAL_VECTOR and the handler is thermal_irq_handler(). Then, when a thermal irq occurs, it flags the SOFTIRQ_THERMAL bit of softirq_pending, This bit triggers the thermal_softirq() function. And this function will inject the virtual thermal irq to VM. Tracked-On: #8595 Signed-off-by: Zhangwei6 <wei6.zhang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-05-16 09:40:32 +08:00
Zhangwei6	78243c3f49	hv: expose thermal MSRs to VM. In this phase, we only use one VM to control thermal. So we make thermal MSRs readable and writable by this VM. This VM is flagged with GUEST_FLAG_VTM, and can read/write thermal MSRs. For the VMs not flagged with GUEST_FLAG_VTM, can only read these thermal MSRs to get current status. Tracked-On: #8595 Signed-off-by: Zhangwei6 <wei6.zhang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-05-16 09:40:32 +08:00
Qiang Zhang	c623e11125	debug: vuart: add guest break key support The break key (key value 0x0) was used as switch key from guest serial to hv console and guest serial could not receive break key. This blocked some guest debugging features like KGDB/KDB, sysrq, etc. This patch leverages escape sequence "<escape> + <break>" to send break to guest and "<escape> + e" to switch from guest serial to hv console. Tracked-On: #8583 Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-04-25 15:00:09 +08:00
Yonghua Huang	ddfe218747	hv: fill region ID to hv-land ivshmem PCI config space 1) region ID shall be configured by user via config tool. 2) region ID is programmed to "Subsystem ID" of PCI config space. 2) "Subsystem Vendor ID" is harded coded as 0x8086 Tracked-On: #8566 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-03-28 14:34:38 +08:00
Yonghua Huang	a7a6732580	config_tools: support IVSHMEM devices region ID configuration This patch adds ivshmem region ID configuration support when user configure ACRN IVSHMEM devices via ACRN config tool, this ID provides VMs with a stable identification of multiple shared memory regions. Also add logic to generate launch script with region ID configured as below: `add_virtual_device 8 ivshmem hv:/shm_region_0,256,1` Tracked-On: #8566 Signed-off-by: Kunhui-Li <kunhuix.li@intel.com> Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-03-28 14:34:38 +08:00
Wu Zhou	925e3d95b4	hv: add max_len for sbuf_put param sbuf_put copies sbuf->ele_size of data, and puts into ring. Currently this function assumes that data size from caller is no less than sbuf->ele_size. But as sbuf->ele_size is usually setup by some sources outside of the HV (e.g., the service VM), it is not meant to be trusted. So caller should provide the max length of the data for safety reason. sbuf_put() will return UINT32_MAX if max_len of data is less than element size. Additionally, a helper function sbuf_put_many() is added for putting multiple entries. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-02-20 11:52:02 +08:00
Wu Zhou	262a48f346	dm: vm_event: add support for RTC change event When a guest OS performs an RTC change action, we wish this event be captured by developers, and then they can decide what to do with it. (e.g., whether to change physical RTC) There are some facts that makes RTC change event a bit complicated: - There are 7 RTC date/time regs (year, month…). They can only be updated one by one. - RTC time is not reliable before date/time update is finished. - Guests can update RTC date/time regs in any order. - Guests may update RTC date/time regs during either RTC halted or not halted. A single date/time update event is not reliable. We have to wait for the guest to finish the update process. So the DM's event handler sets up a timer, and wait for some time (1 second). If no more change happens befor the timer expires, we can conclude that the RTC change has been done. Then the rtc change event is emitted. This logic of event handler can be used to process HV vrtc time change event too. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Jian Jun Chen <jian.jun.chen@intel.com>	2024-02-01 17:01:31 +08:00
Wu Zhou	d9ccf1ccb2	dm: vm_event: create vm_event thread This patch creates a thread for vm_event delivery. The thread uses epoll to poll event notifications, then read out the msg data queued in sbuf. An event handler is called upon success receiving. Both HV and DM event sources share the same process. Also vm_event tx API for DM event source is added in this patch. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Jian Jun Chen <jian.jun.chen@intel.com>	2024-02-01 17:01:31 +08:00
Wu Zhou	581ec58fbb	hv: vm_event: create vm_event support This patch creates vm_event support in HV, including: 1. Create vm_event data type. 2. Add vm_event sbuf and its initializer. The sbuf will be allocated by DM in Service VM. Its page address will then be share to HV through hypercall. 3. Add an API to send the HV generated event. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-02-01 17:01:31 +08:00
Qiang Zhang	04a4f31d28	config: add four per-vm bvt parameters Add four per-vm bvt parameters as the initial bvt parameter values for vCPU threads. - bvt_weight The time sharing of a thread on CPU. - bvt_warp_value Boost value of virtual time of a thread (time borrowed from future) to reduce Effective Virtual Time to prioritize the thread. - bvt_warp_limit Max warp time in one warp. - bvt_unwarp_period Min unwarp time after a warp. Tracked-On: #8500 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-18 16:26:05 +08:00
Qiang Zhang	6a1d91c740	hv: sched: Add sched_params struct for thread parameters Abstract out schedulers config data for vCPU threads and other hypervisor threads to sched_params structure. And it's used to initialize per thread scheduler private data. The sched_params for vCPU threads come from vm_config generated by config tools while other hypervisor threads need give them explicitly. Tracked-On: #8500 Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-18 16:26:05 +08:00
Qiang Zhang	c000a3f70b	hv: add clamp macro for convenience Add clamp macro to clamp a value within a range. Tracked-On: #8500 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-18 16:26:05 +08:00
Wu Zhou	064be1e3e6	hv: support halt in hv idle When all vCPU threads on one pCPU are put to sleep (e.g., when all guests execute HLT), hv would schedule to idle thread. Currently the idle thread executes PAUSE which does not enter any c-state and consumes a lot of power. This patch is to support HLT in the idle thread. When we switch to HLT, we have to make sure events that would wake a vCPU must also be able to wake the pCPU. Those events are either generated by local interrupt or issued by other pCPUs followed by an ipi kick. Each of them have an interrupt involved, so they are also able to wake the halted pCPU. Except when the pCPU has just scheduled to idle thread but not yet halted, interrupts could be missed. sleep-------schedule to idle------IRQ ON---HLT--(kick missed) ^ wake---kick\| This areas should be protected. This is done by a safe halt mechanism leveraging STI instruction’s delay effect (same as Linux). vCPUs with lapic_pt or hv with CONFIG_KEEP_IRQ_DISABLED=y does not allow interrupts in root mode, so they could never wake from HLT (INIT kick does not wake HLT in root mode either). They should continue using PAUSE in idle. Tracked-On: #8507 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-15 11:52:40 +08:00
Yonghua Huang	791019edc5	hv: define a MACRO to indicate maximum memory size ~0UL is widely used to specify the maximum memory size when calling e820_alloc_memory(), this patch to define a MACRO for it to avoid using this magic number. Tracked-On: #8502 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-12 13:52:48 +08:00
Qiang Zhang	a4a73b5aac	HV: emulate dummy multi-function dev in Service VM For a pdev which allocated to prelaunched VM or owned by HV, we need to check whether it is a multifuction dev at function 0. If yes we have to emulate a dummy function dev in Service VM, otherwise the sub-function devices will be lost in guest OS pci probe process. Tracked-On: #8492 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com> Signed-off-by: Victor Sun <victor.sun@intel.com>	2023-09-11 16:13:16 +08:00
Qiang Zhang	bf653d277b	HV: init one dev config with service vm config param When we do init_all_dev_config() in pci.c, the pdevs added to pci dev_config will be exposed to Service VM or passthru to prelauched VM. The original code would find service VM config in every pci pdev init loop, this is unnecessary and definitely impact performance. Here we generate Service VM config pointer with config tool so that init_one_dev_config() could refer service VM config directly. Tracked-On: #8491 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com> Signed-off-by: Victor Sun <victor.sun@intel.com>	2023-09-11 16:13:16 +08:00
Muhammad Qasim Abdul Majeed	a457e65619	doc: Fix spelling and typo mistakes. Tracked-On: #8488 Signed-off-by: Muhammad Qasim Abdul Majeed <qasim.majeed20@gmail.com>	2023-09-05 09:34:21 +08:00
Jiaqing Zhao	7bfbdf04b8	doc: remove '@return None' for void functions doxygen will warn that documented return type is found for functions that does not return anything in 1.9.4 or later versions. 'None' is not a special keyword in doxyge, it will recognize it as description to the return value that does not exist in void functions. Tracked-On: #8425 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-03 14:56:29 -07:00
Wu Zhou	8af2c263db	hv: disable HFI and ITD for guests The Hardware Feedback Interface (HFI) and Intel® Thread Director (ITD) features require OS to provide a physical page address to IA32_HW_FEEDBACK_PTR. Then the hardware will update the processor information to the page address. The issue is that guest VM will program its GPA to that MSR, causing great risk of tempering memory. So HFI and ITD should be made invisible to guests, until we provide proper virtulization of those features. Tracked-On: #8463 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-01 14:57:23 +08:00
Jiaqing Zhao	3cc1127ae9	hv: fix undefined reference to nested_vmexit_handler() in vmexit.c arch/x86/guest/nested.c, where nested_vmexit_handler() is defined, is only compiled when CONFIG_NVMX_ENABLED is enabled. Define a dummy function in include/arch/x86/asm/guest/nested.h to fix the undefined reference linker error. Tracked-On: #8465 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-01 14:22:14 +08:00
Wu Zhou	89d11d91e2	hv: bugfix: fix the ptdev irq destination issue According to SDM Vol3 11.12.10, in x2APIC mode, Logical Destination has two parts: - Cluster ID (LDR[31:16]) - Logical ID (LDR[15:0]) Cluster ID is a numerical address, while Logical ID is a 16bit mask. We can only use Logical ID to address multi destinations within a Cluster. So we can't just 'or' all the Logical Destination in LDR registers to get one mask for all target pCPUs. This would get a wrong destination mask if the target Destinations are from different Clusters. For example in ADL/RPL x2APIC LDRs for core 2-5 are 0x10001 0x10100 0x20001 0x20100. If we 'or' them together, we would get a Logical Destination of 0x30101, which points to core 6 and another core. If core 6 is running a RTVM, then the irq is unable to get to core 2-5, causing the guest on core 2-5 driver fail. Guests working in xAPIC mode may use 'Flat Model' to select an arbitrary list of CPUs as its irq destination. HV may not be able to include them all when transfering to physical destinations, because the HW is working in x2APIC mode and can only use 'Cluster Model'. There would be no perfect fix for this issue. This patch is a simple fix, by just keep the first Cluster of all target Logical Destinations. Tracked-On: #8435 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-07-05 17:41:16 +08:00
Jiaqing Zhao	e5d46dcc7d	hv: vpci: ignore PCI I/O BAR with non-zero upper 16 bits On x86 platform, the upper 16 bit of I/O BAR should be initialized to zero by BIOS. Howerever, some buggy BIOS still programs the upper 16 bits to non-zero, which causes error in check_pt_dev_pio_bars(). Since I/O BAR reprogramming by VM is currently unsupported, this patch ignores such I/O BARs when creating vpci devices to make VM boot. Tracked-On: #8373 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-26 14:40:57 +08:00
Wu Zhou	db83648a8d	hv: hide thermal interface from guests Thermal events are delivered through lapic thermal LVT. Currently ACRN does not support delivering those interrupts to guests by virtual lapic. They need to be virtualized to provide guests some thermal management abilities. Currently we just hide thermal lvt from guests, including: 1. Thermal LVT: There is no way to hide thermal LVT from guests. But we need do something to make sure no interrupt can be actually trigered: - skip thermal LVT in vlapic_trigger_lvt() - trap-and-emulate thermal LVT in lapic-pt mode 2. As We have plan to introduce virtualization of thermal monitor in the future, we use a vm flag GUEST_FLAG_VTM which is default 0 to control the access to it. So that it can help enabling VTM in the future. Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-15 20:36:44 +08:00
Wu Zhou	c5d019b836	hv: emulate cpuids and MSRs for VHWP Changes made by this patch includes: 1. Emulate HWP and pstate MSRs/CPUIDs. Those are exposed to guest when the GUEST_FLAG_VHWP is set: - CPUID[6].EAX[7,9,10]: MSR_IA32_PM_ENABLE(enabled by hv, always read 1), MSR_IA32_HWP_CAPABILITIES, MSR_IA32_HWP_REQUEST, MSR_IA32_HWP_STATUS, - CPUID[6].ECX[0]: MSR_IA32_MPERF, MSR_IA32_APERF - MSR_IA32_PERF_STATUS(read as base frequency when not owning pCPU) - MSR_IA32_PERF_CTL(ignore writes) 2. Always hide HWP interrupt and package control MSRs/CPUIDs: - CPUID[6].EAX[8]: MSR_IA32_HWP_INTERRUPT(currently ACRN is not able to deliver thermal LVT virtual interrupt to guests) - CPUID[6].EAX[11,22]: MSR_IA32_HWP_REQUEST_PKG, MSR_IA32_HWP_CTL Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-09 10:06:42 +08:00
Wu Zhou	2edf141047	hv: add VHWP guest flag and its helper func Currently CPU frequency control is hidden to guests, and controlled by hypervisor. While it is sufficient in most cases, some guest OS may still need CPU performance info to make multi-core scheduling decisions. This is seen on Linux kernel, which uses HWP highest performance level as CPU core's priority in multi-core scheduling (CONFIG_SCHED_MC_PRIO). Enabling this kernel feature could improve performance as single thread workloads are scheduled on the highest performance cores. This is significantly useful for guests with hybrid cores. The concept is to expose performance interface to guest who exclusively owns pCPU assigned to it. So that Linux guest can load intel_pstate driver which will then provide the kernel with each core's schedule priority. Intel_pstate driver also relies on CONFIG_ACPI_CPPC_LIB to implement this mechanic, this means we also need to provide ACPI _CPC in DM. This patch sets up a guest flag GUEST_FLAG_VHWP to indicate whether the guest can have VHWP feature. Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-09 10:06:42 +08:00
Jiaqing Zhao	770cf8c434	hv: emulate MSR_PLATFORM_INFO (17h) This patch emulates the PLATFORM_INFO MSR in hypervisor to make it only visible to Service VM, and only processor ratios (bit 15:8, 47:40 and 55:48) and sample part bit (27) are exponsed. This is intended to prevent Service VM from changing processor parameters like turbo ratio. Tracked-On: #8406 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2023-05-30 15:10:05 +08:00
Qiang Zhang	fcb8e9bb2d	ptirq: Fix INTx assignment for Post-launched VM When assigning a physical interrupt to a Post-launched VM, if it has been assigned to ServiceVM, we should remove that mapping first to reset ioapic pin state and rte, and build new mapping for the Post-launched VM. Tracked-On: #8370 Signed-off-by: Qiang Zhang <qiang4.zhang@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2023-04-13 12:24:57 +08:00

1 2 3 4 5 ...

1800 Commits