Commits · 18c918c5f59bc35f9c567689daef8c255b575fdc · nexedi / linux

12 Jan, 2011 40 commits

KVM: SVM: Add manipulation functions for exception intercepts · 18c918c5

Joerg Roedel authored Nov 30, 2010

This patch wraps changes to the exception intercepts of SVM
into seperate functions to abstract nested-svm better and
prepare the implementation of the vmcb-clean-bits feature.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

18c918c5

KVM: SVM: Add manipulation functions for DRx intercepts · 3aed041a

Joerg Roedel authored Nov 30, 2010

This patch wraps changes to the DRx intercepts of SVM into
seperate functions to abstract nested-svm better and prepare
the implementation of the vmcb-clean-bits feature.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

3aed041a

KVM: SVM: Add manipulation functions for CRx intercepts · 4ee546b4

Roedel, Joerg authored Dec 03, 2010

This patch wraps changes to the CRx intercepts of SVM into
seperate functions to abstract nested-svm better and prepare
the implementation of the vmcb-clean-bits feature.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

4ee546b4

KVM: SVM: Add function to recalculate intercept masks · 384c6368

Joerg Roedel authored Nov 30, 2010

This patch adds a function to recalculate the effective
intercepts masks when the vcpu is in guest-mode and either
the host or the guest intercept masks change.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

384c6368

KVM: X86: Don't report L2 emulation failures to user-space · fc3a9157

Joerg Roedel authored Nov 29, 2010

This patch prevents that emulation failures which result
from emulating an instruction for an L2-Guest results in
being reported to userspace.
Without this patch a malicious L2-Guest would be able to
kill the L1 by triggering a race-condition between an vmexit
and the instruction emulator.
With this patch the L2 will most likely only kill itself in
this situation.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

fc3a9157

KVM: SVM: Make Use of the generic guest-mode functions · 2030753d

Joerg Roedel authored Nov 29, 2010

This patch replaces the is_nested logic in the SVM module
with the generic notion of guest-mode.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

2030753d

KVM: X86: Introduce generic guest-mode representation · ec9e60b2

Joerg Roedel authored Nov 29, 2010

This patch introduces a generic representation of guest-mode
fpr a vcpu. This currently only exists in the SVM code.
Having this representation generic will help making the
non-svm code aware of nesting when this is necessary.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

ec9e60b2

KVM: Pull extra page fault information into struct x86_exception · 6389ee94

Avi Kivity authored Nov 29, 2010

Currently page fault cr2 and nesting infomation are carried outside
the fault data structure.  Instead they are placed in the vcpu struct,
which results in confusion as global variables are manipulated instead
of passing parameters.

Fix this issue by adding address and nested fields to struct x86_exception,
so this struct can carry all information associated with a fault.
Signed-off-by: Avi Kivity <avi@redhat.com>
Tested-by: Joerg Roedel <joerg.roedel@amd.com>
Tested-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

6389ee94

KVM: Push struct x86_exception into walk_addr() · 8c28d031

Avi Kivity authored Nov 22, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

8c28d031

KVM: Push struct x86_exception info the various gva_to_gpa variants · ab9ae313
Avi Kivity authored Nov 22, 2010
```
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
```
ab9ae313

KVM: x86 emulator: simplify exception generation · 35d3d4a1

Avi Kivity authored Nov 22, 2010

Immediately after we generate an exception, we want a X86EMUL_PROPAGATE_FAULT
constant, so return it from the generation functions.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

35d3d4a1

KVM: x86 emulator: tighen up ->read_std() and ->write_std() error checks · db297e3d

Avi Kivity authored Nov 22, 2010

Instead of checking for X86EMUL_PROPAGATE_FAULT, check for any error,
making the callers more reliable.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

db297e3d

KVM: x86 emulator: drop dead pf injection in emulate_popf() · 42438e36

Avi Kivity authored Nov 22, 2010

If rc == X86EMUL_PROPAGATE_FAULT, we would have returned earlier.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

42438e36

KVM: x86 emulator: make emulator memory callbacks return full exception · bcc55cba

Avi Kivity authored Nov 22, 2010

This way, they can return #GP, not just #PF.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

bcc55cba

KVM: x86 emulator: introduce struct x86_exception to communicate faults · da9cb575

Avi Kivity authored Nov 22, 2010

Introduce a structure that can contain an exception to be passed back
to main kvm code.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

da9cb575

KVM: MMU: delay flush all tlbs on sync_page path · a4ee1ca4

Xiao Guangrong authored Nov 23, 2010

Quote from Avi:
| I don't think we need to flush immediately; set a "tlb dirty" bit somewhere
| that is cleareded when we flush the tlb.  kvm_mmu_notifier_invalidate_page()
| can consult the bit and force a flush if set.
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

a4ee1ca4

KVM: MMU: abstract invalid guest pte mapping · 407c61c6

Xiao Guangrong authored Nov 23, 2010

Introduce a common function to map invalid gpte
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

407c61c6

KVM: MMU: remove 'clear_unsync' parameter · a4a8e6f7

Xiao Guangrong authored Nov 19, 2010

Remove it since we can judge it by using sp->unsync
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

a4a8e6f7

KVM: MMU: rename 'reset_host_protection' to 'host_writable' · 9bdbba13

Lai Jiangshan authored Nov 19, 2010

Rename it to fit its sense better
Signed-off-by: Lai Jiangshan <laijs@cn.fujitsu.com>
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

9bdbba13

KVM: MMU: don't drop spte if overwrite it from W to RO · b330aa0c

Xiao Guangrong authored Nov 19, 2010

We just need flush tlb if overwrite a writable spte with a read-only one.

And we should move this operation to set_spte() for sync_page path
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

b330aa0c

KVM: MMU: fix forgot flush tlbs on sync_page path · 30bfb3c4

Xiao Guangrong authored Nov 19, 2010

We should flush all tlbs after drop spte on sync_page path since

Quote from Avi:
| sync_page
| drop_spte
| kvm_mmu_notifier_invalidate_page
| kvm_unmap_rmapp
| spte doesn't exist -> no flush
| page is freed
| guest can write into freed page?

KVM-Stable-Tag.
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

30bfb3c4

KVM: PPC: Fix compile warning · 27923eb1

Alexander Graf authored Nov 25, 2010

KVM compilation fails with the following warning:

include/linux/kvm_host.h: In function 'kvm_irq_routing_update':
include/linux/kvm_host.h:679:2: error: 'struct kvm' has no member named 'irq_routing'

That function is only used and reasonable to have on systems that implement
an in-kernel interrupt chip. PPC doesn't.

Fix by #ifdef'ing it out when no irqchip is available.
Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

27923eb1

KVM: Add instruction-set-specific exit qualifications to kvm_exit trace · 586f9607

Avi Kivity authored Nov 18, 2010

The exit reason alone is insufficient to understand exactly why an exit
occured; add ISA-specific trace parameters for additional information.

Because fetching these parameters is expensive on vmx, and because these
parameters are fetched even if tracing is disabled, we fetch the
parameters via a callback instead of as traditional trace arguments.
Signed-off-by: Avi Kivity <avi@redhat.com>

586f9607

KVM: Record instruction set in kvm_exit tracepoint · aa17911e

Avi Kivity authored Nov 17, 2010

exit_reason's meaning depend on the instruction set; record it so a trace
taken on one machine can be interpreted on another.
Signed-off-by: Avi Kivity <avi@redhat.com>

aa17911e

KVM: fast-path msi injection with irqfd · bd2b53b2

Michael S. Tsirkin authored Nov 18, 2010

Store irq routing table pointer in the irqfd object,
and use that to inject MSI directly without bouncing out to
a kernel thread.

While we touch this structure, rearrange irqfd fields to make fastpath
better packed for better cache utilization.

This also adds some comments about locking rules and rcu usage in code.

Some notes on the design:
- Use pointer into the rt instead of copying an entry,
  to make it possible to use rcu, thus side-stepping
  locking complexities.  We also save some memory this way.
- Old workqueue code is still used for level irqs.
  I don't think we DTRT with level anyway, however,
  it seems easier to keep the code around as
  it has been thought through and debugged, and fix level later than
  rip out and re-instate it later.
Signed-off-by: Michael S. Tsirkin <mst@redhat.com>
Acked-by: Marcelo Tosatti <mtosatti@redhat.com>
Acked-by: Gregory Haskins <ghaskins@novell.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

bd2b53b2

KVM: VMX: Fold __vmx_vcpu_run() into vmx_vcpu_run() · 104f226b

Avi Kivity authored Nov 18, 2010

cea15c2 ("KVM: Move KVM context switch into own function") split vmx_vcpu_run()
to prevent multiple copies of the context switch from being generated (causing
problems due to a label). This patch folds them back together again and adds
the __noclone attribute to prevent the label from being duplicated.
Signed-off-by: Avi Kivity <avi@redhat.com>

104f226b

KVM: x86 emulator: do not perform address calculations on linear addresses · 30b31ab6

Avi Kivity authored Nov 17, 2010

Linear addresses are supposed to already have segment checks performed on them;
if we play with these addresses the checks become invalid.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

30b31ab6

KVM: x86 emulator: preserve an operand's segment identity · 90de84f5

Avi Kivity authored Nov 17, 2010

Currently the x86 emulator converts the segment register associated with
an operand into a segment base which is added into the operand address.
This loss of information results in us not doing segment limit checks properly.

Replace struct operand's addr.mem field by a segmented_address structure
which holds both the effetive address and segment. This will allow us to
do the limit check at the point of access.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

90de84f5

KVM: x86 emulator: drop DPRINTF() · d53db5ef

Avi Kivity authored Nov 17, 2010

Failed emulation is reported via a tracepoint; the cmps printk is pointless.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

d53db5ef

KVM: x86 emulator: drop unused #ifndef __KERNEL__ · 8a6bcaa6

Avi Kivity authored Nov 17, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

8a6bcaa6

KVM: VMX: Inform user about INTEL_TXT dependency · f9335afe

Shane Wang authored Nov 17, 2010

Inform user to either disable TXT in the BIOS or do TXT launch
with tboot before enabling KVM since some BIOSes do not set
FEATURE_CONTROL_VMXON_ENABLED_OUTSIDE_SMX bit when TXT is enabled.
Signed-off-by: Shane Wang <shane.wang@intel.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

f9335afe

KVM: rename hardware_[dis|en]able() to *_nolock() and add locking wrappers · 75b7127c

Takuya Yoshikawa authored Nov 16, 2010

The naming convension of hardware_[dis|en]able family is little bit confusing
because only hardware_[dis|en]able_all are using _nolock suffix.

Renaming current hardware_[dis|en]able() to *_nolock() and using
hardware_[dis|en]able() as wrapper functions which take kvm_lock for them
reduces extra confusion.
Signed-off-by: Takuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

75b7127c

KVM: take kvm_lock for hardware_disable() during cpu hotplug · 97e91e28

Takuya Yoshikawa authored Nov 16, 2010

In kvm_cpu_hotplug(), only CPU_STARTING case is protected by kvm_lock.
This patch adds missing protection for CPU_DYING case.
Signed-off-by: Takuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

97e91e28

KVM: MMU: don't mark spte notrap if reserved bit set · e730b63c

Xiao Guangrong authored Nov 17, 2010

If reserved bit is set, we need inject the #PF with PFEC.RSVD=1,
but shadow_notrap_nonpresent_pte injects #PF with PFEC.RSVD=0 only
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

e730b63c