What is meaning of option "Quiesce Guest File System" while taking snapshot. And what is its use? Please share deep dive.
Snapshot - Quiesce Guest File System.
Datastore Not Freeing Up Space After Deleting VM
I have a VMFS 6 datastore on ESXi 6.5 that is 9.1 TB in size. I had a large VMDK that I was using to store files that was about 9.0 TB and I transferred the files elsewhere and then removed that VM. When browsing the file directory of the datastore it shows two other VMs that I still have in there that are about 35 GB each in size which is good, but the datastore overall says there is only about 30 GB free when it should say there is about 8 TB free.
Is there a manual process that needs to be run that will tell the datastore that files were deleted and there is free space now?
The ramdisk 'tmp' is full.
Hi
I'm getting repeated error messages "The ramdisk 'tmp' is full..." on an HP ESXi, 6.5.0, 5310538
In the /tmp folder I had a very large ql_ima.log which I deleted with the rm command.
If I do a vdf -h it shows the tmp folder as size 256M and used 256M with 0B available.
Doing an ls -lsa on the tmp and all its subfolders shows only 11M of used space.
The ql_ima.log has now grown to 1.6M
If I delete the ql_ima_sdm.log_old I can recover 4.7M and that shows in the vdf results. But that soon gets consumed by the ql_ima.log
Any idea what’s going on and how I can recover the lost free space would be much appreciated.
Paul
Possibly a Zombie VMDK file ! Please Check
Hi All,
I recently installed RVtools,(RVtools is a free tool and lite weight tool to view and get information on all the esxi/esx host)
In RVtools on one of the option vhealth check there i can one of the suggestion as
"Possibly a Zombie VMDK file ! Please Check"
what does it mean , and what can be done to remove that error.
Does any one got the similar message.
Regards
Raju Gunnal
How to trigger NTP client to sync time?
I have an internal NTP server in the LAN (so there's no need to configure any firewall for this matter) but my ESXi 4.1U1 servers are totally out of time (3 to 5 minutes).
I've tried to restart the NTP client n times but the time is still not synch'ed.
Of course, I could manually change the time but what's the point to have the NTP client if we have to do it manually?
Is there any command (through SSH) to trigger the time-sync?
TIA
Check VMDK for Corruption
Is there a way to check the integrity of a VMDK file? The system is reporting numerous I/O errors, and cannot be VMotion'ed to a different LUN. Other VMS on the same LUN appear to be working normally.
Can't mount NFS share - Operation failed, diagnostics report: Unable to get console path for volume
I'm trying to mount an NFS volume on ESXI 6, but keep running into this error. Googling about hasn't helped, so here I am. Error:
Call "HostDatastoreSystem.CreateNasDatastore" for object "ha-datastoresystem" on ESXi "192.168.xx.xx" failed.
Operation failed, diagnostics report: Unable to get console path for volume, sample name.
The NFS share is located on a synology nas. I've checked permissions and configuration. Everything looks correct based on the various tips and KB articles.
Ideas?
Thanks.
Unable to connect to the MKS: Internal error
Can anybody help me with this error
"Unable to connect to the MKS: Internal error"
It's appear on my console, when i want to install my first Host.
I'm using Vmware ESX
Thanks,
VssSyncStart' operation failed
Hi,
I just face a event log as below:
Warning message from localhost.localdomain:
The guest OS has reported an error during
quiescing. The error code was: 5 The error
message was: 'VssSyncStart' operation failed:
IDispatch error #8449 (0x80042301)
warning
it caused the Acronis Backup software fail to backup
In Communities information, it seem relate to disk space and i/o speed.
For disk space, we should have enough disk space for OS.
Also i trust it is not heavy duty when backup, i/o should be enough
In knowledge base 1018194, resolution is reboot the virtual machine.
is it reboot the acronis appliance or our production os?
Any method to prevent the same?
Existing our version is EXSi 5.1 built 799733.
Thank you
Rgds,
Sun
Is it normal for the vmkdump folder to contain a dumpfile?
Hi everyone,
It might be a stupid question, but I was wondering about the following:
I was trying to clean up one of our datastores, and came across a Folder called "vmkdump" containing a single file named something like "xxxxx-xxxxx-xxxxxx-xxxxx.dumpfile". I found out those a dumps of the ESXi Server used when the machine crashes. Now my question is, are those files created when the machine crashes, or is this file created while the Server is running and data is collected until the Server crashes?
I guess in the end what I'm really asking is, can I get rid of that dumpfile, because we don't Need it anymore, or is at least one active dumpfile needed for the Server to function properly?
Thanks in advance,
Fabian Fritsch
ESXi 6.5 Failing to upload/deploy OVA files postNFCData failed error
I've been tasked with testing out multiple SSL VPN solutions for our company. Naturally before putting it into our dev or production servers I wanted create a test environment with two desktops to try out different SSL VPN trials.
Client: Version 1.23.0 (Build 6360286)
Host:Version 6.5.0 (Build 6765664)
I deployed F5-BIG-IP's OVA (1.36GB) perfectly fine and could experiment around with the software. Next I went to try out Array Network's vxAG (2.40GB) and it failed with different errors, I thought it may just be Array's OVA is corrupt. From there I went out and tried damn small linux to see if it was an OVA problem but the linux client deployed without a hitch. So I went to try to deploy Pulse Secure SSL VPN solution which is an VMDK and OVF file (3.02GB) and that eventually error-ed out just like Array Network's. I tried uploading a large file and that ending up failing (Windows install.esd 2.87GB) so I have a feeling it has something to do with the file sizes.
Here are the multiple different errors:
If I try to create a new virtual machine the OVA it gets to a random percent freezes and eventually gives the 'Result: Failed - Upload disk cancelled' and I get a "Failed to deploy VM: postNFCData failed" error (Note the VMDK file uploads to the datastore but not the OVF or MF)
If I try to just upload the file to the datastore it also gets to the same percent, freezes then after a random amount of time the OVA shows in the datastore, but if you close the datastore page and reopen it the file magically disappears! Meanwhile the 'Upload file to datastore' Task is still hanging at the same percent.
Array Network's vxAG gets to 74-75% then fails
Pulse gets to 60% and fails
The Windows install.esd gets to 60% and fails
Attempted troubleshooting
I've tried Chrome, Firefox, and IE
I've tried ESXi 6.0 and different builds of 6.5
I've patched to the most up-to-date host and client version (Suggested in related discussions:ESXi 6.0 U2 - "Failed to deploy VM: postNFCData failed." - when importing a OVA template via ESXi Host Client 1.5.0.)
I've tried messing with certificates and DNS to a degree
Added a licencing key
I really rather not resort to using the ovftool even though I constantly hear how amazing it is.
Does anyone have a fix or other troubleshooting routes I could take? Any help would be much appreciated!
VM Ping/ARP issue
We are having a problem with some of our virtual machines intermittently losing communication with each other, and I’m at a loss as to the source.
We have about 250 VM’s running on about 20 HP BL465C blades installed on two HP C7000 chassis, using the HP Virtual Connect interconnect modules. The blade chassis are connected to our core Cisco 6500 switches. The VMWare hosts are at 5.0, the guest VM’s are a mix on Windows 2003, 2008, and 2008R2.
What’s going on is that everything seems to be OK, but then out of nowhere, we will get communication failures between specific machines. It looks like it’s an ARP issue. Using PING, it works fine in one direction, but we get an “unreachable” error when going the other way, unless we ping from the target back to the source first.
For example: we have servers, “A” and “B”. Ping A to B fails with “unreachable”. Ping “B” to “A” works fine. However after pinging “B” to “A”, we can now ping “A” to “B”, at least for a while until the entry falls out of the ARP cache. If we go into server “A” and set a static ARP entry (“arp –s”) for server “B”, everything works OK. Through all this both server “A” and server “B” have no issues communicating with any other machines.
We tried using vMotion to move the servers to a different host, different blade chassis, etc. Nothing worked except when we put both VM’s on the same host. Then everything worked OK. Moving one of the servers to a different host and the problem came back.
It seems like either the ARP broadcast from the one server, or the reply back from the target isn't making it through. However, according to our networking group, there are no issues showing up Cisco switches.
Early this year, we had an issue where it happened on about a third of machines at the same time (it caused significant outages to production systems!). It seemed like it was limited to machines on one chassis (but not all of the machines on that chassis). At that time, we opened up tickets with VMWare and HP. Neither found anything wrong with our configurations, but somewhere in the various server moves, configuration resets, etc., everything started working.
Since that time we’ve seen it very intermittently on a few machines, but then it seems to go away after a few days.
The issue we found today was that the server we’re using for the Microsoft WSUS server hadn’t been receiving updates from a couple of the member servers. We could ping from the WSUS to the member server, but not back from the member server unless we put a static ARP entry in the member server. The member servers are working fine otherwise, talking to other machines OK, etc. They are a production environment, so we’re limited on the testing we can do.
Also, when it has happened, it seems like always been between machines on the same subnet. However, most of our servers are on the same subnet, so it might just be coincidence.
I’ve done a lot of internet searching, and have found some postings with similar issues, but haven’t found any solution. I don’t know if it’s a VMWare issue, HP, Cisco, or Windows issue.
Any assistance would be appreciated.
Mike O'Donnell
vSphere HA agent on this host cannot reach some of the management network addresses
vSphere HA agent on this host cannot reach some of the management network addresses of other hosts, and HA may not be able to restart VMs if a host failure occurs
how to resolve thsi issue
Snapshot - Quiesce Guest File System.
What is meaning of option "Quiesce Guest File System" while taking snapshot. And what is its use? Please share deep dive.
ESXi 6.0 and Emulex LPe11000 (possibly other Emulex cards that aren't working oob)
I'm posting this so others might not have to spend hours on the line with support figuring this out.
My environment is older and has the Emulex LPe11000 hba cards. This card isn't discovered oob.
Obviously a problem right. After spending hours on the phone with VMWare support they pointed me to Emulex.
The issue really is that some of VMWare's support personnel don't know where they keep all the VIB files for their OS.
I was able to find the attached VIB here. Import this and apply it and your Emulex card will start working and you can take your hosts up to ESXi 6.
Datastore NFS or iSCSI
Hello,
I am installing my new VMware 6.7 infrastructure on my storage but I have the choice between NFS or iSCSI.
I know this question has already asked but when I see older discussion on it, it is on different storage.
But me I can choice NFS or iSCSI on the same storage.
Thanks
ESXi cannot detect disks.
Hi, I'm running esxi 6 and having trouble detecting disks. esx just can't see them at all whereas a ubuntu live disk can. I have an ssd and 2 hdd's connected through an adaptec 6405 raid contoller card in their own individual JBOD's, this is a configuration that works on two of our other servers. I have tried re-initialising disk and rebuilding partition table in gparted but nothing seems to work. Any ideas?
Raid controller is on HCL http://www.vmware.com/resources/compatibility/detail.php?deviceCategory=io&productid=20867&deviceCategory=io&details=1&partner=475&releases=273&keyword=6405&page=1&display_interval=10&sortColumn=Partner&sortOrder=Asc
Driver is installed
Unable to take quiesced snapshot
After a reboot of a server, it can happen that it is not possible to take a quiesced snapshot.It happened after an update to 6U1 from 6.0. Before the update there were no problems.
We did the following to troubleshoot the problem.
* Reinstall VMWare Tools
* Check VSS Providers for errors (no errors)
What we found out was that it can happen after a reboot of the server and only if the service "VMware Snapshot Provider" is running (service is set to manual). To correct it we need to stop "VMware Snapshot Provider" and "VMware Tools". If all is stable, ie no running services, which should not run, like "Volume Shadow Copy" we start the service "VMware Tools". After this we can take a quiesced snapshot again.
While this ain't a huge problem, it is annoying at best. Any one have a solution or have the same problem.
The ramdisk 'tmp' is full.
Hi
I'm getting repeated error messages "The ramdisk 'tmp' is full..." on an HP ESXi, 6.5.0, 5310538
In the /tmp folder I had a very large ql_ima.log which I deleted with the rm command.
If I do a vdf -h it shows the tmp folder as size 256M and used 256M with 0B available.
Doing an ls -lsa on the tmp and all its subfolders shows only 11M of used space.
The ql_ima.log has now grown to 1.6M
If I delete the ql_ima_sdm.log_old I can recover 4.7M and that shows in the vdf results. But that soon gets consumed by the ql_ima.log
Any idea what’s going on and how I can recover the lost free space would be much appreciated.
Paul
Failed: H:0x0 D:0x2 P:0x0 Valid sense data
On my ESXi 5.5 U3 servers, I'm seeing regular messages in /var/log/vmkernel about failed valid sense data:
2017-02-23T14:08:16.307Z cpu2:34366)NMP: nmp_ThrottleLogForDevice:2458: Cmd 0x85 (0x412e86932dc0, 34608) to dev "naa.6c81f660d81ddf001a52e77d094d4447" on path "vmhba0:C2:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE
2017-02-23T14:08:16.307Z cpu2:34366)ScsiDeviceIO: 2369: Cmd(0x412e86932dc0) 0x4d, CmdSN 0x105a from world 34608 to dev "naa.6c81f660d81ddf001a52e77d094d4447" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2017-02-23T14:08:16.307Z cpu2:34366)ScsiDeviceIO: 2369: Cmd(0x412e86932dc0) 0x1a, CmdSN 0x105b from world 34608 to dev "naa.6c81f660d81ddf001a52e77d094d4447" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x24 0x0.
I thought this was due to the change in SCSI heartbeats (As per KB 2113956 ) I've applied the workaround:
# esxcli system settings advanced set -i 0 -o /VMFS3/UseATSForHBOnVMFS5
But I'm still seeing the errors. The disc is an onboard RAID card virtual drive:
# esxcli storage nmp device list
naa.6c81f660d81ddf001a52e77d094d4447
Device Display Name: Local DELL Disk (naa.6c81f660d81ddf001a52e77d094d4447)
Storage Array Type: VMW_SATP_LOCAL
Storage Array Type Device Config: SATP VMW_SATP_LOCAL does not support device configuration.
Path Selection Policy: VMW_PSP_FIXED
Path Selection Policy Device Config: {preferred=vmhba0:C2:T0:L0;current=vmhba0:C2:T0:L0}
Path Selection Policy Device Custom Config:
Working Paths: vmhba0:C2:T0:L0
Is Local SAS Device: false
Is USB: false
Is Boot USB Device: false
The hardware is a PowerEdge R720xd server with a PERC H710P Mini controller.