Site Reliability Engineer
Cohesity
Total years of experience: 12 years, 3 months
• Part of the Support escalations team working on Cohesity Data Platform, data protection,
backup, restore and file services.
• Cohesity platform deployment, installation and upgrade; troubleshooting platform, network, backup and storage related issues.
• In particular, working on integration with various backup scenarios and workflows
(vSphere, Windows, Linux, databases (MSSQL, Oracle), NAS, SMB, Pure Storage, NetApp, Nutanix Acropolis, Cisco HyperFlex).
• Troubleshooting DNS, NTP and AD related issues.
• Troubleshooting core services (Bridge, Magneto, Yoda, Gandalf, Icebox, Madrox, Apollo).
• As Cohesity is a hyperconverged infrastructure, troubleshooting metadata and replication factor
issues (node down / disk failure), chunks, bricks, blobs and replication scenarios.
• Hands-on experience in configuring Nutanix Solutions for private cloud.
• Point of contact for Nutanix performance and core data path issues.
• Administer and support Nutanix production clusters running on major industry-standard hypervisors (AHV, VMware ESXi and Microsoft Hyper-V).
• Assist customers, field engineers and consultants in deploying Nutanix solutions on multiple hypervisors across hardware platforms such as HP, Cisco, Dell, Fujitsu and Lenovo.
• Troubleshoot, debug and diagnose Nutanix customer issues in the field.
• Provide end-to-end support to install, configure and test Nutanix clusters running various hypervisors and NOS (CentOS-based) versions.
• Administer and support software-defined features on Nutanix clusters such as HA, DRS, DFS storage, deduplication, compression, snapshots/clones, remote backup and DR.
• Contribute to product enhancement by testing and identifying bugs while working closely with developers.
• Define and drive changes to our product with the development engineering team, given feedback from customers and field implementations.
• Improve serviceability of the product by testing new features.
• Work with technology partners to resolve issues and push improvements in our ecosystem.
• Develop and contribute to internal and external knowledge bases.
• Responsible for the management, maintenance and end-to-end production support of the VMware infrastructure and Cisco UCS infrastructure of Cisco mission-critical servers across all sites globally.
• System administration support includes new implementations and installations, proactive maintenance and break-fix support for the existing environment.
• Regular audit of the vCenter inventory, datastore usage and resource availability.
• Analyzing and improving virtual machine performance.
• Work on root cause analysis of recurring issues and intermittent performance issues.
• Worked as part of the Virtualization and Windows Server team, responsible for continuous monitoring, administration and builds of enterprise client’s physical/virtual servers.
• Provide the best possible solution for both management and monitoring of all real-time servers by acting proactively on Incident Management, Change Management and Problem Management methodologies, abiding by Six Sigma quality assurance standards under an ITIL-aligned process.
• Creating various availability/performance reports as per organizational requirements.
• Coordinate with hardware/software vendors to troubleshoot related issues.
(BE ECE) with 81% Marks
80%