Data Analyst Job Search

Data Analyst

Alex's headshot

ABSTRACT

Data analysis is a process of inspecting, cleansing, transforming and modeling data with the goal of discovering useful information, studying trends and supporting decision-making. Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names, and is used in different business, science, and social science domains. In today's business world, data analysis plays a role in making decisions more scientific and helping businesses operate more effectively and efficiently.

Looking for a job as a Data Analyst? Perhaps this project can guide you along!

Motivations for this project:

Amidst the COVID19 pandemic many people lost their jobs, with this dataset it is possible to hone the job search so that more people in need can find employment in technical positions such as data/business analyst. This dataset was created by picklesueat and contains more than 2000 job listings for data analyst positions, with features such as:

Questions for exploration:

METHODS

Libraries used:

Data Cleaning:

  1. Remove 'Unnamed: 0' column --> .drop() method
  2. Parse the Lower and Upper Bounds of Salary Estimate, Company Size, and Revenue for granular analysis --> for loop with .replace(), .split() and .strip() methods
  3. Remove the "/n" in the Company Name column --> for loop with .split() method
  4. Rows with -1, '-1.0', and '1' were in place in certain columns. It seems like these values were input in place as Null values. I replaced them relevant values in each respective column.
  5. OpenCage Geocoding on the State column to obtain forward geocoding (text to lat/long) via a RESTful API.

Data Analysis & Visualizations

What are the top 20 industries hiring during COVID19?

What are the top 20 companies hiring during COVID19?

What is the distribution of a data analyst's salary?

What are the top 20 industries hiring ordered by median salary?

Lower Bound of Median Salary by Industries with at least 15 job postings

  1. Biotechnology and pharmaceutical companies both produce medicines, but the medicines made by biotechnology companies are derived from living organisms while those made by pharmaceutical companies generally have a chemical basis. The industry accounted for more than $1.3 trillion in economic output, representing 4% of total U.S. output in 2017 alone.
  2. According to Statista's IT Market Model, the global IT equipment market, which includes PCs and tablets, servers, and IT peripherals, had a size of $518 billion dollars in 2017.
  3. Video games are a billion-dollar business and have been for many years. In 2020, the revenue from the worldwide PC gaming market was estimated at almost $37 billion, while the mobile gaming market generated an estimated income of over $77 billion.

Upper Bound of Median Salary by Industries with at least 15 job postings

What are the top 10 sectors by max revenue?

  1. The Health and Medical Insurance sector, which is made up of carriers of private, group and public health, medical and dental insurance, is characterized by growth as a result of consistent increases in healthcare expenditure and medical cost inflation, in addition to a sharp decline in the uninsured rate.
  2. In 2019, the mining sectors leading companies had a total revenue of approximately $692 billion dollars.
  3. Aerospace & Defense's total sales revenue in 2018 exceeded $929 billion, an increase of 4.17% from the previous year. The impact of this growth has been substantial for the nation’s gross domestic product (GDP). In 2018 alone, A&D contributed over $374 billion to the GDP of the United States, representing 1.8% of the entire GDP.
  4. In 2019, the telecommunication sector's leading companies had a total revenue of approximately $276 billion dollar.
  5. Finance, retail, manufacturing, media, transportation, and pharmaceuticals are essential and each plays a role in everyday life.

What are the top 15 industries by max revenue?

Median Income vs. Upper Bound Salary

Which states offer the most data analyst roles?

  1. It's no surprise that California has the most job offerings due to its population, size, and location of Silicon Valley.
  2. Austin, Texas is one of the upcoming tech hubs forming outside of Silicon Valley. The cost of living is much lower than Silicon Valley which attracts many college graduates and young professionals.
  3. New York City is a global icon on the forefront of culture, finance, and media. Digital media platform, finance tech, and real estate are some of the few popular industries that employ data analyst roles.
  4. Chicago, Illinois has roughly 6,000 tech companies with popular industries of fintech, healthtech, and big data.

Detailed Company Info by Rating

Clustering Regions of Data Analyst roles

  1. Pacific Northwest - 81 jobs
  2. Northern California | Silicon Valley - 306 jobs
  3. Southern California - 325 jobs
  4. Southeast Texas - 397 jobs
  5. Colorado - 130 jobs
  6. The Midwest - 225 jobs
  7. Northeast - 655 jobs
  8. The South - 133 jobs

Takeaways

  1. As I am in the process of finding my first job as a data analyst, this project provided valuable insight that companies are actively hiring. Although big tech companies such as Uber, Airbnb, and LinkedIn exercised mass layoffs due to the COVID19. Many companies are cuttings costs to consolidate their spending since many are uncertain the length of shelter in place and looming residual effects on the economy. Tech giants such as Facebook and Google allowed their employees to work remotely until July 2021.
  2. The top 2 industries actively hiring are staffing agencies which make up nearly 30% of all Data Analyst jobs with a non-null entry. Outsourcing is cost- and time-efficient since it keeps finances flexible, not fixed. In-house – Hiring new staff members don't come cheap. Companies would have to face financial challenges with fixed HR expenses and even intangible costs like time to hire a full-time team. Companies are cutting costs due to the COVID19.
  3. Nearly half of the top 20 companies hiring (by listings) are recruitment agencies. The only non-staffing companies are Apple, MUFG (Financial Services), Citi, Bank of New York Mellon, and Molina Healthcare.
  4. It's no surprise that California has the most job offerings due to its population, size, and Silicon Valley. Texas, New York, Illinois, Washington, and Colorado are flourishing tech hubs outside of the Silicon Valley.
  5. People actively on the job hunt may refer to the company ratings plot to filter out desired companies.
  6. For experienced applicants, the upper bound salary exceeds the median income of each associated states in 2017.
  7. Northeastern America and Southeast Texas are flourishing tech hubs with the amount of opportunities compared to other region clusters.