Title: | Data Sets for Courses at the Münster School of Business |
Version: | 0.0.2 |
Description: | Provides sample data sets that are used in statistics and data science courses at the Münster School of Business. The datasets refer to different business topics but also other domains, e.g. sports, traffic, etc. |
License: | GPL (≥ 3) |
Encoding: | UTF-8 |
LazyData: | true |
RoxygenNote: | 7.2.3 |
Depends: | R (≥ 2.10) |
Imports: | tibble |
NeedsCompilation: | no |
Packaged: | 2024-03-08 11:16:59 UTC; michael |
Author: | Michael Bücker |
Maintainer: | Michael Bücker <michael.buecker@fh-muenster.de> |
Repository: | CRAN |
Date/Publication: | 2024-03-09 10:50:02 UTC |
MSBStatsData: Package containing data for my statistics courses at the Münster School of Business
Description
What the package does (one paragraph).
Author(s)
Maintainer: Michael Bücker michael.buecker@fh-muenster.de (ORCID)
Authors:
Niels Schlüsener niels.schluesener@fh-muenster.de
Data of competition results of decathlon for men
Description
The decathlon for men is a combined event in athletics consisting of 10 track and field events: 100 metres, 400 metres, 1500 metres, 110 metre hurdles, long jump, high jump, pole vault, discus throw, javelin throw, and shotput.
Usage
decathlon
Format
A tibble with 7,958 rows and 10 variables:
- race100m
dbl result of 100m race in seconds
- longjump
dbl result of long jump in meters
- shotput
dbl result of shot put in meters
- highjump
dbl result of high jump in meters
- race400m
dbl result of 400m race in seconds
- race110mhurdles
dbl result of 110m hurdles race in seconds
- discus
dbl result of discus throw in meters
- polevault
dbl result of pole valut in meters
- javelinthrow
dbl result of javelin throw in meters
- race1500m
dbl result of 1500m race in seconds
HR data
Description
The dataset contains human resources data of a footwear company. Each entity of the dataset represents one employee with a total of six attributes.
Usage
hr_data
Format
A tibble with 1,200 rows and 6 variables:
- position [fct]
Position of the employee in the company.
- working_hours [dbl]
Contracted working hours of the employee per week.
- salary [dbl]
Monthly salary of the employee.
- hourly_wage [dbl]
Hourly compensation of the employee.
- department [fct]
Department in which the employee is employed.
- sick_days [dbl]
Sick days of the employee in the period observed.
Data of Marketing expenses
Description
The dataset contains product-, marketing- and sales data of 235 shoes of a footwear company. Each entity represents one shoe, listed with its product data, marketing data and sales data in a total of 14 variables.
Usage
marketing_expenses
Format
A tibble with 235 rows and 14 variables:
- marketing_expenses [dbl]
Expenses for marketing activities for the shoe.
- customers_reached [dbl]
Estimated number of customers reached by the footwear marketing activity of the shoe.
- negative_reactions [dbl]
Number of negative reactions to the marketing activities of the shoe.
- price [dbl]
Retail price of the shoe.
- price_segment [fct]
Price segment of the shoe.
- number_of_sizes [dbl]
Number of sizes in which the shoe is available.
- target_customer [fct]
Gender the shoe is intended for.
- rating_testers [dbl]
Average product rating of the test customers for the shoe.
- rating_customers [dbl]
Average product rating of the real customers for the shoe.
- color_most_sold [fct]
Color in which the shoe is selled the most often.
- return_rate [dbl]
Rate how often the shoe is returned by the customer.
- sales_volume [dbl]
Number of sales for the shoe.
- rank_rating_customers [dbl]
Attribute 'rating_customers', divided into ranks for the calculation of the correlation coefficient according to Spearman.
- rank_price_segment [dbl]
Attribute 'price_segment', divided into ranks for the calculation of the correlation coefficient according to Spearman.
Salary trend data
Description
Dataset containing the average salary of a footwear company's employees over 10 years.
Usage
salary_trends
Format
A tibble with 10 rows and 2 variables:
- year [dbl]
Year of record.
- avg_salary [dbl]
Average salary in the corresponding year.
Shipping error data
Description
Dataset recording errors in the shipment and the type of delivery requested by the customer in the process.
Usage
shipping_errors
Format
A tibble with 576 rows and 2 variables:
- error [fct]
Indicator whether or not an error appeared during shipment.
- shipping [fct]
Shipment method requested by the customer.
Sorting times
Description
The dataset contains throughput times of a machine recorded in order to compare them to the manufacturer's specifications.
Usage
sorting_times
Format
A tibble with 60 rows and 1 variable:
- sorting_time [dbl]
Recorded time (in seconds) required by the machine for sorting individual parts.