Python script for to evaluate business dataData cleansing and formatting scriptOptimizing a python script...

What kind of hardware implements Fourier transform?

Is it a fallacy if someone claims they need an explanation for every word of your argument to the point where they don't understand common terms?

How would an AI self awareness kill switch work?

What to do when being responsible for data protection in your lab, yet advice is ignored?

What formula could mimic the following curve?

When does coming up with an idea constitute sufficient contribution for authorship?

Why do neural networks need so many training examples to perform?

Does the "particle exchange" operator have any validity?

What are the advantages of using `make` for small projects?

Can you earn endless XP using a Flameskull and its self-revival feature?

Reference on complex cobordism

How can I deal with a significant flaw I found in my previous supervisor’s paper?

Why is button three on trumpet almost never used alone?

How did the original light saber work?

How to remove trailing forward slash

What to do if authors don't respond to my serious concerns about their paper?

Can pricing be copyrighted?

What's a good word to describe a public place that looks like it wouldn't be rough?

Is it safe to try charging my laptop with a USB-C PD charger that has less wattage than recommended?

Do authors have to be politically correct in article-writing?

What makes the Forgotten Realms "forgotten"?

Why did Jodrell Bank assist the Soviet Union to collect data from their spacecraft in the mid 1960's?

Why can a 352GB NumPy ndarray be used on an 8GB memory macOS computer?

What do you call a fact that doesn't match the settings?



Python script for to evaluate business data


Data cleansing and formatting scriptOptimizing a python script with large arraysPython Script to query OSRM (drive-times)Python - Faster random business date generationBack-testing algorithm for equityPython/Pandas code creating 12 month product balance forecastLinear Regression on random dataPythonic way of managing data for plotting 2D variance plotstwitter data mining script in pythonSet of one-hot encoders in Python













4












$begingroup$


The following script is part of a further education I'm currently enrolled into.



Not all of the code is written by myself. Function signatures for example. Therefore I have put the sections, written by myself, put into # -- Own ---- comment-lines.



Moreover you were given three CSV-files with business data from different cities. The imaginary business was a bikeshare-company.



#!/usr/local/bin/python3
import time
import pandas as pd
import numpy as np
# --- Own Start ----------------------------------------------------------
CITY_DATA = { 'chicago': 'chicago.csv',
'new york city': 'new_york_city.csv',
'washington': 'washington.csv' }
feasible_cities = [ "new york city", "chicago", "washington" ]
feasible_months = [ "january", "february", "march", "april", "may", "june", "all" ]
feasible_days = [ "monday", "tuesday", "wednesday", "thursday",
"friday", "saturday", "sunday", "all" ]

def ask_user_selection(options, prompt_message):
answer = ""
while len(answer) == 0:
answer = input(prompt_message)
answer = answer.strip().lower()

if answer in options:
return answer
else:
answer = ""
print("Please enter one of the offered options.n")
# -- Own END -----------------------------------------------------------------------------------
def get_filters():
"""
Asks user to specify a city, month, and day to analyze.

Returns:
(str) city - name of the city to analyze
(str) month - name of the month to filter by, or "all" to apply no month filter
(str) day - name of the day of week to filter by, or "all" to apply no day filter
"""
print('n ---- Hello! Let's explore some US bikeshare data! ----n')
# --- Own Start ----------------------------------------------------------
# get user input for city (chicago, new york city, washington). HINT: Use a while loop to handle invalid inputs
city = ask_user_selection(
feasible_cities,
"Please enter: 'new york city', 'chicago' or 'washington' > ")

# get user input for month (all, january, february, ... , june)
month = ask_user_selection(
feasible_months,
"Please enter month: 'january', 'february', 'march', 'april', 'may', 'june' or 'all' > ")

# get user input for day of week (all, monday, tuesday, ... sunday)
day = ask_user_selection(
feasible_days,
"Please enter day: 'monday', 'tuesday', 'wednesday', 'thursday', 'friday', 'saturday', 'sunday' or 'all' > ")

print('-'*40)
return city, month, day
# --- Own End ----------------------------------------------------------


def load_data(city, month, day):
"""
Loads data for the specified city and filters by month and day if applicable.

Args:
(str) city - name of the city to analyze
(str) month - name of the month to filter by, or "all" to apply no month filter
(str) day - name of the day of week to filter by, or "all" to apply no day filter
Returns:
df - Pandas DataFrame containing city data filtered by month and day
"""
# --- Own Start ----------------------------------------------------------
df = pd.read_csv(CITY_DATA[city], index_col = 0)

df['Start Time'] = pd.to_datetime(df['Start Time']) # Casting "Start Time" to datetime.
df["month"] = df['Start Time'].dt.month # Get the weekday out of the "Start Time" value.
df["week_day"] = df['Start Time'].dt.weekday_name # Month-part from "Start Time" value.
df["start_hour"] = df['Start Time'].dt.hour # Hour-part from "Start Time" value.
df["start_end"] = df['Start Station'].astype(str) + ' to ' + df['End Station']

if month != 'all':
month_index = feasible_months.index(month) + 1 # Get the list-index of the month.
df = df[df["month"] == month_index ] # Establish a filter for month.

if day != 'all':
df = df[df["week_day"] == day.title() ] # Establish a filter for week day.

return df
# --- Own End ----------------------------------------------------------


def time_stats(df):
"""Displays statistics on the most frequent times of travel."""
# --- Own Start ----------------------------------------------------------
print('nCalculating The Most Frequent Times of Travel ... n')
start_time = time.time()

# display the most common month
month_index = df["month"].mode()[0] - 1
most_common_month = feasible_months[month_index].title()

print("Most common month: ", most_common_month)

# display the most common day of week
most_common_day = df["week_day"].mode()[0]
print("Most common day: ", most_common_day)

# display the most common start hour
most_common_hour = df["start_hour"].mode()[0]
print("Most common hour: ", most_common_hour)

print("nThis took %s seconds." % (time.time() - start_time))
print('-'*40)
# --- Own End ----------------------------------------------------------


def station_stats(df):
"""Displays statistics on the most popular stations and trip."""
# --- Own Start ----------------------------------------------------------
print('nCalculating The Most Popular Stations and Trip ...n')
start_time = time.time()

# display most commonly used start station
most_used_start = df['Start Station'].mode()[0]
print("Most used start: ", most_used_start)

# display most commonly used end station
most_used_end = df['End Station'].mode()[0]
print("Most used end: ", most_used_end)

# display most frequent combination of start station and end station trip
most_common_combination = df["start_end"].mode()[0]
print("Most common used combination concerning start- and end-station: ",
most_common_combination)

print("nThis took %s seconds." % (time.time() - start_time))
print('-'*40)
# --- Own End ----------------------------------------------------------


def trip_duration_stats(df):
"""Displays statistics on the total and average trip duration."""
# --- Own Start ----------------------------------------------------------
print("nCalculating Trip Duration ...n")
start_time = time.time()

# display total travel time
total_travel_time = df["Trip Duration"].sum()
print("Total time of travel: ", total_travel_time)

# display mean travel time
average_time = df["Trip Duration"].mean()
print("The average travel-time: ", average_time)

print("nThis took %s seconds." % (time.time() - start_time))
print('-'*40)
# --- Own End ----------------------------------------------------------


def user_stats(df):
"""Displays statistics on bikeshare users."""
# --- Own Start ----------------------------------------------------------
print('nCalculating User Stats ...n')
start_time = time.time()

# Display counts of user types
print("Count of user types: ",
df["User Type"].value_counts())

# Display counts of gender
if "Gender" in df:
print("nCounts concerning client`s gender")
print("Male persons: ", df.query("Gender == 'Male'").Gender.count())
print("Female persons: ", df.query("Gender == 'Female'").Gender.count())

# Display earliest, most recent, and most common year of birth
if "Birth Year" in df:
print("nEarliest year of birth: ", df["Birth Year"].min())
print("Most recent year of birth: ", df["Birth Year"].max())
print("Most common year of birth: ", df["Birth Year"].value_counts().idxmax())

print("nThis took %s seconds." % (time.time() - start_time))
print('-'*40)
# --- Own End ----------------------------------------------------------


def main():
while True:
city, month, day = get_filters()
df = load_data(city, month, day)

time_stats(df)
station_stats(df)
trip_duration_stats(df)
user_stats(df)
# --- Own Start ----------------------------------------------------------
restart = input('nWould you like to restart? Enter yes or no.n')
if restart.lower() != 'yes':
break
# --- Own End ----------------------------------------------------------


if __name__ == "__main__":
main()


Here's a screenshot how it looks on the command line:



enter image description here



The script has passed the review. But nevertheless I would appreciate other opinions.



What have I done well and should keep it up? What could I have done better and why?










share|improve this question











$endgroup$

















    4












    $begingroup$


    The following script is part of a further education I'm currently enrolled into.



    Not all of the code is written by myself. Function signatures for example. Therefore I have put the sections, written by myself, put into # -- Own ---- comment-lines.



    Moreover you were given three CSV-files with business data from different cities. The imaginary business was a bikeshare-company.



    #!/usr/local/bin/python3
    import time
    import pandas as pd
    import numpy as np
    # --- Own Start ----------------------------------------------------------
    CITY_DATA = { 'chicago': 'chicago.csv',
    'new york city': 'new_york_city.csv',
    'washington': 'washington.csv' }
    feasible_cities = [ "new york city", "chicago", "washington" ]
    feasible_months = [ "january", "february", "march", "april", "may", "june", "all" ]
    feasible_days = [ "monday", "tuesday", "wednesday", "thursday",
    "friday", "saturday", "sunday", "all" ]

    def ask_user_selection(options, prompt_message):
    answer = ""
    while len(answer) == 0:
    answer = input(prompt_message)
    answer = answer.strip().lower()

    if answer in options:
    return answer
    else:
    answer = ""
    print("Please enter one of the offered options.n")
    # -- Own END -----------------------------------------------------------------------------------
    def get_filters():
    """
    Asks user to specify a city, month, and day to analyze.

    Returns:
    (str) city - name of the city to analyze
    (str) month - name of the month to filter by, or "all" to apply no month filter
    (str) day - name of the day of week to filter by, or "all" to apply no day filter
    """
    print('n ---- Hello! Let's explore some US bikeshare data! ----n')
    # --- Own Start ----------------------------------------------------------
    # get user input for city (chicago, new york city, washington). HINT: Use a while loop to handle invalid inputs
    city = ask_user_selection(
    feasible_cities,
    "Please enter: 'new york city', 'chicago' or 'washington' > ")

    # get user input for month (all, january, february, ... , june)
    month = ask_user_selection(
    feasible_months,
    "Please enter month: 'january', 'february', 'march', 'april', 'may', 'june' or 'all' > ")

    # get user input for day of week (all, monday, tuesday, ... sunday)
    day = ask_user_selection(
    feasible_days,
    "Please enter day: 'monday', 'tuesday', 'wednesday', 'thursday', 'friday', 'saturday', 'sunday' or 'all' > ")

    print('-'*40)
    return city, month, day
    # --- Own End ----------------------------------------------------------


    def load_data(city, month, day):
    """
    Loads data for the specified city and filters by month and day if applicable.

    Args:
    (str) city - name of the city to analyze
    (str) month - name of the month to filter by, or "all" to apply no month filter
    (str) day - name of the day of week to filter by, or "all" to apply no day filter
    Returns:
    df - Pandas DataFrame containing city data filtered by month and day
    """
    # --- Own Start ----------------------------------------------------------
    df = pd.read_csv(CITY_DATA[city], index_col = 0)

    df['Start Time'] = pd.to_datetime(df['Start Time']) # Casting "Start Time" to datetime.
    df["month"] = df['Start Time'].dt.month # Get the weekday out of the "Start Time" value.
    df["week_day"] = df['Start Time'].dt.weekday_name # Month-part from "Start Time" value.
    df["start_hour"] = df['Start Time'].dt.hour # Hour-part from "Start Time" value.
    df["start_end"] = df['Start Station'].astype(str) + ' to ' + df['End Station']

    if month != 'all':
    month_index = feasible_months.index(month) + 1 # Get the list-index of the month.
    df = df[df["month"] == month_index ] # Establish a filter for month.

    if day != 'all':
    df = df[df["week_day"] == day.title() ] # Establish a filter for week day.

    return df
    # --- Own End ----------------------------------------------------------


    def time_stats(df):
    """Displays statistics on the most frequent times of travel."""
    # --- Own Start ----------------------------------------------------------
    print('nCalculating The Most Frequent Times of Travel ... n')
    start_time = time.time()

    # display the most common month
    month_index = df["month"].mode()[0] - 1
    most_common_month = feasible_months[month_index].title()

    print("Most common month: ", most_common_month)

    # display the most common day of week
    most_common_day = df["week_day"].mode()[0]
    print("Most common day: ", most_common_day)

    # display the most common start hour
    most_common_hour = df["start_hour"].mode()[0]
    print("Most common hour: ", most_common_hour)

    print("nThis took %s seconds." % (time.time() - start_time))
    print('-'*40)
    # --- Own End ----------------------------------------------------------


    def station_stats(df):
    """Displays statistics on the most popular stations and trip."""
    # --- Own Start ----------------------------------------------------------
    print('nCalculating The Most Popular Stations and Trip ...n')
    start_time = time.time()

    # display most commonly used start station
    most_used_start = df['Start Station'].mode()[0]
    print("Most used start: ", most_used_start)

    # display most commonly used end station
    most_used_end = df['End Station'].mode()[0]
    print("Most used end: ", most_used_end)

    # display most frequent combination of start station and end station trip
    most_common_combination = df["start_end"].mode()[0]
    print("Most common used combination concerning start- and end-station: ",
    most_common_combination)

    print("nThis took %s seconds." % (time.time() - start_time))
    print('-'*40)
    # --- Own End ----------------------------------------------------------


    def trip_duration_stats(df):
    """Displays statistics on the total and average trip duration."""
    # --- Own Start ----------------------------------------------------------
    print("nCalculating Trip Duration ...n")
    start_time = time.time()

    # display total travel time
    total_travel_time = df["Trip Duration"].sum()
    print("Total time of travel: ", total_travel_time)

    # display mean travel time
    average_time = df["Trip Duration"].mean()
    print("The average travel-time: ", average_time)

    print("nThis took %s seconds." % (time.time() - start_time))
    print('-'*40)
    # --- Own End ----------------------------------------------------------


    def user_stats(df):
    """Displays statistics on bikeshare users."""
    # --- Own Start ----------------------------------------------------------
    print('nCalculating User Stats ...n')
    start_time = time.time()

    # Display counts of user types
    print("Count of user types: ",
    df["User Type"].value_counts())

    # Display counts of gender
    if "Gender" in df:
    print("nCounts concerning client`s gender")
    print("Male persons: ", df.query("Gender == 'Male'").Gender.count())
    print("Female persons: ", df.query("Gender == 'Female'").Gender.count())

    # Display earliest, most recent, and most common year of birth
    if "Birth Year" in df:
    print("nEarliest year of birth: ", df["Birth Year"].min())
    print("Most recent year of birth: ", df["Birth Year"].max())
    print("Most common year of birth: ", df["Birth Year"].value_counts().idxmax())

    print("nThis took %s seconds." % (time.time() - start_time))
    print('-'*40)
    # --- Own End ----------------------------------------------------------


    def main():
    while True:
    city, month, day = get_filters()
    df = load_data(city, month, day)

    time_stats(df)
    station_stats(df)
    trip_duration_stats(df)
    user_stats(df)
    # --- Own Start ----------------------------------------------------------
    restart = input('nWould you like to restart? Enter yes or no.n')
    if restart.lower() != 'yes':
    break
    # --- Own End ----------------------------------------------------------


    if __name__ == "__main__":
    main()


    Here's a screenshot how it looks on the command line:



    enter image description here



    The script has passed the review. But nevertheless I would appreciate other opinions.



    What have I done well and should keep it up? What could I have done better and why?










    share|improve this question











    $endgroup$















      4












      4








      4


      1



      $begingroup$


      The following script is part of a further education I'm currently enrolled into.



      Not all of the code is written by myself. Function signatures for example. Therefore I have put the sections, written by myself, put into # -- Own ---- comment-lines.



      Moreover you were given three CSV-files with business data from different cities. The imaginary business was a bikeshare-company.



      #!/usr/local/bin/python3
      import time
      import pandas as pd
      import numpy as np
      # --- Own Start ----------------------------------------------------------
      CITY_DATA = { 'chicago': 'chicago.csv',
      'new york city': 'new_york_city.csv',
      'washington': 'washington.csv' }
      feasible_cities = [ "new york city", "chicago", "washington" ]
      feasible_months = [ "january", "february", "march", "april", "may", "june", "all" ]
      feasible_days = [ "monday", "tuesday", "wednesday", "thursday",
      "friday", "saturday", "sunday", "all" ]

      def ask_user_selection(options, prompt_message):
      answer = ""
      while len(answer) == 0:
      answer = input(prompt_message)
      answer = answer.strip().lower()

      if answer in options:
      return answer
      else:
      answer = ""
      print("Please enter one of the offered options.n")
      # -- Own END -----------------------------------------------------------------------------------
      def get_filters():
      """
      Asks user to specify a city, month, and day to analyze.

      Returns:
      (str) city - name of the city to analyze
      (str) month - name of the month to filter by, or "all" to apply no month filter
      (str) day - name of the day of week to filter by, or "all" to apply no day filter
      """
      print('n ---- Hello! Let's explore some US bikeshare data! ----n')
      # --- Own Start ----------------------------------------------------------
      # get user input for city (chicago, new york city, washington). HINT: Use a while loop to handle invalid inputs
      city = ask_user_selection(
      feasible_cities,
      "Please enter: 'new york city', 'chicago' or 'washington' > ")

      # get user input for month (all, january, february, ... , june)
      month = ask_user_selection(
      feasible_months,
      "Please enter month: 'january', 'february', 'march', 'april', 'may', 'june' or 'all' > ")

      # get user input for day of week (all, monday, tuesday, ... sunday)
      day = ask_user_selection(
      feasible_days,
      "Please enter day: 'monday', 'tuesday', 'wednesday', 'thursday', 'friday', 'saturday', 'sunday' or 'all' > ")

      print('-'*40)
      return city, month, day
      # --- Own End ----------------------------------------------------------


      def load_data(city, month, day):
      """
      Loads data for the specified city and filters by month and day if applicable.

      Args:
      (str) city - name of the city to analyze
      (str) month - name of the month to filter by, or "all" to apply no month filter
      (str) day - name of the day of week to filter by, or "all" to apply no day filter
      Returns:
      df - Pandas DataFrame containing city data filtered by month and day
      """
      # --- Own Start ----------------------------------------------------------
      df = pd.read_csv(CITY_DATA[city], index_col = 0)

      df['Start Time'] = pd.to_datetime(df['Start Time']) # Casting "Start Time" to datetime.
      df["month"] = df['Start Time'].dt.month # Get the weekday out of the "Start Time" value.
      df["week_day"] = df['Start Time'].dt.weekday_name # Month-part from "Start Time" value.
      df["start_hour"] = df['Start Time'].dt.hour # Hour-part from "Start Time" value.
      df["start_end"] = df['Start Station'].astype(str) + ' to ' + df['End Station']

      if month != 'all':
      month_index = feasible_months.index(month) + 1 # Get the list-index of the month.
      df = df[df["month"] == month_index ] # Establish a filter for month.

      if day != 'all':
      df = df[df["week_day"] == day.title() ] # Establish a filter for week day.

      return df
      # --- Own End ----------------------------------------------------------


      def time_stats(df):
      """Displays statistics on the most frequent times of travel."""
      # --- Own Start ----------------------------------------------------------
      print('nCalculating The Most Frequent Times of Travel ... n')
      start_time = time.time()

      # display the most common month
      month_index = df["month"].mode()[0] - 1
      most_common_month = feasible_months[month_index].title()

      print("Most common month: ", most_common_month)

      # display the most common day of week
      most_common_day = df["week_day"].mode()[0]
      print("Most common day: ", most_common_day)

      # display the most common start hour
      most_common_hour = df["start_hour"].mode()[0]
      print("Most common hour: ", most_common_hour)

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def station_stats(df):
      """Displays statistics on the most popular stations and trip."""
      # --- Own Start ----------------------------------------------------------
      print('nCalculating The Most Popular Stations and Trip ...n')
      start_time = time.time()

      # display most commonly used start station
      most_used_start = df['Start Station'].mode()[0]
      print("Most used start: ", most_used_start)

      # display most commonly used end station
      most_used_end = df['End Station'].mode()[0]
      print("Most used end: ", most_used_end)

      # display most frequent combination of start station and end station trip
      most_common_combination = df["start_end"].mode()[0]
      print("Most common used combination concerning start- and end-station: ",
      most_common_combination)

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def trip_duration_stats(df):
      """Displays statistics on the total and average trip duration."""
      # --- Own Start ----------------------------------------------------------
      print("nCalculating Trip Duration ...n")
      start_time = time.time()

      # display total travel time
      total_travel_time = df["Trip Duration"].sum()
      print("Total time of travel: ", total_travel_time)

      # display mean travel time
      average_time = df["Trip Duration"].mean()
      print("The average travel-time: ", average_time)

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def user_stats(df):
      """Displays statistics on bikeshare users."""
      # --- Own Start ----------------------------------------------------------
      print('nCalculating User Stats ...n')
      start_time = time.time()

      # Display counts of user types
      print("Count of user types: ",
      df["User Type"].value_counts())

      # Display counts of gender
      if "Gender" in df:
      print("nCounts concerning client`s gender")
      print("Male persons: ", df.query("Gender == 'Male'").Gender.count())
      print("Female persons: ", df.query("Gender == 'Female'").Gender.count())

      # Display earliest, most recent, and most common year of birth
      if "Birth Year" in df:
      print("nEarliest year of birth: ", df["Birth Year"].min())
      print("Most recent year of birth: ", df["Birth Year"].max())
      print("Most common year of birth: ", df["Birth Year"].value_counts().idxmax())

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def main():
      while True:
      city, month, day = get_filters()
      df = load_data(city, month, day)

      time_stats(df)
      station_stats(df)
      trip_duration_stats(df)
      user_stats(df)
      # --- Own Start ----------------------------------------------------------
      restart = input('nWould you like to restart? Enter yes or no.n')
      if restart.lower() != 'yes':
      break
      # --- Own End ----------------------------------------------------------


      if __name__ == "__main__":
      main()


      Here's a screenshot how it looks on the command line:



      enter image description here



      The script has passed the review. But nevertheless I would appreciate other opinions.



      What have I done well and should keep it up? What could I have done better and why?










      share|improve this question











      $endgroup$




      The following script is part of a further education I'm currently enrolled into.



      Not all of the code is written by myself. Function signatures for example. Therefore I have put the sections, written by myself, put into # -- Own ---- comment-lines.



      Moreover you were given three CSV-files with business data from different cities. The imaginary business was a bikeshare-company.



      #!/usr/local/bin/python3
      import time
      import pandas as pd
      import numpy as np
      # --- Own Start ----------------------------------------------------------
      CITY_DATA = { 'chicago': 'chicago.csv',
      'new york city': 'new_york_city.csv',
      'washington': 'washington.csv' }
      feasible_cities = [ "new york city", "chicago", "washington" ]
      feasible_months = [ "january", "february", "march", "april", "may", "june", "all" ]
      feasible_days = [ "monday", "tuesday", "wednesday", "thursday",
      "friday", "saturday", "sunday", "all" ]

      def ask_user_selection(options, prompt_message):
      answer = ""
      while len(answer) == 0:
      answer = input(prompt_message)
      answer = answer.strip().lower()

      if answer in options:
      return answer
      else:
      answer = ""
      print("Please enter one of the offered options.n")
      # -- Own END -----------------------------------------------------------------------------------
      def get_filters():
      """
      Asks user to specify a city, month, and day to analyze.

      Returns:
      (str) city - name of the city to analyze
      (str) month - name of the month to filter by, or "all" to apply no month filter
      (str) day - name of the day of week to filter by, or "all" to apply no day filter
      """
      print('n ---- Hello! Let's explore some US bikeshare data! ----n')
      # --- Own Start ----------------------------------------------------------
      # get user input for city (chicago, new york city, washington). HINT: Use a while loop to handle invalid inputs
      city = ask_user_selection(
      feasible_cities,
      "Please enter: 'new york city', 'chicago' or 'washington' > ")

      # get user input for month (all, january, february, ... , june)
      month = ask_user_selection(
      feasible_months,
      "Please enter month: 'january', 'february', 'march', 'april', 'may', 'june' or 'all' > ")

      # get user input for day of week (all, monday, tuesday, ... sunday)
      day = ask_user_selection(
      feasible_days,
      "Please enter day: 'monday', 'tuesday', 'wednesday', 'thursday', 'friday', 'saturday', 'sunday' or 'all' > ")

      print('-'*40)
      return city, month, day
      # --- Own End ----------------------------------------------------------


      def load_data(city, month, day):
      """
      Loads data for the specified city and filters by month and day if applicable.

      Args:
      (str) city - name of the city to analyze
      (str) month - name of the month to filter by, or "all" to apply no month filter
      (str) day - name of the day of week to filter by, or "all" to apply no day filter
      Returns:
      df - Pandas DataFrame containing city data filtered by month and day
      """
      # --- Own Start ----------------------------------------------------------
      df = pd.read_csv(CITY_DATA[city], index_col = 0)

      df['Start Time'] = pd.to_datetime(df['Start Time']) # Casting "Start Time" to datetime.
      df["month"] = df['Start Time'].dt.month # Get the weekday out of the "Start Time" value.
      df["week_day"] = df['Start Time'].dt.weekday_name # Month-part from "Start Time" value.
      df["start_hour"] = df['Start Time'].dt.hour # Hour-part from "Start Time" value.
      df["start_end"] = df['Start Station'].astype(str) + ' to ' + df['End Station']

      if month != 'all':
      month_index = feasible_months.index(month) + 1 # Get the list-index of the month.
      df = df[df["month"] == month_index ] # Establish a filter for month.

      if day != 'all':
      df = df[df["week_day"] == day.title() ] # Establish a filter for week day.

      return df
      # --- Own End ----------------------------------------------------------


      def time_stats(df):
      """Displays statistics on the most frequent times of travel."""
      # --- Own Start ----------------------------------------------------------
      print('nCalculating The Most Frequent Times of Travel ... n')
      start_time = time.time()

      # display the most common month
      month_index = df["month"].mode()[0] - 1
      most_common_month = feasible_months[month_index].title()

      print("Most common month: ", most_common_month)

      # display the most common day of week
      most_common_day = df["week_day"].mode()[0]
      print("Most common day: ", most_common_day)

      # display the most common start hour
      most_common_hour = df["start_hour"].mode()[0]
      print("Most common hour: ", most_common_hour)

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def station_stats(df):
      """Displays statistics on the most popular stations and trip."""
      # --- Own Start ----------------------------------------------------------
      print('nCalculating The Most Popular Stations and Trip ...n')
      start_time = time.time()

      # display most commonly used start station
      most_used_start = df['Start Station'].mode()[0]
      print("Most used start: ", most_used_start)

      # display most commonly used end station
      most_used_end = df['End Station'].mode()[0]
      print("Most used end: ", most_used_end)

      # display most frequent combination of start station and end station trip
      most_common_combination = df["start_end"].mode()[0]
      print("Most common used combination concerning start- and end-station: ",
      most_common_combination)

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def trip_duration_stats(df):
      """Displays statistics on the total and average trip duration."""
      # --- Own Start ----------------------------------------------------------
      print("nCalculating Trip Duration ...n")
      start_time = time.time()

      # display total travel time
      total_travel_time = df["Trip Duration"].sum()
      print("Total time of travel: ", total_travel_time)

      # display mean travel time
      average_time = df["Trip Duration"].mean()
      print("The average travel-time: ", average_time)

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def user_stats(df):
      """Displays statistics on bikeshare users."""
      # --- Own Start ----------------------------------------------------------
      print('nCalculating User Stats ...n')
      start_time = time.time()

      # Display counts of user types
      print("Count of user types: ",
      df["User Type"].value_counts())

      # Display counts of gender
      if "Gender" in df:
      print("nCounts concerning client`s gender")
      print("Male persons: ", df.query("Gender == 'Male'").Gender.count())
      print("Female persons: ", df.query("Gender == 'Female'").Gender.count())

      # Display earliest, most recent, and most common year of birth
      if "Birth Year" in df:
      print("nEarliest year of birth: ", df["Birth Year"].min())
      print("Most recent year of birth: ", df["Birth Year"].max())
      print("Most common year of birth: ", df["Birth Year"].value_counts().idxmax())

      print("nThis took %s seconds." % (time.time() - start_time))
      print('-'*40)
      # --- Own End ----------------------------------------------------------


      def main():
      while True:
      city, month, day = get_filters()
      df = load_data(city, month, day)

      time_stats(df)
      station_stats(df)
      trip_duration_stats(df)
      user_stats(df)
      # --- Own Start ----------------------------------------------------------
      restart = input('nWould you like to restart? Enter yes or no.n')
      if restart.lower() != 'yes':
      break
      # --- Own End ----------------------------------------------------------


      if __name__ == "__main__":
      main()


      Here's a screenshot how it looks on the command line:



      enter image description here



      The script has passed the review. But nevertheless I would appreciate other opinions.



      What have I done well and should keep it up? What could I have done better and why?







      python numpy pandas






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited 1 min ago









      Jamal

      30.3k11119227




      30.3k11119227










      asked Dec 16 '18 at 15:32









      michael.zechmichael.zech

      1,7281734




      1,7281734






















          1 Answer
          1






          active

          oldest

          votes


















          1












          $begingroup$

          The ask_user_selection function could be implemented a bit simpler,
          by using a while True: loop, and an early return:



          def ask_user_selection(options, prompt_message):
          while True:
          answer = input(prompt_message).strip().lower()

          if answer in options:
          return answer

          print("Please enter one of the offered options.n")





          share|improve this answer









          $endgroup$













            Your Answer





            StackExchange.ifUsing("editor", function () {
            return StackExchange.using("mathjaxEditing", function () {
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["\$", "\$"]]);
            });
            });
            }, "mathjax-editing");

            StackExchange.ifUsing("editor", function () {
            StackExchange.using("externalEditor", function () {
            StackExchange.using("snippets", function () {
            StackExchange.snippets.init();
            });
            });
            }, "code-snippets");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "196"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














            draft saved

            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fcodereview.stackexchange.com%2fquestions%2f209768%2fpython-script-for-to-evaluate-business-data%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            1












            $begingroup$

            The ask_user_selection function could be implemented a bit simpler,
            by using a while True: loop, and an early return:



            def ask_user_selection(options, prompt_message):
            while True:
            answer = input(prompt_message).strip().lower()

            if answer in options:
            return answer

            print("Please enter one of the offered options.n")





            share|improve this answer









            $endgroup$


















              1












              $begingroup$

              The ask_user_selection function could be implemented a bit simpler,
              by using a while True: loop, and an early return:



              def ask_user_selection(options, prompt_message):
              while True:
              answer = input(prompt_message).strip().lower()

              if answer in options:
              return answer

              print("Please enter one of the offered options.n")





              share|improve this answer









              $endgroup$
















                1












                1








                1





                $begingroup$

                The ask_user_selection function could be implemented a bit simpler,
                by using a while True: loop, and an early return:



                def ask_user_selection(options, prompt_message):
                while True:
                answer = input(prompt_message).strip().lower()

                if answer in options:
                return answer

                print("Please enter one of the offered options.n")





                share|improve this answer









                $endgroup$



                The ask_user_selection function could be implemented a bit simpler,
                by using a while True: loop, and an early return:



                def ask_user_selection(options, prompt_message):
                while True:
                answer = input(prompt_message).strip().lower()

                if answer in options:
                return answer

                print("Please enter one of the offered options.n")






                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Dec 16 '18 at 22:26









                janosjanos

                98.1k12125350




                98.1k12125350






























                    draft saved

                    draft discarded




















































                    Thanks for contributing an answer to Code Review Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    Use MathJax to format equations. MathJax reference.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fcodereview.stackexchange.com%2fquestions%2f209768%2fpython-script-for-to-evaluate-business-data%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Fairchild Swearingen Metro Inhaltsverzeichnis Geschichte | Innenausstattung | Nutzung | Zwischenfälle...

                    Pilgersdorf Inhaltsverzeichnis Geografie | Geschichte | Bevölkerungsentwicklung | Politik | Kultur...

                    Marineschifffahrtleitung Inhaltsverzeichnis Geschichte | Heutige Organisation der NATO | Nationale und...