Notes:

  • I hit two times the <ENTER> key after the question. Don’t know if that affected the answer.
  • I don’t have any preloaded context.
  • driving_crooner@lemmy.eco.br
    link
    fedilink
    arrow-up
    4
    ·
    4 months ago

    Sure, here are the details I have so far:

    1. You are interested in improving your workflow in data science and Python.
    2. You work with databases from different sources like CSVs, accdb, and SQL, and want to create a module to extract and transform these databases into pandas DataFrames. You frequently use CSVs in different projects and need to prepare them every time by copying and pasting code. You want to create a module to import the databases and configurations, including columns, names, dates, and dtypes, to streamline the ETL process.
    3. You are using rpy2 to interface between R and Python.
    4. You often need to analyze different combinations of columns in a DataFrame, ensuring some columns are always analyzed together.

    Let me know if there’s anything you’d like to modify or add!

    1 and 3 are pretty good as baseline for my needs, but 2 and 4 were just specific problems I had to solve that are not something I need generally.

    • NegativeInf@lemmy.world
      link
      fedilink
      arrow-up
      2
      arrow-down
      1
      ·
      4 months ago

      Yours is short. Mine was 47 points long and had so much stuff about questions I had asked.

      Importantly, it did list that I explicitly do not want it to show me how to make unit tests.