The Wayback Machine - https://web.archive.org/web/20201108111643/https://github.com/topics/pandas
Skip to content
#

pandas

Here are 8,353 public repositories matching this topic...

jreback
jreback commented Oct 11, 2020

create a mi

In [13]: mi = pd.MultiIndex.from_product([[1,2,3], list('abc'), pd.date_range('20200101', periods=2, tz='UTC')], names=['int', 'string', 'dt'])                            

You can already get .dtypes, but is slightly cumbersome. I would propose adding Multidex.dtypes (we already have MultiIndex.dtype but its always object). I think this is worth the convenience api.

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
  • Updated Oct 1, 2020
  • Python
jsignell
jsignell commented Nov 5, 2020

I just ran into an issue when trying to use to_csv with distributed workers that don't share a file system. I shouldn't have been surprised that writing to a local file system from a distributed worker doesn't work. It shouldn't work. But the error I got was just a File Not Found error. That brought me to:dask/dask#2656 (comment) - which was the answer.

vuule
vuule commented Nov 4, 2020

Current default value for rows_per_chunk parameter of the CSV writer is 8, which means that the input table is by default broken into many small slices that are written out sequentially. This reduces the performance by an order on magnitude in some cases.

In Python layer, the default is the number of rows (i.e. write table out in a single pass). We can follow this by setting rows_per_chunk

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
  • Updated Nov 6, 2020
  • Python

人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
  • Updated Feb 6, 2020

Improve this page

Add a description, image, and links to the pandas topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pandas topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.