
Data Management Resources

File Naming Guidelines

There are two general rules for file organization: be consistent and be descriptive. You want to make sure you and your colleagues can find anything you are looking for quickly. You'll need to figure out which descriptors make the most sense for you and record your convention in a place everyone in your research group can access. Here are some guidelines to incorporate into your convention:

  • Choose 2-3 descriptors to identify the project or collection the item belongs to and what the specific item is. Have a standard for your research group so things can easily be found or shared.
  • Use capitals (camel case) or underscores instead of periods or spaces. Examples: surveyResponseData.csv or survey_response_data.csv
  • Use no more than 30 characters whenever possible
  • Use date format ISO 8601: YYYY-MM-DD
    • The year-first format makes it easy to find the newest/oldest files. Wikipedia's ISO 8601 page provides additional information on the date and time standard.
  • Avoid special characters in a file name. Common things to avoid are spaces and ampersands (&).
  • Document your naming convention so you remember what it is and your collaborators know what it is.
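The guidelines above can be sketched as a small Python helper. This is only an illustration, not an official tool: the function names `build_file_name` and `check_file_name` are hypothetical, the 30-character limit is assumed to count the extension, and hyphens are assumed to be acceptable because ISO 8601 dates contain them.

```python
import re
from datetime import date
from pathlib import PurePath

MAX_LENGTH = 30  # assumption: the limit applies to the whole name, extension included


def build_file_name(descriptors, when, extension):
    """Join the descriptors and an ISO 8601 date (YYYY-MM-DD) with underscores."""
    stem = "_".join(list(descriptors) + [when.isoformat()])
    return f"{stem}.{extension}"


def check_file_name(name):
    """Return a list of guideline problems found in a file name (empty if none)."""
    problems = []
    if len(name) > MAX_LENGTH:
        problems.append(f"longer than {MAX_LENGTH} characters")
    # Everything before the final extension; extra periods are flagged below.
    stem = PurePath(name).stem
    # Allow only letters, digits, underscores, and hyphens in the stem.
    if re.search(r"[^A-Za-z0-9_-]", stem):
        problems.append("contains spaces, periods, or special characters")
    return problems


name = build_file_name(["survey", "data"], date(2024, 3, 5), "csv")
print(name, check_file_name(name))
print(check_file_name("survey responses & notes.csv"))
```

The first call builds `survey_data_2024-03-05.csv`, which passes both checks; the second name is flagged because it contains spaces and an ampersand. A research group could extend the checks with its own agreed descriptors.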

File Naming Convention Template sample screenshot

Feel free to use our File Naming Convention Template to help you and your team create meaningful file names that follow best practices. Once you fill it out, print it out and put it in a prominent place in your workplace. If you work on collaborative files, designate someone to hold your team accountable for their file naming practices.

File Directory Organization

One of the key aspects of data management is keeping an organized file storage system. If you have a lean and solid way of organizing your files into directories, it saves you from having to continuously look through directories to find the right file. There is no 'best' way to organize your directories, but we have found that there are wrong ways. Below are the methods we have found that make for smoother file organization:

  • Always keep original data downloads untouched. When you download a dataset or export it from a tool, keep it in a 'raw' file directory and then copy the file to edit or work with.
  • Structure your directories in a nested manner around 3-4 directories deep
  • Use descriptive directory names to communicate what the directory holds
  • For shared files, document what each directory contains and make sure everyone follows the organizing convention
  • Arrange your directories by elements such as project, initiative, fiscal year, calendar year, experiment, investigator, course, or specimen.
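The first practice in the list, keeping raw data untouched and editing only a copy, might look like the following in Python. This is a minimal sketch under stated assumptions: the function name `copy_raw_for_editing` and the directory names are hypothetical, and the refusal to overwrite an existing working copy is a design choice of this example rather than a rule from the guide.

```python
import shutil
from pathlib import Path


def copy_raw_for_editing(raw_file, working_dir):
    """Copy a file out of the 'raw' directory so the original is never edited."""
    raw_file = Path(raw_file)
    working_dir = Path(working_dir)
    working_dir.mkdir(parents=True, exist_ok=True)
    target = working_dir / raw_file.name
    # Assumption for this sketch: never silently overwrite an existing working copy.
    if target.exists():
        raise FileExistsError(f"{target} already exists")
    shutil.copy2(raw_file, target)  # copy2 also preserves file timestamps
    return target
```

A call such as `copy_raw_for_editing("raw/survey_data.csv", "working")` (hypothetical paths) leaves the file under `raw` unchanged and returns the path of the copy to edit.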

A good way to brainstorm the top directory structure is to identify the largest "buckets" of content you will have and what attributes they have in common. For example, to organize files for college courses, the buckets of content might be academic year, course, homework, and readings. Using the biggest bucket first, create nested directories to hold the content.

Download our Template Directory Structure to look at how you might structure a complex and data-heavy research project. You can adjust this template to suit your research needs and help map out the structure of your directories.

Example Directory Organization for College Courses (slash indicates directory):

  • \2023_Fall
  • \2024_Spring 
    • \PSYCH_342
      • Readings
      • Papers
      • Final_Projects
    • \CRIM_400
      • Lectures
      • Papers
      • Final_Projects
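A nested layout like the example above can also be created with a short script instead of by hand. The sketch below is illustrative only: the `COURSE_LAYOUT` dictionary and the `create_directories` function are hypothetical names, and the layout simply mirrors the college course example.

```python
from pathlib import Path

# Each key is a directory name; each value holds its subdirectories.
COURSE_LAYOUT = {
    "2023_Fall": {},
    "2024_Spring": {
        "PSYCH_342": {"Readings": {}, "Papers": {}, "Final_Projects": {}},
        "CRIM_400": {"Lectures": {}, "Papers": {}, "Final_Projects": {}},
    },
}


def create_directories(root, layout):
    """Recursively create the nested directories described by layout under root."""
    root = Path(root)
    created = []
    for name, children in layout.items():
        path = root / name
        path.mkdir(parents=True, exist_ok=True)  # safe to re-run on an existing tree
        created.append(path)
        created.extend(create_directories(path, children))
    return created
```

Calling `create_directories("Courses", COURSE_LAYOUT)` (with `Courses` as a hypothetical top-level folder) builds the whole tree three levels deep, which stays within the 3-4 level guideline.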

Version Control

Version control is the strategy of tracking changes and edits to files and directories. This allows you to revert to previous versions if you make a mistake or accidentally delete something! This can be a key practice for success on complicated projects and on collaborative teams.

Even if you're tracking changes with the software you're using, you should always keep a copy of the original unedited data and save a new version when substantial changes are made. It's like saving your progress in a video game along the way so you don't have to go back to the beginning after coming across an unexpected challenge.

There are two main ways of conducting version control:

  • Manual Version Control - the process of personally saving versions of your files throughout the research process. This is a good option for those who do not have files that cooperate with software version control (such as large video files or media files). The file storage system (Box, OneDrive) or software (MS Word) you use may have some built-in version control, but that is not the main application of the tool. Be consistent with when you save a new version and how you keep track of your system.
  • Software Version Control Systems - software specifically designed to version control code. These are more complex than manual version control, but are more powerful and integrate into your research process more easily. These systems save only the changes to your files instead of a new copy with each version.
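Manual version control, as described in the first bullet, can be made more consistent with a small script. The following is a sketch rather than a recommended tool: the function name `save_version` and the naming pattern `<name>_<YYYY-MM-DD>_vNN` are assumptions chosen to match the file naming guidelines, not a convention prescribed by this guide.

```python
import re
import shutil
from datetime import date
from pathlib import Path


def save_version(file_path, versions_dir, when=None):
    """Copy a file into versions_dir as <stem>_<YYYY-MM-DD>_vNN<suffix>."""
    source = Path(file_path)
    versions_dir = Path(versions_dir)
    versions_dir.mkdir(parents=True, exist_ok=True)
    when = when or date.today()
    # Find version numbers already used for this file (hypothetical pattern).
    pattern = re.compile(
        rf"^{re.escape(source.stem)}_\d{{4}}-\d{{2}}-\d{{2}}_v(\d+){re.escape(source.suffix)}$"
    )
    existing = [
        int(match.group(1))
        for path in versions_dir.iterdir()
        if (match := pattern.match(path.name))
    ]
    next_version = max(existing, default=0) + 1
    target = versions_dir / f"{source.stem}_{when.isoformat()}_v{next_version:02d}{source.suffix}"
    shutil.copy2(source, target)  # the working file itself is left in place
    return target
```

Each call stores a new numbered copy, so saving `analysis.csv` twice on the same day would yield copies ending in `_v01.csv` and `_v02.csv`. Unlike a software version control system, this keeps a full copy per version rather than only the changes.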

Research Data Engineer

Lauren Phegley
she/her

Lauren Phegley leads consultations on data management, DMPTool, writing Data Management Plans (DMPs), and data sharing.

Head of Research Data Services

Lynda Kellam
she/her

Director of Research Data & Digital Scholarship

See schedule button for current dates and times. Appointments available in person and on Zoom.

Subjects: Data & GIS