Should your shiny app be an R package? | Martin Frigaard | Data Science Hangout

Transcript

This transcript was generated automatically and may contain errors.

Hey there, welcome to the Posit Data Science Hangout. I'm Libby Heeren, and this is a recording of our weekly community call that happens every Thursday at 12pm US Eastern Time. If you are not joining us live, you miss out on the amazing chat that's going on. So find the link in the description where you can add our call to your calendar and come hang out with the most supportive, friendly, and funny data community you'll ever experience.

Okay, I'm so excited to introduce our featured leader today, Martin Frigaard. Martin is a senior Shiny application developer, and he's here to tell us all about his experience as a Shiny app developer and his journey. So get ready for all the questions about all things Shiny. Martin, would you introduce yourself, tell us a little bit about what you do, and something you like to do for fun?

Okay, great. Yeah, so I'm Martin Frigaard. I just want to say first off, I'm appearing on the hangout as myself. All the views and opinions are my own and do not represent those of my employer. That being said, I'm an R programmer and Shiny developer. What I do for fun, it sounds like a distant memory. I have two small children now, so most of what I do for fun involves what they want to do for fun. But before that, I spent a lot of time in my 20s and 30s doing a fair amount of Jiu Jitsu. I did a little bit of weightlifting and some other gym oriented stuff. And then for a small period of time, I like to consider myself an amateur journalist. So I spent a fair amount of time kind of in the journalism world. In fact, my first kind of start with R was writing for Northeastern University's journalism school; Storybench was their blog. So my first kind of like R tutorials way back in like 2015 or 16 was writing for a journalism school. So I still mostly just write now. I write things to myself, it seems like more and more. They start out as little notes and turn into long essays where I'm lecturing myself on advice that I don't take.

Becoming a Shiny developer

Well, we have a question that we can jump in and ask right away to get us started. And it is from Nathan, who says, how does one become a Shiny developer? I did not know that was a job title. Yeah, it wasn't. I think it was like broadly app developer, but now I will actually have recruiters and you'll see them list Shiny app developer because it's obviously become big enough to become its own thing.

I think that the path is probably, it's going to be more R programming. I really recommend spending some time with a little bit of time in package development. So kind of once you get started with the basics of R and syntax, a little bit of knowledge on package development will make you a better Shiny app developer. And then, yeah, if you're just getting started, I would really try to be as bilingual as possible between Shiny for Python and regular Shiny. I really like everything. Every time I pick up Shiny for Python, I'm impressed at the things it can do. And it seems like the crossover between the two, you would kind of want some of that flexibility.

I think that the package versus Shiny app distinction is not clear for a lot of people about why that's important. A lot of Shiny apps are actually developed as packages. Could you talk a little bit about your experience with that? Yeah, that's why I wrote a book called Shiny App-Packages.

Shiny app packages and frameworks

So when I started writing the book, there were a lot of questions. At the time, I was working in the biotech pharma space, and there was a lot of adoption of Shiny and questions about, you know, should we use golem? So these frameworks were released. And golem is a fantastic framework out of the ThinkR group. They have a book that really walks you through, I think it's Engineering Production-Grade Shiny Apps, so it's probably similar. But it's a really steep curve for package development. So there's a great book also adequately or aptly called R Packages. I think it's in its third edition now. That's a great overview of how to develop your R code into a package. The R reference manual has its own, I think it's called Writing R Extensions, that has a lot of that information, but really in a kind of like hard to find or navigate way. So R Packages has a bunch of great information on how to make your R code easier to download and install on any computer.

And so what I was missing in my role was, well, what if we don't want to adopt a framework? What if we just want to develop a Shiny app as an R package? And so the book was written just as a, you know, there was a gap in Shiny app development. And actually, this is like, I can go off on a tangent here. So you remember Joe Cheng was going to write the Shiny book. And Joe Cheng has said this on stage, so I don't feel bad throwing him under the bus here. Joe Cheng didn't write the Shiny book, but Hadley Wickham wrote the Shiny book. And Hadley Wickham, like packages are introduced in, I think it's like chapter 19 or 20 of Mastering Shiny. And it should be like, in my opinion, it should be like chapter two or three. Because so much of your development of a Shiny, of a good Shiny application, you know, will be driven by a lot of package development behaviors and tools. So things like loading the code, things like writing tests, that kind of stuff to make sure that your application, especially if you're going into a production environment.

You know, Shiny App-Packages basically was, I think I even have like a Venn diagram. It was like, you know, there was this spot missing of, okay, I know how to develop an R package. There's all this great, like how to build a Shiny app, but like, how do I build an R package with a Shiny app that isn't in a framework? When I wrote it, I think Rhino was like in its infancy. I think it's a little bit bigger now. And Rhino is its own animal, not just literally, but figuratively. It's not a package. So Rhino applications are not packages. They actually use, I don't want to get, people are familiar, they use box, which is this kind of different, unique way of maintaining dependencies.

Rhino is more of a framework. It's an integrated framework. Yeah. It helps you use software engineering principles really more to do what you're doing with your Shiny app development. And being very, very precise in dependency management and other things. You know, I think that, well, it's been, I think it's been used pretty successfully in some of the FDA submissions. So I know that it's great for what it does. I think that what my advice always is for newcomers is that it's still worthwhile to learn how to write R packages. It's still worthwhile to learn how to develop Shiny applications. And then once you get to a point where you want to take on something like Rhino, having that framework to kind of compare and contrast will just make it easier to learn that.

So yeah, I think getting into Shiny, I think there's never been a better time to get into Shiny development because it seems like I see more and more. In fact, just this morning before the call, I was deep into this. Let me just do a shameless plug for this amazing newsletter from the RDM Weekly on Substack. It's a great newsletter and they had this fantastic app that is a readme builder. So let me just plug everybody else's tools here. Yeah, there's a readme builder Shiny app that I was just really fascinated by and was trying to resist the temptation to dig into before I talked because I didn't want to have a bunch of distractions.

But yeah, so the Shiny app space seems like it's getting larger every day. And Shiny for Python, like I said, I think is worth learning. I know that there's two syntaxes there. So it probably is personal preference on which one you want to learn. But yeah, it's equally as I would say impressive and probably will follow the same kind of adoption trend.

Okay. We're going to get a link in there to the RDM Weekly from Crystal Lewis. That's who it was. Crystal is amazing. And I wanted to hop in and just talk about what Shiny is for a second just in case anybody has not used it. I saw some questions that were kind of filtering through the chat about Shiny. And the questions were tending towards like, oh, it sounds like Shiny app development is more software engineering. But writing a Shiny app, you can write it in R. And that's sort of why it exists, right? So that if you are a data scientist, a data analyst, and you do most of your work in R, you don't have to learn JavaScript and you don't have to learn HTML in order to create something and deploy it that lets you have an interactive app for data science or data analytics purposes, which is amazing. It makes web development, web app development, much more accessible for somebody who writes mostly in R or mostly in Python as well. There's Shiny for Python.

And Shiny for Python has two different versions because there's an express version. And then there's a regular version, which is a lot more like the R syntax. So definitely go explore Shiny because we are making it sound like it's really, really heavy software engineering. But you can make a Shiny app in a few lines of code and have it run and kind of dip your toes in the water. It's really, really amazing and fun. So I put a link to Shiny, but you can Google Shiny apps and get to all kinds of information, including some really easy like get started a few lines of code, you'll be good to go. And there's a wonderful community online of Shiny people who would be happy to help you and answer your questions.

I'm more of a fan of good enough than I am of expertise. I think that good enough, meaning good enough knowledge to get work done, to get where you need to be, is so much more valuable than being an expert in something.

Code quality and maintainability

Data analysts are increasingly being asked to use R and Shiny. Many can write code that works in the short term, but struggle to build code that stays reliable and maintainable as projects become more complex. So what advice would you give analysts who want to improve their code quality as they grow?

Yeah, I think this is just part of the nature of open source is that it is always fun to use the developer written packages from GitHub and that kind of stuff. And I mean, I've been just as frustrated as anybody when like your dplyr code suddenly doesn't work because of some change in the syntax. This is kind of the nature of open source tooling that these things evolve and change over time. And so for stability, base R is more your friend than you realize. I think people underestimate how important base R is to doing a lot of, I would say just regular programming. I haven't had many problems with Shiny itself in terms of being unstable. They're really good about backwards compatibility. But I always explain to developers that I'm either supervising or working with that the more you go out on that edge of this is under active development, the further you push that boundary, it comes at a cost of probably additional maintenance. And I think that maintenance is actually, it used to be that data wrangling was the thing that nobody wanted to talk about for statisticians. Like how much time do you actually spend cleaning the data versus like analyzing the data? And it was like an 80-20. But nobody likes, I think the term data wrangler actually sounds pretty cool, but nobody wanted to go, everybody wanted to be a data scientist.

But no, I think that the maintenance of the applications that are developed in open source is just a part of the cost of doing business. And I don't think we price that in enough. I think that it should be part of an expectation that if you're using bleeding edge, cutting edge stuff, it's like building a Ferrari. It's going to perform really well, but you're going to have to provide a lot of maintenance to make sure it performs. You can scale it back quite a bit and use only CRAN packages and increasingly depend on base R for stability if that is your primary concern. Those are trade-offs you have to accept with open source tools.

I will add to this by saying the hardest time I've ever had maintaining a Shiny app, because I have been a Shiny developer actually for a short time and I've done freelance work as a Shiny developer. The hardest job I ever had was when a Shiny app was built using non-CRAN packages that were not versioned and were just living on the GitHub repo of some random person who made a package one time and then never touched it again. So my plea to the universe is, if you are building a production-grade Shiny app, you're deploying it for somebody, you're doing something for freelance work or otherwise, please, please, please try to stick to vetted and versioned packages that are on CRAN and that will stick around. Otherwise, people are not going to be able to restore or build back to your version, especially if they're built on renv.

No, that's exactly, yeah, that's really what I was getting at, was just that there are really great developer-built packages that are adding, I would say, quality and features to your application that I would say are essential, but because they're not on CRAN, they can also be one of those things that maybe contact that author, see what their plan is for that package. I don't know how many of them, but I do think that, you know, if you said, hey, I want to make sure this lasts in the future, I can't imagine a developer saying, no, by all means, don't commit my package to CRAN. You know, don't. You know, become a co-author of that package if it's valuable to you and make sure it ends up on CRAN.

Your problem is solved. Like Martin said, like, base R is so much more powerful than you think it is, and having fewer dependencies because you did something in base R instead of depending on a package to do something can be really, really powerful. You can cut down your dependencies. Yeah. I've definitely been guilty of, like, looking at a package and being like, oh, I can do that in base R. I'm just going to replace that function with a little bit of base R, and I still get the functionality without needing a dependency.

Bridging the gap between Shiny developers and IT

Okay, well, we have time for one more question because we have about six minutes left. Mike Smith, I believe you had a question. Okay, I just put it in the chat for everybody. It says, the Venn diagram of DS, data science programmer, Shiny app developer, and IT org app developer has a lot of overlap, but do you think there are any practices from IT org developer that we could learn from a data science programmer, Shiny app developer perspective? This is a great question for Martin because he actually spent years as a Posit admin, sort of in a behind-the-scenes role, but was still developing Shiny apps from that direction. So you've dealt with this from both sides.

Yeah, I will say I was frustrated because, as a Shiny app developer, I built something and then I'd run into that wall of can't get into production, can't deploy it, right? So the DevOps kind of wall. And then I just decided, you know what? I wonder what the problem is. So then I went over to the Posit system admin side and realized it's much more complicated than I thought it was. Like most things. So I think that the biggest solution to this is exactly what Libby said, and that's understanding more of what IT does and really asking them to go through, to the degree that they will let you. Some organizations have a very high wall between IT and developers and others are able to get in a call and talk through, but understanding how these services like Connect and Workbench and Package Manager are set up in your organization, understanding how they're configured, understanding what maintenance of the services looks like.

You know, one of the first things that I did as a Posit system admin, because I'm a Shiny app developer, was like, all right, obviously a huge part of this job is looking at log files. It's just a ton of like, you know, grepping on these log files. And Shiny has this amazing reactive file reader, so I just built a really simple Shiny application where I could select the log file from the UI and just spit the log into the app. And then I could search for that instead of having to open the terminal. Right. So it's, and believe me, that's one of those things that like, unless they're really diehard, some people are diehard terminal users and you're never going to change their mind. But, you know, if you can see what IT deals with and what they do, I think a lot of times and approach it from a, hey, I just want to understand, not, you know, selfishly, I want to understand what the problem is so I can get my stuff deployed, but just understand the difficulties that they deal with from, you know, a services and architecture standpoint.
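The log-viewer idea described here can be sketched outside of Shiny. The snippet below is a minimal, standard-library-only Python illustration, not the app Martin built and not Shiny's reactive file reader itself: it re-reads a file only when its modification time has changed (a simple version of the same polling idea) and filters the lines by a search term. The names `PolledLogFile` and `search_lines` are hypothetical.

```python
import os
from typing import List, Optional


class PolledLogFile:
    """Re-read a log file only when its modification time has changed."""

    def __init__(self, path: str) -> None:
        self.path = path
        self._last_mtime: Optional[int] = None
        self._lines: List[str] = []

    def read(self) -> List[str]:
        # Compare the file's current mtime with the one seen on the last read.
        mtime = os.stat(self.path).st_mtime_ns
        if mtime != self._last_mtime:
            with open(self.path, "r", encoding="utf-8", errors="replace") as fh:
                self._lines = fh.read().splitlines()
            self._last_mtime = mtime
        return self._lines


def search_lines(lines: List[str], term: str) -> List[str]:
    """Return the lines containing `term`, ignoring case (a simple grep)."""
    needle = term.lower()
    return [line for line in lines if needle in line.lower()]
```

In an actual Shiny app the polling would be handled by the framework (in R, `reactiveFileReader()`), and the filtered lines would feed a text output driven by a search box in the UI.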

It'll help you when you're deploying to think about like, okay, what, what resources is this going to take up in terms of, you know, what's on the server, you know, what, what additional kind of like firewall issues, what networking issues might I run into with this application that, you know, IT is going to say no, you know, say no to or need approval for. You know, I think that just a little bit of understanding of the sysadmin role and responsibilities makes you a much better app developer just in terms of like that final 10% of getting it into production.

Yeah. And then, you know, I mean, of course, like taking skills back and forth. Yeah, I definitely have a different, a different understanding of deploying applications after having worked as a system admin and the dependencies and stuff that are required, you know, from a, you know, command line dependencies that they need to install so that I can use these R packages because the, you know, the tools are required on the server itself long before the R package will be able to access it.
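One way to act on this before deploying is to check that the command-line tools an app depends on are actually present on the server. What follows is a minimal sketch in Python using only the standard library; the function name `missing_tools` and the tool names in the usage example are placeholders, not requirements of any particular R package.

```python
import shutil
from typing import Iterable, List


def missing_tools(tools: Iterable[str]) -> List[str]:
    """Return the tools from `tools` that are not found on the PATH."""
    # shutil.which returns None when no executable with that name is found.
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    # Placeholder tool names; replace with whatever your app actually needs.
    required = ["git", "pandoc"]
    missing = missing_tools(required)
    if missing:
        print("Ask IT to install:", ", ".join(missing))
    else:
        print("All required tools are available.")
```

This only checks for executables on the PATH. System libraries that a package links against would need a different check, which is one more reason to talk it through with whoever administers the server.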

Yeah. It helps you with the, why can't you just problem because, because nobody wants to hear, why can't you just X, Y, Z, because you are not seeing clearly their struggles. I have a, yeah, I have a basic rule about not ever asking. So anybody I work with, I just avoid a just question because I assume everyone I work with, all of the just questions have been tried and answered, right? So if I find myself thinking, why don't you just, I immediately, they've tried that. They've already done that. If it was a just question, it would have been tried, right? So you can come at it with curiosity from a different angle and learn instead of like accusing.

Well, we have reached one minute to the top of the hour. So I feel like we must say goodbye. This was very, very fun. There are so many questions that didn't get answered in Slido. They're so good. There's one from Adam and one from Jackie that were both sort of about like testing, which I think were really great. Martin, I will get you the unanswered questions afterwards so that you can see them and, you know, you can feel free to answer them or not if you want in Slido, but this was really fun. I hope you had a good time. Thank you so much for coming. Of course. Yeah. And I will, yeah, I will answer them either in

Featured software

btw

Shiny for Python

Shiny for R